Research Article | OPEN ACCESS
An Efficient Sentence-based Sentiment Analysis for Expressive Text-to-speech using Fuzzy Neural Network
B. Sudhakar and R. Bensraj
Department of Electrical Engineering, Annamalai University, College Rd, Annamalai Nagar, Chidambaram, Tamil Nadu 608002, India
Research Journal of Applied Sciences, Engineering and Technology 2014 3:378-386
Received: March 14, 2014 | Accepted: May 04, 2014 | Published: July 15, 2014
Abstract
In recent years, speech processing has become an active research area in the field of signal processing due to the usage of automated systems for spoken language interface. In developed countries, the customer service with automated system in speech synthesis has been the recent trend. The existing automated speech synthesis systems have certain problems during the real time implementation such as lack of naturalness in output speech, lack of emotions and so on. In this study, the novel Text to Speech system is introduced along with the sentiment analysis in Tamil language. The input text is first classified into the positive, negative and neutral based on the emotions in the sentence then the text is converted into speech with emotions during TTS conversion. Existing approaches used neural network based classifiers for classification. But, neural networks have certain drawbacks in real time training. So, this research study uses Fuzzy Neural Network (FNN) to classify the sentence based on the emotions. The text to speech with sentiment analysis effective scheme which is evaluated using Doordarshan news Tamil dataset. The proposed scheme is implemented using MATLAB. This TTS system has several social applications, especially in railway stations where the announcements can be made through expressive speech.
Keywords:
Another Tool for Language Recognition (ANTLR), Natural Language Processor (NLP), Text to Speech (TTS),
References
-
Alexandre, T. and A. Francesc, 2009. Sentiment classi?cation in English from sentence-level annotations of emotions regarding models of affect. Proceeding of the 10th Annual Conference of the International Speech Communication Association (INTERSPEECH, 2009). Brighton, United Kingdom, pp: 516-519.
-
Alias, F., X. Sevillano, J.C. Socoro and X. Gonzalvo, 2008. Towards high-quality next-generation text-to-speech synthesis: A multidomain approach by automatic domain classification. IEEE T. Audio Speech, 16(7): 1340-1354.
CrossRef -
Alm, C.O., D. Roth and R. Sproat, 2005. Emotions from text: Machine learning for text-based emotion prediction. Proceedings of the Joint Conference on Human Language Technology and Empirical Methods in Natural Language Processing, pp: 579-586.
CrossRef -
Baggia, P., P. Bagshaw, M. Bodell, D.Z. Huang, L. Xiaoyan, S. McGlashan, J. Tao, Y. Jun, H. Fang, Y. Kang, H. Meng, W. Xia, X. Hairong and Z. Wu, 2010. Speech Synthesis Markup Language (SSML) Version 1.1, W3C, Tech.
Direct Link -
Baskaran, S. and K. Vijay-Shanker, 2003. Influence of morphology in word sense disambiguation for Tamil. Proceeding of the International Conference on Natural Language Processing, Mysore.
-
Bellegarda, J.R., 2011. A data-driven affective analysis framework toward naturally expressive speech synthesis. IEEE T. Audio Speech, 19(5): 1113-1122.
CrossRef -
Bing, L., 2012. Sentiment Analysis and Opinion Mining. Morgan and Claypool Publishers, San Rafael, California.
-
Bradley, M.M. and P.J. Lang, 1999. Affective Norms for English Words (ANEW): Instruction manual and affective ratings. Technical Report C-1, Center for Research in Psychophysiology, University of Florida, Gainesville, FL, pp: 1-45.
-
Chalamandaris, A., S. Karabetsos, P. Tsiakoulis and S. Raptis, 2010. A unit selection text-to-speech synthesis system optimized for use with screen readers. IEEE T. Consum. Electr., 56(3): 1890-1897.
CrossRef -
Chen, Y.C. and C.C. Teng, 1995. A model reference control structure using a fuzzy neural network. Fuzzy Set. Syst., 73: 291-312.
CrossRef -
Dang, Y., Y. Zhang and H. Chen, 2010. A lexicon-enhanced method for sentiment classification: An experiment on online product reviews. IEEE Intell. Syst., 25(4): 46-53.
CrossRef -
Dipasree, P., M. Mandar and D. Kalyankumar, 2013. Query expansion using term distribution and term association. Inform. Retrieval, 1: 1-19.
-
Francisco, V., R. Hervás, F. Peinado and P. Gervs, 2011. EmoTales: Creating a corpus of folk tales with emotional annotations. Lang. Resour. Eval., 45: 1-41.
-
Hu, M. and L. Bing, 2004. Mining and summarizing customer reviews. Proceedings of the 10th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD, 2004), pp: 168-177.
CrossRef -
Kosko, B., 1992. Neural Networks and Fuzzy Systems: A Dynamical Systems Approach to Machine Intelligence. Prentice Hall, Englewood Cliffs, NJ.
-
Lin, F.J., W.J. Hwang and R.J. Wai, 1999. A supervisory fuzzy neural network control system for tracking periodic inputs. IEEE T. Fuzzy Syst., 7(1): 41-52.
CrossRef -
Manning, C.D., P. Raghavan and H. Schütze, 2008. Introduction to Information Retrieval. Cambridge University Press, Cambridge, MA.
CrossRef -
Pang, B. and L. Lee, 2008. Opinion mining and sentiment analysis. Found. Trends Inf. Retrieval, 2(1/2): 1-135.
CrossRef -
Pang, B., L. Lee and S. Vaithyanathan, 2002. Thumbs up? Sentiment classification using machine learning techniques. Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP, 2002), pp: 79-86.
-
Rudy, P. and T. Mike, 2009. Sentiment analysis: A combined approach. J. Informetr., 3(2): 143-157.
CrossRef -
Schröder, M., H. Pirker, M. Lamolle, F. Burkhardt, C. Peter and E. Zovato, 2011. Representing Emotions and Related States in Technological Systems. In: Petta, P., R. Cowie and C. Pelachaud (Eds.), Emotion-Oriented Systems-the Humaine Handbook. Cognitive Technologies, Springer, Heidelberg, pp: 367-386.
CrossRef -
Sebastiani, F., 2002. Machine learning in automated text categorization. ACM Comput. Surv., 34: 1-47.
CrossRef -
Sebastiani, F., 2005. Text Categorization. In: Zanasi, A. (Ed.), Text Mining and its Applications to Intelligence, CRM and Knowledge Management. WIT Press, Southampton, UK, pp: 109-129.
CrossRef -
Tiomkin, S., D. Malah and S. Shechtman, 2010. Statistical text-to-speech synthesis based on segment-wise representation with a norm constraint. IEEE T. Audio Speech, 18(5): 1077-1082.
CrossRef -
Trilla, A., F. Alías and I. Lozano, 2010. Text classification of domain-styled text and sentiment-styled text for expressive speech synthesis. Proceeding of the 6th Jornadas en Tecnología del Habla (FALA, 2010), pp: 75-78.
-
Turney, P.D., 2002. Thumbs up or thumbs down?: Semantic orientation applied to unsupervised classification of reviews. Proceeding of the 40th Annual Meeting of the Association for Computational Linguistics (ACL, 2002), pp: 417-424.
Competing interests
The authors have no competing interests.
Open Access Policy
This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
Copyright
The authors have no competing interests.
|
|
|
ISSN (Online): 2040-7467
ISSN (Print): 2040-7459 |
|
Information |
|
|
|
Sales & Services |
|
|
|