Research Article | OPEN ACCESS
Effective Sentiment Analysis for Opinion Mining Using Artificial Bee Colony Optimization
T.M. Saravanan and A. Tamilarasi
Department of Computer Applications, Kongu Engineering College, Perundurai, Erode,
Tamil Nadu 638052, India
Research Journal of Applied Sciences, Engineering and Technology 2016 8:828-840
Received: April 17, 2015 | Accepted: January 8, 2016 | Published: April 15, 2016
Abstract
Opinions play important role in the process of knowledge discovery or information retrieval and can be considered as a sub discipline of Data Mining. The huge quantity of information on web platforms put together feasible for exercise as data sources, in applications based on opinion mining and classification. An effective sentiment analysis process proposes in this research for mining and classifying the opinions. The phases of the proposed research are: (1) Data Pre-processing Phase (2) Potential Feature Extraction Phase (3) Opinion Extraction and Mining Phase and (4) Opinion Classification Phase. Initially, the datasets from various web documents get preprocessed and gives as part-of-speech tagged text. An Improved High Adjective Count (IHAC) Algorithm employs on the Part-Of-Speech tagged text to extract the potential features. Improved High Adjective Count Algorithm effectively optimizes the scores of the nouns to extract the potential features. An Artificial Bee Colony (ABC) Algorithm works under the IHAC algorithm for providing opinion scores and also for giving ranks for every noun. Max Opinion Score Algorithm can be then helpful to extract the opinion words followed by the classification phase, in which, ID3 algorithm utilizes to classify the review into three kinds positive, negative and neutral based on the opinions. The implementation is carried out on Customer Review Datasets and Additional Review Datasets with the aid of JAVA platform and also the experimentation results are analyzed.
Keywords:
Artificial bee colony algorithm, ID3 algorithm, improved high adjective count algorithm, max opinion score algorithm, opinion mining,
References
-
Chang, C.H., M. Kayed, M.R. Girgis and K.F. Shaalan, 2006. A survey of web information extraction systems. IEEE T. Knowl. Data En., 18(10): 1411-1428.
CrossRef -
Etzioni, O., M. Cafarella, D. Downey, A.M. Popescu, T. Shaked, S. Soderland, D.S. Weld and A. Yates, 2005. Unsupervised named-entity extraction from the web: An experimental study. Artif. Intell., 165(1): 91-134.
CrossRef -
Hong, J.L., 2010. Deep web data extraction. Proceeding of the IEEE International Conference on Systems Man and Cybernetics (SMC, 2010). Istanbul, pp: 3420-3427.
-
Jain, A., S. Jain, P. Shukla and H. Bandiya, 2012. Towards automatic detection of sentiments in customer reviews. Int. J. Inform. Sci. Tech., 2(4): 103.
CrossRef -
Kamal, A. and M. Abulaish, 2013. Statistical features identification for sentiment analysis using machine learning techniques. Proceeding of the International Symposium on Computational and Business Intelligence (ISCBI, 2013). New Delhi, pp: 178-181.
CrossRef -
Liu, K., L. Xu and J. Zhao, 2012. Opinion target extraction using word-based translation model. Proceeding of the Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL '12), pp: 1346-1356.
-
Meena, A. and T.V. Prabhakar, 2007. Sentence level sentiment analysis in the presence of conjuncts using linguistic analysis. Proceeding of the 29th European Conference on IR Research (ECIR'07). Rome, Italy, pp: 573-580.
CrossRef -
Miao, Q., Q. Li and D. Zeng, 2010. Mining fine grained opinions by using probabilistic models and domain knowledge. Proceeding of the IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology (WI-IAT, 2010). Toronto, ON, 1: 358-365.
CrossRef -
Miller, G.A., 1995. WordNet: A lexical database for English. Commun. ACM, 38(11): 39-41.
CrossRef -
Pang, B. and L. Lee, 2008. Opinion mining and sentiment analysis. Found. Trend. Inf. Retriev., 2(1-2): 1-135.
CrossRef -
Reed, S.L. and D.B. Lenat, 2002. Mapping ontologies into cyc. Proceeding of the AAAI Conference 2002 Workshop on Ontologies for the Semantic Web. Edmonton, Canada, pp: 1-6.
-
Smeureanu, I. and C. Bucur, 2012. Applying supervised opinion mining techniques on online user reviews. Informatica Econ., 16(2): 81.
-
Varelas, G., E. Voutsakis, P. Raftopoulou, E.G.M. Petrakis and E.E. Milios, 2005. Semantic similarity methods in wordnet and their application to information retrieval on the web. Proceeding of the 7th Annual ACM International Workshop on Web Information and Data Management, pp: 10-16.
CrossRef -
Vu, T.T., H.T. Pham, C.T. Luu and Q.T. Ha, 2011. A Feature-based Opinion Mining Model on Product Reviews in Vietnamese. In: Katarzyniak, R. et al. (Eds.), Semantic Methods for Knowledge Management and Communication. Studies in Computational Intelligence, Springer-Verlag, Berlin, Heidelberg, 381: 23-33.
CrossRef -
Wang, H. and S. Wang, 2008. A knowledge management approach to data mining process for business intelligence. Ind. Manage. Data Syst., 108(5): 622-634.
CrossRef -
Wong, T.L. and W. Lam, 2009. An unsupervised method for joint information extraction and feature mining across different Web sites. Data Knowl. Eng., 68(1): 107-125.
CrossRef -
Zhai, Y. and B. Liu, 2006. Structured data extraction from the web based on partial tree alignment. IEEE T. Knowl. Data En., 18(12): 1614-1627.
CrossRef -
Zhai, Z., B. Liu, H. Xu and P. Jia, 2011. Constrained LDA for grouping product features in opinion mining. Proceeding of the 15th Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining (PAKDD'11), 1: 448-459.
CrossRef -
Zhang, L. and B. Liu, 2011. Identifying noun product features that imply opinions. Proceeding of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies: Short Papers, 2: 575-580.
Competing interests
The authors have no competing interests.
Open Access Policy
This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
Copyright
The authors have no competing interests.
|
|
|
ISSN (Online): 2040-7467
ISSN (Print): 2040-7459 |
|
Information |
|
|
|
Sales & Services |
|
|
|