Home            Contact us            FAQs
    
      Journal Home      |      Aim & Scope     |     Author(s) Information      |      Editorial Board      |      MSP Download Statistics

     Research Journal of Applied Sciences, Engineering and Technology


Framework for Evaluating Feature Selection in Opinion Mining

1E.A. Neeba and 2S. Koteeswaran
1Department of Information Technology, Rajagiri School of Engineering and Technology, Kochi
2Department of Computer Science Engineering, Vel Tech Dr.RR & Dr.SR Technical University, India
Research Journal of Applied Sciences, Engineering and Technology  2016  3:202-208
http://dx.doi.org/10.19026/rjaset.13.2931  |  © The Author(s) 2016
Received: September ‎26, ‎2015  |  Accepted: October ‎30, ‎2015  |  Published: August 05, 2016

Abstract

Opinion mining is important in text mining applications in brand and product positioning, consumer attitude detection, customer relationship management and market research. Applications result in new generation companies, products for reputation management, online market perception and online content monitoring. Web expansion encourages users to contribute or express opinions through blogs, videos and social networking sites which provide information for sentiment analysis regarding a product or service. This study investigates various feature extraction methods performance and opinion mining classification algorithm. Evaluation is through the use of opinions from amazon.com with product reviews. Features extraction is from opinions using Term Document Frequency and Inverse Document Frequency (TDF×IDF). Feature transformation is through Principal Component Analysis (PCA) and kernel PCA. Feature selection techniques like Information Gain (IG), Mutual Information (MI) and Fisher Score select features. Extracted features are classified by Naïve Bayes, k Nearest Neighbour and Classification and Regression Trees (CART) classification algorithms.

Keywords:

Fisher Score (FS), Information Gain (IG), Mutual Information (MI), Naive Bayes, opinion mining, Principal Component Analysis (PCA) , TDF x IDFF,


References

  1. Bollegala, D., D. Weir and J. Carroll, 2013. Cross-domain sentiment classification using a sentiment sensitive thesaurus. IEEE T. Knowl. Data En., 25(8): 1719-1731.
  2. Buche, A., D. Chandak and A. Zadgaonkar, 2013. Opinion mining and analysis: A survey. Int. J. Nat. Lang. Comput., 2(3): 39-48.
  3. Chen, J., Y. Liu, G. Zhang, Y. Cai, T. Wang and H. Min, 2013. Sentiment analysis for cantonese opinion mining. Proceeding of the 4th International Conference on Emerging Intelligent Data and Web Technologies (EIDWT, 2013). Xi'an, pp: 496-500.
  4. Cho, K.S., J.S. Ryu, J.H. Jeong, Y.H. Kim and U.M. Kim, 2010. Credibility evaluation and results with leader-weight in opinion mining. Proceeding of the International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery (CyberC). Huangshan, pp: 5-8.
  5. Ding, J., Z. Le, P. Zhou, G. Wang and W. Shu, 2009. An opinion-tree based flexible opinion mining model. Proceeding of the International Conference on Web Information Systems and Mining (WISM, 2009). Shanghai, pp: 149-152.
  6. Ding, X., B. Liu and P.S. Yu, 2008. A holistic lexicon-based approach to opinion mining. Proceeding of the International Conference on Web Search and Data Mining (WSDM'08), pp: 231-240.
  7. Duan, J. and J. Zeng, 2013. Mining opinion and sentiment for stock return prediction based on web-forum messages. Proceeding of the 10th International Conference on Fuzzy Systems and Knowledge Discovery (FSKD, 2013), pp: 984-988.
  8. Isabella, J. and R.M. Suresh, 2013. Analysis and evaluation of feature selectors in opinion mining. Indian J. Comput. Sci. Eng., 3(6): 757-762.
  9. Khan, F.H., S. Bashir and U. Qamar, 2014. TOM: Twitter opinion mining framework using hybrid classification scheme. Decis. Support Syst., 57: 245-257.
    CrossRef    Direct Link
  10. Liu, L., Z. Lv and H. Wang, 2012. Opinion mining based on feature-level. Proceeding of the 5th International Congress on Image and Signal Processing (CISP, 2012). Chongqing, pp: 1596-1600.
  11. Maas, A.L., R.E. Daly, P.T. Pham, D. Huang, A.Y. Ng and C. Potts, 2011. Learning word vectors for sentiment analysis. Proceeding of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, 1: 142-150.
  12. Melville, P., W. Gryc and R.D. Lawrence, 2009. Sentiment analysis of blogs by combining lexical knowledge with text classification. Proceeding of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp: 1275-1284.
  13. Pak, A. and P. Paroubek, 2010. Twitter as a corpus for sentiment analysis and opinion mining. Proceeding of the 7th International Conference on Language Resources and Evaluation (LREC'10), pp: 1320-1326.
  14. Pe-alver-Martinez, I., F. Garcia-Sanchez, R. Valencia-Garcia, M.Á. Rodríguez-García, V. Moreno, A. Fraga and J.L. Sánchez-Cervantes, 2014. Feature-based opinion mining through ontologies. Expert Syst. Appl., 41(13): 5995-6008.
  15. Vinodhini, G. and R.M. Chandrasekaran, 2012. Sentiment analysis and opinion mining: A survey. Int. J. Adv. Res. Comput. Sci. Softw. Eng., 2(6).
  16. Vinodhini, G. and R.M. Chandrasekaran, 2014. Measuring the quality of hybrid opinion mining model for e-commerce application. Measurement, 55: 101-109.
    CrossRef    Direct Link
  17. Wu, H. and X. Gu, 2014. Reducing over-weighting in supervised term weighting for sentiment analysis. Proceeding of COLING 2014, 25th International Conference on Computational Linguistics: Technical Papers, Dublin, Ireland, pp: 1322-1330.
  18. Xia, R., C. Zong and S. Li, 2011. Ensemble of feature sets and classification algorithms for sentiment classification. Inform. Sciences, 181(6): 1138-1152.
    CrossRef    Direct Link
  19. Zhang, Z., Q. Ye, Z. Zhang and Y. Li, 2011. Sentiment classification of Internet restaurant reviews written in Cantonese. Expert Syst. Appl., 38(6): 7674-7682.
    CrossRef    Direct Link

Competing interests

The authors have no competing interests.

Open Access Policy

This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

Copyright

The authors have no competing interests.

ISSN (Online):  2040-7467
ISSN (Print):   2040-7459
Submit Manuscript
   Information
   Sales & Services
Home   |  Contact us   |  About us   |  Privacy Policy
Copyright © 2024. MAXWELL Scientific Publication Corp., All rights reserved