Research Article | OPEN ACCESS
Presentation Mining: An Overview of Information Extraction Systems
1Vinothini Kasinathan and 1, 2Aida Mustapha
1Faculty of Computer Science and Information Technology, Universiti Putra Malaysia, 43400 UPM Serdang, Selangor, Malaysia
2Faculty of Computer Science and Information Technology, Universiti Tun Hussein Onn Malaysia Parit Raja, 86400 Batu Pahat, Johor, Malaysia
Research Journal of Applied Sciences, Engineering and Technology 2015 3:308-314
Received: February 3, 2015 | Accepted: March 20, 2015 | Published: September 25, 2015
In education, scanning through endless slides in a PowerPoint presentation is highly ineffective, especially for Digital Natives, because of their multi-modal learning style. To cope with the high volume of information, which ranges from printed text to digital images, this study proposes a text mining approach that extracts keywords from a collection of presentation slides on a similar topic. The approach supports the existing architecture of presentation mapping, in which the extracted keywords are reconstructed visually as a visual knowledge display. To this end, the study gives a general discussion of the available text mining technologies and then focuses on different keyword extraction systems. Finally, it introduces presentation mining as the frontier method of this field.
Natural language processing, PowerPoint, text mining
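As a rough illustration of what extracting keywords from a collection of slide decks on a similar topic can look like, the sketch below ranks terms by a plain TF-IDF score. This is not the method of the study, which the abstract does not specify; TF-IDF is swapped in here only as a common baseline. The function names `tokenize` and `top_keywords`, the tiny stop-word list, and the assumption that each deck has already been flattened to a single string of slide text are all hypothetical.

```python
import math
import re
from collections import Counter

# Hypothetical stop-word list, kept tiny for illustration only.
STOP_WORDS = {"a", "an", "and", "the", "of", "to", "in", "is", "for", "on", "from"}


def tokenize(text):
    """Lowercase the text and keep alphabetic tokens that are not stop words."""
    return [t for t in re.findall(r"[a-z]+", text.lower()) if t not in STOP_WORDS]


def top_keywords(decks, k=3):
    """Return the k highest-scoring TF-IDF terms for each deck.

    `decks` is a list of strings; each string is assumed to hold the
    text of one slide deck (an assumption, not taken from the study).
    """
    tokenized = [tokenize(deck) for deck in decks]
    n_docs = len(tokenized)
    # Document frequency: number of decks in which each term appears.
    df = Counter(term for tokens in tokenized for term in set(tokens))
    results = []
    for tokens in tokenized:
        tf = Counter(tokens)
        total = len(tokens) or 1  # avoid division by zero for empty decks
        scores = {
            term: (count / total) * math.log(n_docs / df[term])
            for term, count in tf.items()
        }
        # Sort by descending score, then alphabetically for stable output.
        ranked = sorted(scores, key=lambda term: (-scores[term], term))
        results.append(ranked[:k])
    return results
```

Calling `top_keywords` on a list of deck texts returns one ranked keyword list per deck. Terms that occur in every deck receive a score of zero and therefore rank below terms that distinguish one deck from the others.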
Competing interests
The authors have no competing interests.
Open Access Policy
This article is distributed under the terms of the Creative Commons Attribution 4.0 International License, which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
ISSN (Online): 2040-7467
ISSN (Print): 2040-7459