Research Article | OPEN ACCESS
An Improved Web Log Mining and Online Navigational Pattern Prediction
1D. Anandhi and 2M.S. Irfan Ahmed
1Department of Computer Technology and Applications, Coimbatore Institute of Technology
2Department of Computer Applications, Sri Krishna College of Engineering and Technology, Coimbatore-641008, India
Research Journal of Applied Sciences, Engineering and Technology 2014 12:1472-1479
Received: August 01, 2014 | Accepted: September 22, 2014 | Published: September 25, 2014
Abstract
The aim of this study is to improve web log mining and online navigation pattern prediction. Web mining is an active and wide area which incorporates several usages for the web site design, providing personalization server and other business making decisions etc. Efficient web log mining results and online navigational pattern prediction is a tough process due to vast development in web. It includes the process such as data cleaning, session identification and clustering of web logs generally. In this study initially the web log data is preprocessed and sessions are identified using refined time-out based heuristic for session identification. Then for pattern discovery a density based clustering algorithm is used. Finally for online navigation pattern prediction a new technique of SVM classification is used, which rectifies time complexity with increased prediction accuracy.
Keywords:
DBSCAN , optics , support vector machine , web mining,
References
-
Berendt, B., B. Mobasher, M. Nakagawa and M. Spiliopoulou, 2003. The impact of site structure and user environment on session reconstruction in web usage analysis. In: Masand, B., M. Spiliopoulou, J. Srivastava and O.R. Zaiane (Eds.), WEBKDD 2002 Web Mining for Usage Patterns and User Profiles. LNAI 2703, Springer-Verlag, Berlin, Heidelberg, pp: 159-179.
CrossRef -
Borges, J. and M. Levene, 1999. Data mining of user navigation patterns. Proceeding of Revised Papers from the International Workshop on Web Usage Analysis and User Profiling (WEBKDD '99), pp: 31-39.
-
Cooley, R., B. Mobasher and J. Srivastava, 1999. Data preparation for mining World Wide Web browsing patterns. Knowl. Inf. Syst., 1(1): 5-32.
CrossRef -
Etzioni, O., 1996. The World Wide Web: Quagmire or gold mine. Commun. ACM, 39(11): 65-68.
CrossRef -
Facca, F.M. and P.L. Lanzi, 2003. Recent developments in Web Usage Mining research. In: Kambayashi, Y., M. Mohania and W. Wob (Eds.), DaWaK 2003. LNCS 2737, Springer-Verlag, Berlin, Heidelberg, pp: 140-150.
CrossRef -
Guerbas, A., O. Addam, O. Zaarour, M. Nagi, A. Elhajj and M. Ridley, 2013. Effective web log mining and online navigational pattern prediction. Knowl-Based Syst., 49: 50-62.
CrossRef -
Huang, Y.F. and J.M. Hsu, 2008. Mining web logs to improve hit ratios of prefetching and caching. Knowl-Based Syst., 21(1): 62-69.
CrossRef -
Huang, Y.M., Y.H. Kuo, J.N. Chen and Y.L. Jeng, 2006. NP-miner: A real-time recommendation algorithm by using web usage mining. Knowl-Based Syst., 19(4): 272-286.
CrossRef -
Monreale, A., F. Pinelli, R. Trasarti and F. Giannotti, 2009. WhereNext: A location predictor on trajectory pattern mining. Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD' 2009), pp: 637-646.
CrossRef -
Pabarskaite, Z. and A. Raudys, 2007. A process of knowledge discovery from web log data: Systematization and critical review. J. Intell. Inf. Syst., 28: 79-104.
CrossRef -
Reddy, B.G.O. and M. Ussenaiah, 2012. Literature survey on clustering techniques. IOSR J. Comput. Eng., 3(1): 01-12.
-
Spiliopoulou, M., 1999. Data mining for the Web. Proceeding of 3rd European Conference on Principles of Data Mining and Knowledge Discovery (PKDD'99), pp: 588-589.
CrossRef -
Spiliopoulou, M., 2000. Web Usage Mining for Web site evaluation. Commun. ACM, 43(8): 127-134.
CrossRef -
Spiliopoulou, M., L.C. Faulstich and K. Winkler, 1999. A data miner analyzing the navigational behaviour of web users. Proceeding of the Workshop on Machine Learning in User Modelling of the ACAI99.
-
Vapnik, V.N., 1995. The Nature of Statistical Learning Theory. Springer-Verlag, New York, Inc., New York, ISBN: 0-387-94559-8.
CrossRef PMid:8555380 -
Viswanath, P. and V.S. Babu, 2009. Rough-DBSCAN: A fast hybrid density based clustering method for large data sets. Pattern Recogn. Lett., 30(16): 1477-1488.
CrossRef -
Xing, F. and P. Guo, 2004. Classification of stellar spectral data using SVM. Proceeding of International Symposium on Neural Networks (ISNN'2004). LNCS 3173, Springer-Verlag, Berlin, Heidelberger, pp: 616-621.
CrossRef -
Yavas, G., D. Katsaros, Ö. Ulusoy and Y. Manolopoulos, 2005. A data mining approach for location prediction in mobile environments. Data Knowl. Eng., 54(2):121-146.
CrossRef -
Zheng, Y., L. Zhang, X. Xie and W.Y. Ma, 2009a. Mining correlation between locations using human location history. Proceeding of the 17th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems (GIS '09), pp: 472-475.
CrossRef -
Zheng, Y., L. Zhang, X. Xie and W.Y. Ma, 2009b. Mining interesting locations and travel sequences from GPS trajectories. Proceeding of the 18th International Conference on World Wide Web (WWW'2009), pp: 791-800.
CrossRef PMid:18849060
Competing interests
The authors have no competing interests.
Open Access Policy
This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
Copyright
The authors have no competing interests.
|
|
|
ISSN (Online): 2040-7467
ISSN (Print): 2040-7459 |
|
Information |
|
|
|
Sales & Services |
|
|
|