Home            Contact us            FAQs
    
      Journal Home      |      Aim & Scope     |     Author(s) Information      |      Editorial Board      |      MSP Download Statistics

     Research Journal of Applied Sciences, Engineering and Technology


An Improved Web Log Mining and Online Navigational Pattern Prediction

1D. Anandhi and 2M.S. Irfan Ahmed
1Department of Computer Technology and Applications, Coimbatore Institute of Technology
2Department of Computer Applications, Sri Krishna College of Engineering and Technology, Coimbatore-641008, India
Research Journal of Applied Sciences, Engineering and Technology  2014  12:1472-1479
http://dx.doi.org/10.19026/rjaset.8.1124  |  © The Author(s) 2014
Received: August ‎01, ‎2014  |  Accepted: September ‎22, ‎2014  |  Published: September 25, 2014

Abstract

The aim of this study is to improve web log mining and online navigation pattern prediction. Web mining is an active and wide area which incorporates several usages for the web site design, providing personalization server and other business making decisions etc. Efficient web log mining results and online navigational pattern prediction is a tough process due to vast development in web. It includes the process such as data cleaning, session identification and clustering of web logs generally. In this study initially the web log data is preprocessed and sessions are identified using refined time-out based heuristic for session identification. Then for pattern discovery a density based clustering algorithm is used. Finally for online navigation pattern prediction a new technique of SVM classification is used, which rectifies time complexity with increased prediction accuracy.

Keywords:

DBSCAN , optics , support vector machine , web mining,


References

  1. Berendt, B., B. Mobasher, M. Nakagawa and M. Spiliopoulou, 2003. The impact of site structure and user environment on session reconstruction in web usage analysis. In: Masand, B., M. Spiliopoulou, J. Srivastava and O.R. Zaiane (Eds.), WEBKDD 2002 Web Mining for Usage Patterns and User Profiles. LNAI 2703, Springer-Verlag, Berlin, Heidelberg, pp: 159-179.
    CrossRef    
  2. Borges, J. and M. Levene, 1999. Data mining of user navigation patterns. Proceeding of Revised Papers from the International Workshop on Web Usage Analysis and User Profiling (WEBKDD '99), pp: 31-39.
  3. Cooley, R., B. Mobasher and J. Srivastava, 1999. Data preparation for mining World Wide Web browsing patterns. Knowl. Inf. Syst., 1(1): 5-32.
    CrossRef    
  4. Etzioni, O., 1996. The World Wide Web: Quagmire or gold mine. Commun. ACM, 39(11): 65-68.
    CrossRef    
  5. Facca, F.M. and P.L. Lanzi, 2003. Recent developments in Web Usage Mining research. In: Kambayashi, Y., M. Mohania and W. Wob (Eds.), DaWaK 2003. LNCS 2737, Springer-Verlag, Berlin, Heidelberg, pp: 140-150.
    CrossRef    
  6. Guerbas, A., O. Addam, O. Zaarour, M. Nagi, A. Elhajj and M. Ridley, 2013. Effective web log mining and online navigational pattern prediction. Knowl-Based Syst., 49: 50-62.
    CrossRef    
  7. Huang, Y.F. and J.M. Hsu, 2008. Mining web logs to improve hit ratios of prefetching and caching. Knowl-Based Syst., 21(1): 62-69.
    CrossRef    
  8. Huang, Y.M., Y.H. Kuo, J.N. Chen and Y.L. Jeng, 2006. NP-miner: A real-time recommendation algorithm by using web usage mining. Knowl-Based Syst., 19(4): 272-286.
    CrossRef    
  9. Monreale, A., F. Pinelli, R. Trasarti and F. Giannotti, 2009. WhereNext: A location predictor on trajectory pattern mining. Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD' 2009), pp: 637-646.
    CrossRef    
  10. Pabarskaite, Z. and A. Raudys, 2007. A process of knowledge discovery from web log data: Systematization and critical review. J. Intell. Inf. Syst., 28: 79-104.
    CrossRef    
  11. Reddy, B.G.O. and M. Ussenaiah, 2012. Literature survey on clustering techniques. IOSR J. Comput. Eng., 3(1): 01-12.
  12. Spiliopoulou, M., 1999. Data mining for the Web. Proceeding of 3rd European Conference on Principles of Data Mining and Knowledge Discovery (PKDD'99), pp: 588-589.
    CrossRef    
  13. Spiliopoulou, M., 2000. Web Usage Mining for Web site evaluation. Commun. ACM, 43(8): 127-134.
    CrossRef    
  14. Spiliopoulou, M., L.C. Faulstich and K. Winkler, 1999. A data miner analyzing the navigational behaviour of web users. Proceeding of the Workshop on Machine Learning in User Modelling of the ACAI99.
  15. Vapnik, V.N., 1995. The Nature of Statistical Learning Theory. Springer-Verlag, New York, Inc., New York, ISBN: 0-387-94559-8.
    CrossRef    PMid:8555380    
  16. Viswanath, P. and V.S. Babu, 2009. Rough-DBSCAN: A fast hybrid density based clustering method for large data sets. Pattern Recogn. Lett., 30(16): 1477-1488.
    CrossRef    
  17. Xing, F. and P. Guo, 2004. Classification of stellar spectral data using SVM. Proceeding of International Symposium on Neural Networks (ISNN'2004). LNCS 3173, Springer-Verlag, Berlin, Heidelberger, pp: 616-621.
    CrossRef    
  18. Yavas, G., D. Katsaros, Ö. Ulusoy and Y. Manolopoulos, 2005. A data mining approach for location prediction in mobile environments. Data Knowl. Eng., 54(2):121-146.
    CrossRef    
  19. Zheng, Y., L. Zhang, X. Xie and W.Y. Ma, 2009a. Mining correlation between locations using human location history. Proceeding of the 17th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems (GIS '09), pp: 472-475.
    CrossRef    
  20. Zheng, Y., L. Zhang, X. Xie and W.Y. Ma, 2009b. Mining interesting locations and travel sequences from GPS trajectories. Proceeding of the 18th International Conference on World Wide Web (WWW'2009), pp: 791-800.
    CrossRef    PMid:18849060    

Competing interests

The authors have no competing interests.

Open Access Policy

This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

Copyright

The authors have no competing interests.

ISSN (Online):  2040-7467
ISSN (Print):   2040-7459
Submit Manuscript
   Information
   Sales & Services
Home   |  Contact us   |  About us   |  Privacy Policy
Copyright © 2024. MAXWELL Scientific Publication Corp., All rights reserved