Research Article | OPEN ACCESS
Apriori Association Rule Algorithms using VMware Environment
1R. Sumithra, 2Sujni Paul and 3D. Ponmary Pushpa Latha
1School of Computer Science, CMS College, Coimbatore, India
2Department of MCA, OXFORD College of Engineering, Bangalore, India
3Department of Computer Applications, Karunya University, Coimbatore, India
Research Journal of Applied Sciences, Engineering and Technology 2014 2:160-166
Received: January 20, 2014 | Accepted: February 15, 2014 | Published: July 10, 2014
Abstract
The aim of this study is to carry out a research in distributed data mining using cloud platform. Distributed Data mining becomes a vital component of big data analytics due to the development of network and distributed technology. Map-reduce hadoop framework is a very familiar concept in big data analytics. Association rule algorithm is one of the popular data mining techniques which finds the relationships between different transactions. A work has been executed using weighted apriori and hash T apriori algorithms for association rule mining on a map reduce hadoop framework using a retail data set of transactions. This study describes the above concepts, explains the experiment carried out with retail data set on a VMW are environment and compares the performances of weighted apriori and hash-T apriori algorithms in terms of memory and time.
Keywords:
Association rules, cloud mining, hadoop , hash-T , map-reduce, W-apriori,
References
-
Agrawal, R. and R. Srikant, 1994. Fast algorithms for mining association rules. Proceedings of the 20th VLDB Conference (VLDB '94). Santiago, Chile, pp: 487-499.
CrossRef PMid:26298488
-
Grudzinski, P. and M. Wojciechowski, 2009. Integration of candidate hash trees in concurrent processing of frequent itemset queries using apriori. Control Cybern., 38(1).
-
Kambatla, K., A. Pathak and H. Pucha, 2009. Towards optimizing hadoop provisioning in the cloud. Proceedings of the 2009 Conference on Hot Topics in Cloud Computing (HotCloud'09), Article No. 22.
-
Li, J., P. Roy, S.U. Khan, L. Wang and Y. Bai, 2012. Data mining using clouds: An experimental implementation of apriori over mapreduce. Proceeding of the 12th IEEE International Conference on Salable Computing and Communication (ScalCom, 2102). Changzhou, China.
-
Paul, S. and V. Saravanan, 2008. Hash partitioned apriori in parallel and distributed data mining environment with dynamic data allocation approach. Proceeding of International Conference on Computer Science and Information Technology (ICCSIT '08), pp: 481-485.
CrossRef
-
Sun, K. and F. Bai, 2008. Mining weighted association rules without reassigned weights. IEEE T. Knowl. Data En., 20(4): 489-495.
CrossRef
-
Wang, W., J. Yang and P.S. Yu, 2000. Efficient mining of weighted association rules. Proceeding of ACM KDD 2000. Boston, MA, USA, pp: 270-274.
CrossRef
-
Yang, X.Y., L. Zhen and F. Yan, 2010. MapReduce as a programming model for association rules algorithm on hadoop. Proceeding of 3rd IEEE International Conference on Information Sciences and Interaction Sciences, pp: 99-102.
CrossRef
Competing interests
The authors have no competing interests.
Open Access Policy
This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
Copyright
The authors have no competing interests.
|
|
|
ISSN (Online): 2040-7467
ISSN (Print): 2040-7459 |
|
Information |
|
|
|
Sales & Services |
|
|
|