Home            Contact us            FAQs
    
      Journal Home      |      Aim & Scope     |     Author(s) Information      |      Editorial Board      |      MSP Download Statistics

     Research Journal of Applied Sciences, Engineering and Technology


Analysis of Data Mining Dataset Using Fuzzy Based Unnested Select SQL Queries

1D. Veni and 2K.R. Chandran
1Anna University,
2Information Technology Department, PSG College of Technology, Coimbatore, Tamilnadu, India
Research Journal of Applied Sciences, Engineering and Technology   2015  3:267-273
http://dx.doi.org/10.19026/rjaset.11.1716  |  © The Author(s) 2015
Received: December ‎10, ‎2014  |  Accepted: February ‎5, ‎2015  |  Published: September 25, 2015

Abstract

The aim of this study is to improve the existing traditional databases. Some new techniques have been involved to handle the imprecise or uncertain information from the dataset. Dataset is prepared and used to analyze the data mining project which consume more time and need many complex queries. Nested query is predominant method to handle complex queries. Execution of a nested query may cause the heavy performance penalty. The main objective of this study is to reduce the heavy performance penalty of nested queries by using the unnested queries. The unnested queries produce the equivalent output as nested queries with minimum penalty and execution time. Success of unnested queries are examined using join algorithms. It is more efficient than the nested-loop algorithms which are used to evaluate the nested queries. In this study, unnested queries are used to analysis the data-mining project in dataset, we get the result from combining fuzzy set theory. In experimental results, we have shown that the performance of evaluating the unnesting techniques with extended merge-join and horizontal aggregations techniques CASE, SPJ and PIVOT in dataset. Thus, unnested queries improve the performance of execution and linear scalability.

Keywords:

Fuzzy queries, nested queries, PIVOT, set exclusive operator,


References

  1. Baldwin, J.F., 1983. A fuzzy relational inference language for expert systems. Proceeding of the 13th IEEE International Symposium Multiple-valued Logic, pp: 416-423.
  2. Codd, E.F., 1979. Extending the database relational model to capture more meaning. ACM T. Database Syst., 4(4): 397-434.
    CrossRef    
  3. Gray, J., A. Bosworth, A. Layman and H. Pirahesh, 1996. Data cube: A relational aggregation operator generalizing group-by, cross-tab and subtotals. Proceeding of the 12th International Conference on Data Engineering (ICDE, 1996), pp: 152-159.
    CrossRef    
  4. Kim, W., 1982. On optimizing an SQL-like nested query. ACM T. Database Syst., 7(3): 443-469.
    CrossRef    
  5. Lacroix, M. and A. Pirotte, 1976. Generalized joins. SIGMOD Rec., 8(3).
    CrossRef    
  6. Lohman, G.M., D. Daniels, L.M. Haas, R. Kistler and P.G. Selinger, 1984. Optimization of nested queries in a distributed relational database. Proceedings of the 10th International Conference on Very Large Data Bases (VLDB'84), pp: 403-415.
  7. Ordonez, C., 2004. Vertical and horizontal percentage aggregations. Proceeding of the ACM SIGMOD International Conference on Management of Data (SIGMOD'04), pp: 866-871.
    CrossRef    
  8. Ordonez, C., 2010. Statistical model computation with UDFs. IEEE T. Knowl. Data En., 22(12): 1752-1765.
    CrossRef    
  9. Ordonez, C. and Z. Chen, 2012. Horizontal aggregations in SQL to prepare data sets for data mining analysis. IEEE T. Knowl. Data En., 24(4):678-691.
    CrossRef    
  10. Qi, Y., Z. Weining, L. Chengwen, W. Jing, C. Yu et al., 2001. Efficient processing of nested fuzzy SQL queries in a fuzzy database. IEEE T. Knowl. Data En., 13(6).
  11. Rosenthal, A. and D.S. Reiner, 1984. Extending the algebraic framework of query processing to handle outerjoins. Proceeding of the 10th International Conference on Very Large Data Bases (VLDB'84), pp: 334-343.
  12. Wang, H., C. Zaniolo and C.R. Luo, 2003. ATLaS: A small but complete SQL extension for data mining and data streams. Proceeding of the 29th International Conference on Very Large Data Bases (VLDB'03), 29: 1113-1116.
    CrossRef    

Competing interests

The authors have no competing interests.

Open Access Policy

This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

Copyright

The authors have no competing interests.

ISSN (Online):  2040-7467
ISSN (Print):   2040-7459
Submit Manuscript
   Information
   Sales & Services
Home   |  Contact us   |  About us   |  Privacy Policy
Copyright © 2024. MAXWELL Scientific Publication Corp., All rights reserved