Home           Contact us           FAQs           
     Journal Home     |     Aim & Scope    |    Author(s) Information      |     Editorial Board     |     MSP Download Statistics
2017 (Vol. 14, Issue: 2)
Research Article

Unsupervised Discretization: An Analysis of Classification Approaches for Clinical Datasets

1M. Shanmugapriya, 1H.Khanna Nehemiah, 1R.S. Bhuvaneswaran, 2Kannan Arputharaj and 1J. Jabez Christopher
1Ramanujan Computing Centre
2Department of Information Science and Technology, Anna University, Chennai-600025, India

DOI: 10.19026/rjaset.14.3991
Submitted Accepted Published
June ‎28, ‎2016 August ‎9, ‎2016 February 15, 2017

  How to Cite this Article:

1M. Shanmugapriya, 1H.Khanna Nehemiah, 1R.S. Bhuvaneswaran, 2Kannan Arputharaj and 1J. Jabez Christopher, 2017. Unsupervised Discretization: An Analysis of Classification Approaches for Clinical Datasets.  Research Journal of Applied Sciences, Engineering and Technology, 14(2): 67-72.

DOI: 10.19026/rjaset.14.3991

URL: http://www.maxwellsci.com/jp/mspabstract.php?jid=RJASET&doi=rjaset.14.3991


Discretization is a frequently used data preprocessing technique for enhancing the performance of data mining tasks in knowledge discovery from clinical data. It is used to transform the real-world quantitative data into qualitative data. The aim of this study is to present an experimental analysis of the variation in performance of two trivial unsupervised discretization methods with respect to different classification approaches. Equal width discretization and equal frequency discretization methods are applied for four benchmark clinical datasets obtained from the University of California, Irvine, machine learning repository. Both the methods were applied for transforming quantitative attributes into qualitative attributes with three, five, seven and ten intervals. Six classification approaches were evaluated using four evaluation measures. From the results of this experimental analysis, it can be observed that there is a variation in the performance of classification algorithms. Accuracy of classification varies with respect to the discretization method used and also with respect to the number of intervals of discretization. Moreover it can be inferred that different classification approaches require different discretization methods. No method can be deemed to be ‘the best-suitable’ for all applications; hence the choice of an appropriate discretization method depends on data distribution, data interpretability, correlation, classification performance and domain of application.

Abstract PDF HTML

    Competing interests

The authors have no competing interests.
    Open Access Policy

This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.


© The Author(s) 2017

ISSN (Online):  2040-7467
ISSN (Print):   2040-7459
Submit Manuscript
   Current Information
   Sales & Services
Home  |  Contact us  |  About us  |  Privacy Policy
Copyright © 2015. MAXWELL Scientific Publication Corp., All rights reserved