Cover Image

APPLICATION OF DECISION TREE CLASSIFIERS IN DIAGNOSING HEART DISEASE USING DEMOGRAPHIC DATA

*Dr P.Amirtharaj , Dr.Vaithiyanathan V, .

Abstract


The increase in sudden deaths due to heart disease is overwhelming. The decrease in the number of expert        doctors has lead to this problem to a large extent. In recent years, machine learning methods are widely used for            predictive analysis in Medical diagnosis. This paper proposes a medical decision support system for heart disease risk  classification using Decision Tree Classifier. Standard   Benchmark database of heart disease from University of  California, Irvine (UCI), literature from journals and expert discussions have contributed in designing the attribute set for Ischemic Heart disease for a demographic population in India. The data set is collected from cardio thoracic department, Madras Medical College, Chennai, India from January to April 2011. A total of 712 patients examined are classified into 4 classes using C 4.5 Decision Tree Classifier. Pre-processing steps filtered the data set to 662 patients. The accuracy of correctly classified instances is 82.33%. The same data set with 2 class output gives an accuracy of 92.56%. Whereas the accuracy of UCI Cleveland data set is 78.91%. . The main aim of is to develop more cost-effective and easy-to-use  systems, procedures and methods for supporting clinicians


Full Text:

PDF

References


http ://www. who. int/cardiovascular_diseases/en/

http://media.trb.com/media/acrobat/2012-01/285933360-30132728.pdf

http ://circ. ahaj ournals. org/content/125/ l/e2 .full

R. Detrano, A. Janosi, W. Steinbrunn, M. Pfisterer, J.J. Schmid, S. Sandhu, K.H.Guppy, S. Lee, and V. Froelicher. "International application of a new probability algorithm for the diagnosis of coronary artery disease". American Journal of Cardiology, Vol. 64, 1989, pp. 304-310

G. John. "Models if incremental concept formation". Journal of Atificial Intelligence, 1989, pp. 11-61.

L. Gamboa, M. G. Mendoza, J. M. Vargas, N. H. Gress, and R. E. Orozco, "Hybrid Fuzzy-SV Clustering for Heart Disease identification", in Proceedings of CIMCA- IAWTIC'06. 2006.

D. Resul, T. Ibrahim, S. Abdulkadir,. "Effective diagnosis of heart disease through neural Elsevier, 2008.

Z. Yao, P. Liu, L. Lei, and J. Yin, "R-C4.5 Decision tree model and its applications to health care dataset", in networks ensembles". Proceedings of the 2005 International Conference on Services Systems and Services Management. 2005. p. 1099- 1103.

K. Gang, P. Yi, S. Yong, C. Zhengxin,. "Privacy-preserving data mining of medical data using data separation- based techniques". Data science journal, Vol. 6, 2007.

http://heart-disease.emedtv.com/coronary-artery-disease/coronary-artery-disease.html

R. Gupta, V. P. Gupta, and N. S. Ahluwalia, 'Educational status, coronary heart disease, and coronary risk factor prevalence in a rural population of India', BMJ. 1994 November 19; 309(6965): 1332-1336

Panniyammakal Jeemon & K.S. Reddy, 'Social determinants of cardiovascular disease outcomes in Indians', Indian J Med Res 132, November 2010, pp 617-622.

A Mathavan, MD, A Chockalingam, PhD, S Chockalingam, BSc, B Bilchik, MD, and V Saini, MD, Madurai Area Physicians Cardiovascular Health Evaluation Survey (MAPCHES) - an alarming status', The Canadian Journal of Cardiology 2009 May; 25(5): 303-308.

Vamadevan S. Ajay & Dorairaj Prabhakaran, 'Coronary heart disease in Indians: Implications of the INTERHEART study', Indian J Med Res 132, November 2010, pp 561-566.

Reddy KS, Shah B, Varghese C, Ramadoss A. Responding to the threat of chronic diseases in India. Lancet 2005; 366 : 1744-9.

Gupta R, Joshi P, Mohan V, Reddy KS, Yusuf S. Epidemiology and causation of coronary heart disease and stroke in India. Heart 2008; 94 : 16-26

Prabhakaran D, Yusuf S, Mehta S, Pogue J, Avezum A, Budaj A, et al. Two-year outcomes in patients admitted with non- ST elevation acute coronary syndrome: results of the OASIS registry 1 and 2. Indian Heart J 2005; 57 : 217-25

Reddy KS, Prabhakaran D, Jeemon P, Thankappan KR, Joshi P, Chaturvedi V, et al. Educational status and cardiovascular risk profile in Indians. Proc Natl Acad Sci USA 2007; 104 : 16263-8.

Ajay VS, Prabhakaran D, Jeemon P, Thankappan KR, Mohan V, Ramakrishnan L, et al. Prevalence and determinants of diabetes mellitus in the Indian Industrial population. Diabetic Med 2008; 25 : 1187-94.

Puska P, Tuomilehto J, Aulikki N, Enkki V. The North Karelia Project. 20 years results and experiences. Helsinki: National Public Health Institute; 1995.

Newman, D.J., Hettich, S., Blake, C.L., Merz, C.J., 1998. UCI repository of machine learning databases. Department of Information and Computer Science, University California Irvine

Zhu, W., Zeng, N., Wang, N., 2010. Sensitivity, specificity, accuracy, associated confidence interval and roc analysis with practical SAS implementations. In: NESUG Proceedings: Health Care and Life Sciences, Baltimore, Maryland

http://www.public.asu.edu/~ltang9/papers/ency- cross-validation.pdf

McLachlan, Geoffrey J.; Do, Kim-Anh; Ambroise, Christophe (2004). Analyzing microarray gene expression data. Wiley.

Geisser, Seymour (1993). Predictive Inference. New York, NY: Chapman and Hall. ISBN 0-412-03471-9

Jiawei Han, Micheline Kamber, 'Datamining: Concepts and Techniques', 2nd Edition, Morgan Kaufmann Publishers, March 2006. ISBN 1-55860-901-6


Refbacks

  • There are currently no refbacks.


Creative Commons License
This work is licensed under a Creative Commons Attribution-NoDerivatives 4.0 International License.

An Initiative of The Tamil Nadu Dr MGR Medical University