Volume 8, Issue 2 (9-2021)                   jhbmi 2021, 8(2): 193-207 | Back to browse issues page

XML Persian Abstract Print


Download citation:
BibTeX | RIS | EndNote | Medlars | ProCite | Reference Manager | RefWorks
Send citation to:

Saadi P, Zeinalnezhad M, Movahedi Sobhani F. Modeling and Predicting the Risk of Coronary Artery Disease Using Data Mining Algorithms. jhbmi 2021; 8 (2) :193-207
URL: http://jhbmi.ir/article-1-592-en.html
Ph.D. in Industrial Engineering, Assistant Professor, Industrial Engineering Dept., Faculty of Engineering, West Tehran Branch, Islamic Azad University, Tehran, Iran
Abstract:   (1803 Views)
Introduction: Coronary artery disease (CAD) is one of the most common causes of death in adults while accurate and early diagnosis can lead to treatment and survival of patients to a great extent. Therefore, the objective of this study was to identify the effective factors leading to this disease and develop a data-driven model to assist physicians in predicting and diagnosing it.
Method: This is an applied research, considering 2038 medical records, collected from Shahid Rajaei Heart Hospital in Tehran, during 5 years. A data preprocessing was carried out and random balanced sampling reduced the dataset into 1000 records, with 500 CAD and 500 Normal. Literature review, consultation with specialist physicians, and weighting using the Chi-square method led to the determination of important features. Support Vector Machine, Neural Network and Random Forest algorithms were applied in RapidMiner and Python.
Results: Among the 35 identified variables, the most important features included VHD, Chest pain, LDL, RWMA, TG, Na, K, BP, and weight. The F-measure, precision, accuracy, and recall for random forest algorithm were calculated as 82.11%, 81.40%, 79.07%, and 85.40%, respectively, and the error rate was 18.6%.
Conclusion: Random Forest predicted the risk of CAD with a reasonable precision. In comparison, due to the large number of input nodes, the error rate of the Neural Network model was relatively higher (23.6%).
Full-Text [PDF 1337 kb]   (1062 Downloads)    
Type of Study: Original Article | Subject: Data Mining
Received: 2021/05/9 | Accepted: 2021/08/23

Audio File [MP3 3992 KB]  (61 Download)
Add your comments about this article : Your username or Email:
CAPTCHA

Send email to the article author


Rights and permissions
Creative Commons License This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.

© 2024 CC BY-NC 4.0 | Journal of Health and Biomedical Informatics

Designed & Developed by : Yektaweb