RT - Journal Article T1 - The investigation of TB patients features with K-Means clustering JF - jhbmi YR - 2015 JO - jhbmi VO - 2 IS - 3 UR - http://jhbmi.ir/article-1-110-en.html SP - 149 EP - 159 K1 - Tuberculosis K1 - Clustering K1 - Association rules K1 - Data mining AB - Introduction: According to the World Health Organization, TB is the largest cause of death among infectious diseases. Due to the high percentage of tuberculosis infection and the high number of death among these patients, this study was carried out to categorized and find the relationship between different clinical and demographical characteristics. Method: This descriptive analytical study was done on 600 patients from Masih Daneshvari hospital tuberculosis research center. K-means clustering, Apriori association rules, and data mining algorithms (SPSS Clementine software) were used for clustering and determining the common characteristics among patients. Results: Based on DUNN index, 3 clusters were chosen as optimal cluster. The common factors between clusters have been described in details in findings section. According to the characteristics of each cluster, patients can be classified based on the effectiveness of various factors Conclusion: According to the results of this study, the most important identified factors by the use of clustering are Hemoglobin, age, sex, smoking, alcohol and Creatinine. Based on the association rules the highest rate of relationship is found between cough, weight loss, and ESR. LA eng UL http://jhbmi.ir/article-1-110-en.html M3 ER -