:: Volume 3, Issue 3 (12-2016) ::
2016, 3(3): 205-213
A Novel Method of Gene Expression Data Clustering
Davood Shahsavani , Zohreh Farhadi
Abstract:

Introduction: Microarray technology and the gene expression data it produces are among the important developments in genetics, making it possible to study the behavior of thousands of genes simultaneously. Clustering is one of the most important data mining techniques used in gene expression data analysis. Because the performance of clustering methods is strongly affected by the structure of the data, the result of any single clustering is uncertain, and no one algorithm suits all kinds of data. In this study, ensemble clustering (combining the results of multiple clustering algorithms) was used for gene expression data analysis instead of a single algorithm.

Methods: The performance of ensemble clustering on three gene expression data sets, Nutt-v3, Alizadeh-v2 and SU, was evaluated by the adjusted Rand index. Twelve different clusterings, resulting from the combination of four clustering algorithms with three dissimilarity matrices, were applied simultaneously to the data. After merging the results and running the final clustering, the estimated clusters were compared with the actual groups using the adjusted Rand index.
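
The abstract does not say how the twelve clusterings were merged. As a rough illustration only, the sketch below uses a co-association (evidence accumulation) matrix, one common way to combine base clusterings, and is not necessarily the authors' procedure. The function names, the 0.5 threshold, and the toy labelings are all assumptions introduced for the example.

```python
def co_association(labelings):
    """Fraction of base clusterings that put items i and j in the same cluster."""
    n = len(labelings[0])
    counts = [[0] * n for _ in range(n)]
    for labels in labelings:
        for i in range(n):
            for j in range(n):
                if labels[i] == labels[j]:
                    counts[i][j] += 1
    m = len(labelings)
    return [[c / m for c in row] for row in counts]


def consensus_labels(labelings, threshold=0.5):
    """Final clustering: link items co-clustered in more than `threshold`
    of the base clusterings, then take connected components.
    (Hypothetical merge rule, chosen for brevity.)"""
    co = co_association(labelings)
    n = len(co)
    parent = list(range(n))

    def find(x):
        # Union-find lookup with path halving.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for i in range(n):
        for j in range(i + 1, n):
            if co[i][j] > threshold:
                parent[find(i)] = find(j)

    # Relabel components 0, 1, 2, ... in order of first appearance.
    ids = {}
    return [ids.setdefault(find(i), len(ids)) for i in range(n)]


# Toy example: three base clusterings of six samples (made-up labels).
base = [
    [0, 0, 0, 1, 1, 1],
    [1, 1, 1, 0, 0, 0],
    [0, 0, 1, 1, 1, 1],
]
final = consensus_labels(base)
```

In this sketch the number of final clusters is not fixed in advance; it falls out of the thresholded co-association matrix. A study like the one described would build `base` from the twelve algorithm and dissimilarity combinations instead of hand-written labels.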

Results: The adjusted Rand index for the Nutt-v3, Alizadeh-v2 and SU data sets was 1, 0.9 and 0.58, respectively, which shows the remarkable accuracy of the proposed method in detecting patterns in these data sets. Moreover, the designed algorithm detected the actual number of clusters without error.
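
For readers unfamiliar with the adjusted Rand index, the following minimal sketch computes it from two label vectors using the standard pair-counting formula. A value of 1 means the two partitions agree exactly (up to relabeling), and values near 0 mean agreement no better than chance. The function name and the toy labels are illustrative assumptions, not taken from the article.

```python
from collections import Counter
from math import comb


def adjusted_rand_index(labels_true, labels_pred):
    """Adjusted Rand index between two partitions of the same items."""
    if len(labels_true) != len(labels_pred):
        raise ValueError("label vectors must have the same length")
    n = len(labels_true)
    total_pairs = comb(n, 2)
    if total_pairs == 0:
        return 1.0  # fewer than two items: partitions trivially agree

    # Contingency table cells and its row/column sums.
    cells = Counter(zip(labels_true, labels_pred))
    rows = Counter(labels_true)
    cols = Counter(labels_pred)

    sum_cells = sum(comb(c, 2) for c in cells.values())
    sum_rows = sum(comb(c, 2) for c in rows.values())
    sum_cols = sum(comb(c, 2) for c in cols.values())

    expected = sum_rows * sum_cols / total_pairs
    max_index = (sum_rows + sum_cols) / 2
    if max_index == expected:
        return 1.0  # degenerate case, e.g. both partitions are one cluster
    return (sum_cells - expected) / (max_index - expected)


# Identical partitions with swapped label names (made-up data).
ari_same = adjusted_rand_index([0, 0, 1, 1], [1, 1, 0, 0])
# Partitions that disagree on every within-cluster pair.
ari_diff = adjusted_rand_index([0, 0, 1, 1], [0, 1, 0, 1])
```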

Conclusion: Ensemble clustering is a powerful and reliable method for gene expression data analysis. Given its accuracy and quality in detecting real data structures, it can replace individual clustering algorithms.

Keywords: Data mining, Ensemble clustering, Hierarchical clustering, Partitioning around medoids, Classical multidimensional scaling
Type of Study: Original Article | Subject: General
Received: 2016/11/11 | Accepted: 2016/12/12

Shahsavani D, Farhadi Z. A Novel Method of Gene Expression Data Clustering. Journal of Health and Biomedical Informatics. 2016; 3 (3) :205-213
URL: http://jhbmi.ir/article-1-153-en.html

Journal of Health and Biomedical Informatics