Ensemble clustering using extended fuzzy k-means for cancer data analysis

Imran Khan; Zongwei Luo; Abdul Khalique Shaikh; Rachid Hedjam

doi:10.1016/j.eswa.2021.114622

Ensemble clustering using extended fuzzy k-means for cancer data analysis

Imran Khan, Zongwei Luo, Abdul Khalique Shaikh, Rachid Hedjam

نتاج البحث: المساهمة في مجلة › Article › مراجعة النظراء

28 اقتباسات (Scopus)

ملخص

Clustering analysis is a significant research topic in discovering cancer using different profiles of gene expression, which is very important to successfully diagnose and treat the cancer decease. Many ensemble clustering methods have been developed to perform clustering using tumor data. Only few of them incorporates a significant number of input clusterings, the optimal number of clusters in each input clustering, and an appropriate ensemble method to combine input clusterings into a final clustering. In this paper, we introduce two new steps in the standard fuzzy k-means algorithm to determine the optimal number of input clusterings, and the optimal number of clusters in each clustering for ensemble clustering. The first one is to incorporate a penalty term for making the algorithm insensitive to the initialization of cluster centroids. The second one is to automate a clustering process for iteratively updating the feature weights. This step addresses the noise values in the dataset. We propose an ensemble clustering method, which combines a set of input clusterings into a final clustering having better overall quality. Experiments on real cancer gene expression profiles illustrate that the proposed algorithm outperformed the well-known clustering algorithms.

اللغة الأصلية	English
رقم المقال	114622
الصفحات (من إلى)	114622
عدد الصفحات	1
دورية	Expert Systems with Applications
مستوى الصوت	172
المعرِّفات الرقمية للأشياء	https://doi.org/10.1016/j.eswa.2021.114622
حالة النشر	Published - يونيو 15 2021

ASJC Scopus subject areas

???subjectarea.asjc.2200.2200???
???subjectarea.asjc.1700.1706???
???subjectarea.asjc.1700.1702???

أهداف الأمم المتحدة للتنمية المستدامة

يساهم هذا المخرج في تحقيق أهداف الأمم المتحدة للتنمية المستدامة التالية (SDGs)

الوصول إلى المستند

10.1016/j.eswa.2021.114622

الملفات والروابط الأخرى

قم بذكر هذا

@article{fb0b4da47b824a55add4811e3577cdc9,

title = "Ensemble clustering using extended fuzzy k-means for cancer data analysis",

abstract = "Clustering analysis is a significant research topic in discovering cancer using different profiles of gene expression, which is very important to successfully diagnose and treat the cancer decease. Many ensemble clustering methods have been developed to perform clustering using tumor data. Only few of them incorporates a significant number of input clusterings, the optimal number of clusters in each input clustering, and an appropriate ensemble method to combine input clusterings into a final clustering. In this paper, we introduce two new steps in the standard fuzzy k-means algorithm to determine the optimal number of input clusterings, and the optimal number of clusters in each clustering for ensemble clustering. The first one is to incorporate a penalty term for making the algorithm insensitive to the initialization of cluster centroids. The second one is to automate a clustering process for iteratively updating the feature weights. This step addresses the noise values in the dataset. We propose an ensemble clustering method, which combines a set of input clusterings into a final clustering having better overall quality. Experiments on real cancer gene expression profiles illustrate that the proposed algorithm outperformed the well-known clustering algorithms.",

keywords = "Cancer data, Cluster analysis, Fuzzy k-means, Variable weights",

author = "Imran Khan and Zongwei Luo and Shaikh, {Abdul Khalique} and Rachid Hedjam",

note = "Publisher Copyright: {\textcopyright} 2021 Elsevier Ltd",

year = "2021",

month = jun,

day = "15",

doi = "10.1016/j.eswa.2021.114622",

language = "English",

volume = "172",

pages = "114622",

journal = "Expert Systems with Applications",

issn = "0957-4174",

publisher = "Elsevier Limited",

}

TY - JOUR

T1 - Ensemble clustering using extended fuzzy k-means for cancer data analysis

AU - Khan, Imran

AU - Luo, Zongwei

AU - Shaikh, Abdul Khalique

AU - Hedjam, Rachid

PY - 2021/6/15

Y1 - 2021/6/15

N2 - Clustering analysis is a significant research topic in discovering cancer using different profiles of gene expression, which is very important to successfully diagnose and treat the cancer decease. Many ensemble clustering methods have been developed to perform clustering using tumor data. Only few of them incorporates a significant number of input clusterings, the optimal number of clusters in each input clustering, and an appropriate ensemble method to combine input clusterings into a final clustering. In this paper, we introduce two new steps in the standard fuzzy k-means algorithm to determine the optimal number of input clusterings, and the optimal number of clusters in each clustering for ensemble clustering. The first one is to incorporate a penalty term for making the algorithm insensitive to the initialization of cluster centroids. The second one is to automate a clustering process for iteratively updating the feature weights. This step addresses the noise values in the dataset. We propose an ensemble clustering method, which combines a set of input clusterings into a final clustering having better overall quality. Experiments on real cancer gene expression profiles illustrate that the proposed algorithm outperformed the well-known clustering algorithms.

AB - Clustering analysis is a significant research topic in discovering cancer using different profiles of gene expression, which is very important to successfully diagnose and treat the cancer decease. Many ensemble clustering methods have been developed to perform clustering using tumor data. Only few of them incorporates a significant number of input clusterings, the optimal number of clusters in each input clustering, and an appropriate ensemble method to combine input clusterings into a final clustering. In this paper, we introduce two new steps in the standard fuzzy k-means algorithm to determine the optimal number of input clusterings, and the optimal number of clusters in each clustering for ensemble clustering. The first one is to incorporate a penalty term for making the algorithm insensitive to the initialization of cluster centroids. The second one is to automate a clustering process for iteratively updating the feature weights. This step addresses the noise values in the dataset. We propose an ensemble clustering method, which combines a set of input clusterings into a final clustering having better overall quality. Experiments on real cancer gene expression profiles illustrate that the proposed algorithm outperformed the well-known clustering algorithms.

KW - Cancer data

KW - Cluster analysis

KW - Fuzzy k-means

KW - Variable weights

UR - http://www.scopus.com/inward/record.url?scp=85100691973&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85100691973&partnerID=8YFLogxK

U2 - 10.1016/j.eswa.2021.114622

DO - 10.1016/j.eswa.2021.114622

M3 - Article

AN - SCOPUS:85100691973

SN - 0957-4174

VL - 172

SP - 114622

JO - Expert Systems with Applications

JF - Expert Systems with Applications

M1 - 114622

ER -

Ensemble clustering using extended fuzzy k-means for cancer data analysis

ملخص

ASJC Scopus subject areas

أهداف الأمم المتحدة للتنمية المستدامة

الوصول إلى المستند

الملفات والروابط الأخرى

بصمة

قم بذكر هذا