A novel three staged clustering algorithm

Jamil Al-Shaqsi*, Wenjia Wang

*المؤلف المقابل لهذا العمل

نتاج البحث: Conference contribution

4 اقتباسات (Scopus)

ملخص

This paper presents a novel three staged clustering algorithm and a new similarity measure. The main objective of the first stage is to create the initial clusters, the second stage is to refine the initial clusters, and the third stage is to refine the initial BASES, if necessary. The novelty of our algorithm originates mainly from three aspects: automatically estimating k value, a new similarity measure and starting the clustering process with a promising BASE. A BASE acts similar to a centroid or a medoid in common clustering method but is determined differently in our method. The new similarity measure is defined particularly to reflect the degree of the relative change between data samples and to accommodate both numerical and categorical variables. Moreover, an additional function has been devised within this algorithm to automatically estimate the most appropriate number of clusters for a given dataset. The proposed algorithm has been tested on 3 benchmark datasets and compared with 7 other commonly used methods including TwoStep, k-means, k-modes, GAClust, Squeezer and some ensemble based methods including k-ANMI. The experimental results indicate that our algorithm identified the appropriate number of clusters for the tested datasets and also showed its overall better clustering performance over the compared clustering algorithms.

اللغة الأصليةEnglish
عنوان منشور المضيفProceedings of the IADIS European Conference on Data Mining 2009, ECDM'09 Part of the IADIS Multi Conference on Computer Science and Information Systems, MCCSIS 2009
الصفحات19-26
عدد الصفحات8
حالة النشرPublished - 2009
منشور خارجيًانعم
الحدثIADIS European Conference on Data Mining 2009, ECDM'09. Part of the IADIS Multi Conference on Computer Science and Information Systems, MCCSIS 2009 - Algarve, Portugal
المدة: يونيو ١٨ ٢٠٠٩يونيو ٢٠ ٢٠٠٩

سلسلة المنشورات

الاسمProceedings of the IADIS European Conference on Data Mining 2009, ECDM'09 Part of the IADIS Multi Conference on Computer Science and Information Systems, MCCSIS 2009

Other

OtherIADIS European Conference on Data Mining 2009, ECDM'09. Part of the IADIS Multi Conference on Computer Science and Information Systems, MCCSIS 2009
الدولة/الإقليمPortugal
المدينةAlgarve
المدة٦/١٨/٠٩٦/٢٠/٠٩

ASJC Scopus subject areas

  • ???subjectarea.asjc.1700.1702???
  • ???subjectarea.asjc.1700.1707???
  • ???subjectarea.asjc.1700.1712???

بصمة

أدرس بدقة موضوعات البحث “A novel three staged clustering algorithm'. فهما يشكلان معًا بصمة فريدة.

قم بذكر هذا