Evaluation of Clustering Validity

Ghayda A.A. Al-Talib; Rudhwan Yousif Sideek

Evaluation of Clustering Validity

Section: Research Paper

Issue

Vol. 5 No. 2 (2008): Volume 5 Issue 2

Published

Jun 25, 2025

Pages

79-97

Abstract

Clustering is a mostly unsupervised procedure and the majority of the clustering algorithms depend on certain assumptions in order to define the subgroups present in a data set. As a consequence, in most applications the resulting clustering scheme requires some sort of evaluation as regards its validity.
In this paper, we present a clustering validity procedure, which evaluates the results of clustering algorithms on data sets. We define a validity indexes, S_Dbw & SD, based on well-defined clustering criteria enabling the selection of the optimal input parameters values for a clustering algorithm that result in the best partitioning of a data set.
We evaluate the reliability of our indexes experimentally, considering clustering algorithm (K_Means) on real data sets.
Our approach is performed favorably in finding the correct number of clusters fitting a data set.

Authors

Ghayda A.A. Al-Talib

Department of Computer Science, College of Computer Science and Mathematics, University of Mosul, Mosul, IRAQ

ORCID

Rudhwan Yousif Sideek

Technical College Technical Education Authority / Mosul

Identifiers

Download this PDF file

PDF

Statistics

How to Cite

A.A. Al-Talib, G., غیداء, Yousif Sideek, R., & رضوان. (2025). Evaluation of Clustering Validity. AL-Rafidain Journal of Computer Sciences and Mathematics, 5(2), 79–97. Retrieved from https://rjps.uomosul.edu.iq/index.php/csmj/article/view/19938