Clustering with density based initialization and Bhattacharyya based merging

dc.contributor.authorKose, Erdem
dc.contributor.authorHocaoglu, Ali Koksal
dc.date.accessioned2025-10-29T11:07:48Z
dc.date.issued2022
dc.departmentGebze Teknik Üniversitesi
dc.description.abstractCentroid based clustering approaches, such as k-means, are relatively fast but inaccurate for arbitrary shape clusters. Fuzzy c-means with Mahalanobis distance can accurately identify clusters if data set can be modelled by a mixture of Gaussian distributions. However, they require number of clusters apriori and a bad initialization can cause poor results. Density based clustering methods, such as DBSCAN, overcome these disadvantages. However, they may perform poorly when the dataset is imbalanced. This paper proposes a clustering method, named clustering with density initialization and Bhattacharyya based merging based on the fuzzy clustering. The initialization is carried out by density estimation with adaptive bandwidth using k-Nearest Orthant-Neighb or algorithm to avoid the effects of imbalanced clusters. The local peaks of the point clouds constructed by the k-Nearest Orthant-Neighb or algorithm are used as initial cluster centers for the fuzzy clustering. We use Bhattacharyya measure and Jensen inequality to find overlapped Gaussians and merge them to form a single cluster. We carried out experiments on a variety of datasets and show that the proposed algorithm has remarkable advantages especially for imbalanced and arbitrarily shaped data sets.
dc.identifier.doi10.55730/1300-0632.3794
dc.identifier.endpage517
dc.identifier.issn1300-0632
dc.identifier.issn1303-6203
dc.identifier.issue3
dc.identifier.orcid0000-0002-6763-680X
dc.identifier.scopus2-s2.0-85128294991
dc.identifier.scopusqualityQ2
dc.identifier.startpage502
dc.identifier.trdizinid528661
dc.identifier.urihttps://doi.org/10.55730/1300-0632.3794
dc.identifier.urihttps://search.trdizin.gov.tr/tr/yayin/detay/528661
dc.identifier.urihttps://hdl.handle.net/20.500.14854/5074
dc.identifier.volume30
dc.identifier.wosWOS:000774599800003
dc.identifier.wosqualityQ4
dc.indekslendigikaynakWeb of Science
dc.indekslendigikaynakScopus
dc.indekslendigikaynakTR-Dizin
dc.language.isoen
dc.publisherTubitak Scientific & Technological Research Council Turkey
dc.relation.ispartofTurkish Journal of Electrical Engineering and Computer Sciences
dc.relation.publicationcategoryMakale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı
dc.rightsinfo:eu-repo/semantics/openAccess
dc.snmzKA_WOS_20251020
dc.subjectInfinite mixture models
dc.subjectdensity estimation
dc.subjectJensen inequality
dc.subjectbandwidth selection
dc.subjectoptimal number of
dc.titleClustering with density based initialization and Bhattacharyya based merging
dc.typeArticle

Dosyalar