Clustering with density based initialization and Bhattacharyya based merging
| dc.contributor.author | Kose, Erdem | |
| dc.contributor.author | Hocaoglu, Ali Koksal | |
| dc.date.accessioned | 2025-10-29T11:07:48Z | |
| dc.date.issued | 2022 | |
| dc.department | Gebze Teknik Üniversitesi | |
| dc.description.abstract | Centroid based clustering approaches, such as k-means, are relatively fast but inaccurate for arbitrary shape clusters. Fuzzy c-means with Mahalanobis distance can accurately identify clusters if data set can be modelled by a mixture of Gaussian distributions. However, they require number of clusters apriori and a bad initialization can cause poor results. Density based clustering methods, such as DBSCAN, overcome these disadvantages. However, they may perform poorly when the dataset is imbalanced. This paper proposes a clustering method, named clustering with density initialization and Bhattacharyya based merging based on the fuzzy clustering. The initialization is carried out by density estimation with adaptive bandwidth using k-Nearest Orthant-Neighb or algorithm to avoid the effects of imbalanced clusters. The local peaks of the point clouds constructed by the k-Nearest Orthant-Neighb or algorithm are used as initial cluster centers for the fuzzy clustering. We use Bhattacharyya measure and Jensen inequality to find overlapped Gaussians and merge them to form a single cluster. We carried out experiments on a variety of datasets and show that the proposed algorithm has remarkable advantages especially for imbalanced and arbitrarily shaped data sets. | |
| dc.identifier.doi | 10.55730/1300-0632.3794 | |
| dc.identifier.endpage | 517 | |
| dc.identifier.issn | 1300-0632 | |
| dc.identifier.issn | 1303-6203 | |
| dc.identifier.issue | 3 | |
| dc.identifier.orcid | 0000-0002-6763-680X | |
| dc.identifier.scopus | 2-s2.0-85128294991 | |
| dc.identifier.scopusquality | Q2 | |
| dc.identifier.startpage | 502 | |
| dc.identifier.trdizinid | 528661 | |
| dc.identifier.uri | https://doi.org/10.55730/1300-0632.3794 | |
| dc.identifier.uri | https://search.trdizin.gov.tr/tr/yayin/detay/528661 | |
| dc.identifier.uri | https://hdl.handle.net/20.500.14854/5074 | |
| dc.identifier.volume | 30 | |
| dc.identifier.wos | WOS:000774599800003 | |
| dc.identifier.wosquality | Q4 | |
| dc.indekslendigikaynak | Web of Science | |
| dc.indekslendigikaynak | Scopus | |
| dc.indekslendigikaynak | TR-Dizin | |
| dc.language.iso | en | |
| dc.publisher | Tubitak Scientific & Technological Research Council Turkey | |
| dc.relation.ispartof | Turkish Journal of Electrical Engineering and Computer Sciences | |
| dc.relation.publicationcategory | Makale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı | |
| dc.rights | info:eu-repo/semantics/openAccess | |
| dc.snmz | KA_WOS_20251020 | |
| dc.subject | Infinite mixture models | |
| dc.subject | density estimation | |
| dc.subject | Jensen inequality | |
| dc.subject | bandwidth selection | |
| dc.subject | optimal number of | |
| dc.title | Clustering with density based initialization and Bhattacharyya based merging | |
| dc.type | Article |









