Clustering is a technique used to partition a dataset into groups of similar elements. In addition to tradi- tional clustering methods, clustering for probability density functions (CDF) has been studied to capture data uncertainty. In CDF, automatic clustering is a clever technique that can determine the number of clusters automatically. However, current automatic clus- tering algorithms update the new probability density function (pdf) fi(t) based on the weighted mean of all previous pdfs fj(t − 1), j = 1, 2, ... , N, resulting in slow convergence. This paper proposes an efficient automatic clustering algorithm for pdfs. In the proposed approach, the update of fi(t) is based on the weighted mean of {f1(t), f2(t), ... , fi−1(t), fi(t − 1), fi+1(t − 1), ... , fN (t − 1)}, where N is the number of pdfs and i=1,2,...,N. This technique allows for the incorporation of recently updated pdfs, leading to faster convergence. This paper also pioneers the applications of certain CDF algorithms in the field of surface image recognition. The numerical examples demonstrate that the proposed method can result in a rapid convergence at some early iterations. It also outperforms other state-of-the-art automatic clus- tering methods in terms of the Adjusted Rand Index and the Normalized Mutual Information. Addition- ally, the proposed algorithm proves to be competitive when clustering material images contaminated by noise.
Tạp chí khoa học Trường Đại học Cần Thơ
Lầu 4, Nhà Điều Hành, Khu II, đường 3/2, P. Xuân Khánh, Q. Ninh Kiều, TP. Cần Thơ
Điện thoại: (0292) 3 872 157; Email: tapchidhct@ctu.edu.vn
Chương trình chạy tốt nhất trên trình duyệt IE 9+ & FF 16+, độ phân giải màn hình 1024x768 trở lên