Journal article Open Access

A parallel ADMM-based convex clustering method

Fodor Lidija; Jakovetić Dušan; Boberić Krstićev Danijela; Škrbić Srđan

Convex clustering has received recently an increased interest as a valuable method for unsupervised learning. Unlike conventional clustering methods such as k-means, its formulation corresponds to solving a convex optimization problem and hence, alleviates initialization and local minima problems. However, while several algorithms have been proposed to solve convex clustering formulations, including those based on the alternating direction method of multipliers (ADMM), there is currently a limited body of work on developing scalable parallel and distributed algorithms and solvers for convex clustering. In this paper, we develop a parallel, ADMM-based method, for a modified convex clustering sum-of-norms (SON) formulation for master–worker architectures, where the data to be clustered are partitioned across a number of worker nodes, and we provide its efficient, open-source implementation (available on Parallel ADMM-based convex clustering. https://github.com/lidijaf/Parallel-ADMM-based-convex-clustering. Accessed on 10 June 2022) for high-performance computing (HPC) cluster environments. Extensive numerical evaluations on real and synthetic data sets demonstrate a high degree of scalability and efficiency of the method, when compared with existing alternative solvers for convex clustering.

LF developed the implementation of the algorithm and performed the empirical evaluations. DJ contributed with the theoretical advances and design of algorithm. DBK and SS contributed to improving the quality of experimentation and design. All authors participated in the main research flow development and in writing and revising the manuscript. All authors read and approved the final manuscript. The code for parallel ADMM-based convex clustering can be found in the following GitHub repository: https://github.com/lidijaf/Parallel-ADMM-based-convex-clustering. The datasets used and analyzed during the current study are available from the corresponding author on reasonable request.
Files (3.6 MB)
Name Size
Fodor_et_al.pdf
md5:a26c567c8401ffde266afc8db045dc2d
3.6 MB Download
13
11
views
downloads
Views 13
Downloads 11
Data volume 39.4 MB
Unique views 9
Unique downloads 8

Share

Cite as