Clustering analysis of multivariate data: a weighted spatial ranks-based approach

Mohammed Baragilly, Hend Gabr, Brian H Willis*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

131 Downloads (Pure)

Abstract

Determining the right number of clusters without any prior information about their numbers is a core problem in cluster analysis. In this paper, we propose a nonparametric clustering method based on different weighted spatial rank (WSR) functions. The main idea behind WSR is to define a dissimilarity measure locally based on a localized version of multivariate ranks. We consider a nonparametric Gaussian kernel weights function. We compare the performance of the method with other standard techniques and assess its misclassification rate. The method is completely data-driven, robust against distributional assumptions, and accurate for the purpose of intuitive visualization and can be used both to determine the number of clusters and assign each observation to its cluster.
Original languageEnglish
Article number8849404
Number of pages15
JournalJournal of Probability and Statistics
Volume2023
DOIs
Publication statusPublished - 30 Sept 2023

Bibliographical note

Acknowledgments:
The authors would like to thank Biman Chakraborty for his helpful discussions and suggestions relating to this work, and BHW and MB were supported by a Clinician Scientist award with the Medical Research Council, UK (MR/N007999/1).

Fingerprint

Dive into the research topics of 'Clustering analysis of multivariate data: a weighted spatial ranks-based approach'. Together they form a unique fingerprint.

Cite this