Weighted multi-view co-clustering (WMVCC) for sparse data

Syed Fawad Hussain; Khadija Khan; Rashad Jillani

doi:10.1007/s10489-021-02405-3

Syed Fawad Hussain*, Khadija Khan, Rashad Jillani

*Corresponding author for this work

Computer Science

Research output: Contribution to journal › Article › peer-review

Abstract

Multi-view clustering has gained importance in recent times due to the large-scale generation of data, often from multiple sources. It refers to clustering a set of objects that are described by multiple sets of features, known as views; for example, movies can be described by their cast lists or by textual summaries of their plots. Co-clustering, on the other hand, refers to the simultaneous grouping of data samples and features under the assumption that samples exhibit a pattern only under a subset of features. This paper combines multi-view clustering with co-clustering and proposes a new Weighted Multi-View Co-Clustering (WMVCC) algorithm. The motivation behind the approach is to use the diversity of features provided by multiple sources of information while exploiting the power of co-clustering. The proposed method extends the clustering objective function to a unified co-clustering objective function across all views. The algorithm follows the k-means strategy and iteratively optimizes the clustering by updating cluster labels, features, and view weights. A local search is also employed to optimize the clustering result using weighted multi-step paths in a graph. Experiments are conducted on several benchmark datasets. The results show that the proposed approach converges quickly and that its clustering performance significantly exceeds that of other recent state-of-the-art algorithms on sparse datasets.
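
The alternating update described in the abstract can be illustrated with a minimal sketch. This is not the WMVCC algorithm from the paper: it covers only the label and view-weight updates of a generic weighted multi-view k-means, and it omits the feature (co-clustering) update and the graph-based local search. The function name `weighted_multiview_kmeans`, the inverse-loss weight rule, and the use of dense NumPy arrays are all assumptions made for illustration.

```python
import numpy as np


def weighted_multiview_kmeans(views, k, n_iter=20, seed=0):
    """Illustrative weighted multi-view k-means (NOT the published WMVCC).

    views: list of dense arrays, one per view, each of shape (n, d_v).
    Returns (labels, weights).
    """
    rng = np.random.default_rng(seed)
    n = views[0].shape[0]
    labels = rng.integers(0, k, size=n)            # random initial partition
    weights = np.full(len(views), 1.0 / len(views))

    for _ in range(n_iter):
        # Membership masks shared by all views; an empty cluster is
        # re-seeded with one randomly chosen sample.
        masks = []
        for c in range(k):
            mask = labels == c
            if not mask.any():
                mask = np.zeros(n, dtype=bool)
                mask[rng.integers(0, n)] = True
            masks.append(mask)

        # Squared Euclidean distance of every sample to every centroid,
        # computed separately in each view: one (n, k) array per view.
        dists = []
        for X in views:
            centroids = np.stack([X[m].mean(axis=0) for m in masks])
            diff = X[:, None, :] - centroids[None, :, :]
            dists.append((diff ** 2).sum(axis=2))

        # Label update: minimise the weighted sum of per-view distances.
        combined = sum(w * d for w, d in zip(weights, dists))
        new_labels = combined.argmin(axis=1)

        # View-weight update (assumed rule): weight is inversely
        # proportional to the view's mean within-cluster distance.
        losses = np.array([d[np.arange(n), new_labels].mean() for d in dists])
        inv = 1.0 / (losses + 1e-12)
        weights = inv / inv.sum()

        if np.array_equal(new_labels, labels):     # assignments stable
            break
        labels = new_labels

    return labels, weights
```

Calling `weighted_multiview_kmeans([X_actors, X_plot], k=5)` on two hypothetical feature matrices with the same number of rows would return one label per object and one weight per view. Views in which the clusters are tighter receive larger weights under the assumed rule, which is sensitive to feature scaling.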

Original language	English
Pages (from-to)	398–416
Number of pages	19
Journal	Applied Intelligence
Volume	52
Issue number	1
Early online date	1 May 2021
DOIs	https://doi.org/10.1007/s10489-021-02405-3
Publication status	Published - Jan 2022

Keywords

Clustering
Co-clustering
Information fusion
Multi-view clustering

Cite this

@article{32a7615bdfd64d82b461d872d8267aac,

title = "Weighted multi-view co-clustering (WMVCC) for sparse data",

abstract = "Multi-view clustering has gained importance in recent times due to the large-scale generation of data, often from multiple sources. Multi-view clustering refers to clustering a set of objects which are expressed by multiple set of features, known as views, such as movies being expressed by the list of actors or by a textual summary of its plot. Co-clustering, on the other hand, refers to the simultaneous grouping of data samples and features under the assumption that samples exhibit a pattern only under a subset of features. This paper combines multi-view clustering with co-clustering and proposes a new Weighted Multi-View Co-Clustering (WMVCC) algorithm. The motivation behind the approach is to use the diversity of features provided by multiple sources of information while exploiting the power of co-clustering. The proposed method expands the clustering objective function to a unified co-clustering objective function across all the multiple views. The algorithm follows the k-means strategy and iteratively optimizes the clustering by updating cluster labels, features, and view weights. A local search is also employed to optimize the clustering result using weighted multi-step paths in a graph. Experiments are conducted on several benchmark datasets. The results show that the proposed approach converges quickly, and the clustering performance significantly outperforms other recent and state-of-the-art algorithms on sparse datasets.",

keywords = "Clustering, Co-clustering, Information fusion, Multi-view clustering",

author = "Hussain, {Syed Fawad} and Khan, Khadija and Jillani, Rashad",

year = "2022",

month = jan,

doi = "10.1007/s10489-021-02405-3",

language = "English",

volume = "52",

pages = "398--416",

journal = "Applied Intelligence",

issn = "0924-669X",

publisher = "Springer",

number = "1",

}

TY - JOUR

T1 - Weighted multi-view co-clustering (WMVCC) for sparse data

AU - Hussain, Syed Fawad

AU - Khan, Khadija

AU - Jillani, Rashad

PY - 2022/1

Y1 - 2022/1

N2 - Multi-view clustering has gained importance in recent times due to the large-scale generation of data, often from multiple sources. Multi-view clustering refers to clustering a set of objects which are expressed by multiple set of features, known as views, such as movies being expressed by the list of actors or by a textual summary of its plot. Co-clustering, on the other hand, refers to the simultaneous grouping of data samples and features under the assumption that samples exhibit a pattern only under a subset of features. This paper combines multi-view clustering with co-clustering and proposes a new Weighted Multi-View Co-Clustering (WMVCC) algorithm. The motivation behind the approach is to use the diversity of features provided by multiple sources of information while exploiting the power of co-clustering. The proposed method expands the clustering objective function to a unified co-clustering objective function across all the multiple views. The algorithm follows the k-means strategy and iteratively optimizes the clustering by updating cluster labels, features, and view weights. A local search is also employed to optimize the clustering result using weighted multi-step paths in a graph. Experiments are conducted on several benchmark datasets. The results show that the proposed approach converges quickly, and the clustering performance significantly outperforms other recent and state-of-the-art algorithms on sparse datasets.

AB - Multi-view clustering has gained importance in recent times due to the large-scale generation of data, often from multiple sources. Multi-view clustering refers to clustering a set of objects which are expressed by multiple set of features, known as views, such as movies being expressed by the list of actors or by a textual summary of its plot. Co-clustering, on the other hand, refers to the simultaneous grouping of data samples and features under the assumption that samples exhibit a pattern only under a subset of features. This paper combines multi-view clustering with co-clustering and proposes a new Weighted Multi-View Co-Clustering (WMVCC) algorithm. The motivation behind the approach is to use the diversity of features provided by multiple sources of information while exploiting the power of co-clustering. The proposed method expands the clustering objective function to a unified co-clustering objective function across all the multiple views. The algorithm follows the k-means strategy and iteratively optimizes the clustering by updating cluster labels, features, and view weights. A local search is also employed to optimize the clustering result using weighted multi-step paths in a graph. Experiments are conducted on several benchmark datasets. The results show that the proposed approach converges quickly, and the clustering performance significantly outperforms other recent and state-of-the-art algorithms on sparse datasets.

KW - Clustering

KW - Co-clustering

KW - Information fusion

KW - Multi-view clustering

UR - http://www.scopus.com/inward/record.url?scp=85105381100&partnerID=8YFLogxK

U2 - 10.1007/s10489-021-02405-3

DO - 10.1007/s10489-021-02405-3

M3 - Article

SN - 0924-669X

VL - 52

SP - 398

EP - 416

JO - Applied Intelligence

JF - Applied Intelligence

IS - 1

ER -
