Bi-clustering gene expression data using co-similarity

Syed Fawad Hussain

doi:10.1007/978-3-642-25853-4_15

Bi-clustering gene expression data using co-similarity

Syed Fawad Hussain^*

^*Corresponding author for this work

Computer Science

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

15 Citations (Scopus)

Abstract

We propose a new framework for bi-clustering gene expression data that is based on the notion of co-similarity between genes and samples. Our work is based on a co-similarity based framework that iteratively learns similarity between rows using similarity between columns and vice-versa in a matrix. The underlying concept, which is usually referred to as bi-clustering in the domain of bioinformatics, aims to find groupings of the feature set that exhibit similar behavior across sample subsets. The algorithm has previously been shown to work well for document clustering in a sparse matrix representation. We propose a variation of the method suited for analyzing data that is represented as a dense matrix and is non-homogenous as is the case in gene expression. Our experiments show that, with the proposed variations, the method is well suited for finding bi-clusters with high degree of homogeneity and we provide empirical results on real world cancer datasets.

Original language	English
Title of host publication	Advanced Data Mining and Applications - 7th International Conference, ADMA 2011, Proceedings
Pages	190-200
Number of pages	11
Edition	PART 1
DOIs	https://doi.org/10.1007/978-3-642-25853-4_15
Publication status	Published - 2011
Event	7th International Conference on Advanced Data Mining and Applications, ADMA 2011 - Beijing, China Duration: 17 Dec 2011 → 19 Dec 2011

Publication series

Name	Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Number	PART 1
Volume	7120 LNAI
ISSN (Print)	0302-9743
ISSN (Electronic)	1611-3349

Conference

Conference	7th International Conference on Advanced Data Mining and Applications, ADMA 2011
Country/Territory	China
City	Beijing
Period	17/12/11 → 19/12/11

Keywords

Bi-clustering
Co-similarity
Gene Expression Analysis

ASJC Scopus subject areas

Theoretical Computer Science
General Computer Science

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

Access to Document

10.1007/978-3-642-25853-4_15

Cite this

Hussain, S. F. (2011). Bi-clustering gene expression data using co-similarity. In Advanced Data Mining and Applications - 7th International Conference, ADMA 2011, Proceedings (PART 1 ed., pp. 190-200). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 7120 LNAI, No. PART 1). https://doi.org/10.1007/978-3-642-25853-4_15

@inproceedings{9dde8a3213a8496b9c648b3555a95002,

title = "Bi-clustering gene expression data using co-similarity",

abstract = "We propose a new framework for bi-clustering gene expression data that is based on the notion of co-similarity between genes and samples. Our work is based on a co-similarity based framework that iteratively learns similarity between rows using similarity between columns and vice-versa in a matrix. The underlying concept, which is usually referred to as bi-clustering in the domain of bioinformatics, aims to find groupings of the feature set that exhibit similar behavior across sample subsets. The algorithm has previously been shown to work well for document clustering in a sparse matrix representation. We propose a variation of the method suited for analyzing data that is represented as a dense matrix and is non-homogenous as is the case in gene expression. Our experiments show that, with the proposed variations, the method is well suited for finding bi-clusters with high degree of homogeneity and we provide empirical results on real world cancer datasets.",

keywords = "Bi-clustering, Co-similarity, Gene Expression Analysis",

author = "Hussain, {Syed Fawad}",

year = "2011",

doi = "10.1007/978-3-642-25853-4_15",

language = "English",

isbn = "9783642258527",

series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",

number = "PART 1",

pages = "190--200",

booktitle = "Advanced Data Mining and Applications - 7th International Conference, ADMA 2011, Proceedings",

edition = "PART 1",

note = "7th International Conference on Advanced Data Mining and Applications, ADMA 2011 ; Conference date: 17-12-2011 Through 19-12-2011",

}

Hussain, SF 2011, Bi-clustering gene expression data using co-similarity. in Advanced Data Mining and Applications - 7th International Conference, ADMA 2011, Proceedings. PART 1 edn, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), no. PART 1, vol. 7120 LNAI, pp. 190-200, 7th International Conference on Advanced Data Mining and Applications, ADMA 2011, Beijing, China, 17/12/11. https://doi.org/10.1007/978-3-642-25853-4_15

Bi-clustering gene expression data using co-similarity. / Hussain, Syed Fawad.
Advanced Data Mining and Applications - 7th International Conference, ADMA 2011, Proceedings. PART 1. ed. 2011. p. 190-200 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 7120 LNAI, No. PART 1).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

TY - GEN

T1 - Bi-clustering gene expression data using co-similarity

AU - Hussain, Syed Fawad

PY - 2011

Y1 - 2011

N2 - We propose a new framework for bi-clustering gene expression data that is based on the notion of co-similarity between genes and samples. Our work is based on a co-similarity based framework that iteratively learns similarity between rows using similarity between columns and vice-versa in a matrix. The underlying concept, which is usually referred to as bi-clustering in the domain of bioinformatics, aims to find groupings of the feature set that exhibit similar behavior across sample subsets. The algorithm has previously been shown to work well for document clustering in a sparse matrix representation. We propose a variation of the method suited for analyzing data that is represented as a dense matrix and is non-homogenous as is the case in gene expression. Our experiments show that, with the proposed variations, the method is well suited for finding bi-clusters with high degree of homogeneity and we provide empirical results on real world cancer datasets.

AB - We propose a new framework for bi-clustering gene expression data that is based on the notion of co-similarity between genes and samples. Our work is based on a co-similarity based framework that iteratively learns similarity between rows using similarity between columns and vice-versa in a matrix. The underlying concept, which is usually referred to as bi-clustering in the domain of bioinformatics, aims to find groupings of the feature set that exhibit similar behavior across sample subsets. The algorithm has previously been shown to work well for document clustering in a sparse matrix representation. We propose a variation of the method suited for analyzing data that is represented as a dense matrix and is non-homogenous as is the case in gene expression. Our experiments show that, with the proposed variations, the method is well suited for finding bi-clusters with high degree of homogeneity and we provide empirical results on real world cancer datasets.

KW - Bi-clustering

KW - Co-similarity

KW - Gene Expression Analysis

UR - http://www.scopus.com/inward/record.url?scp=84255176339&partnerID=8YFLogxK

U2 - 10.1007/978-3-642-25853-4_15

DO - 10.1007/978-3-642-25853-4_15

M3 - Conference contribution

AN - SCOPUS:84255176339

SN - 9783642258527

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 190

EP - 200

BT - Advanced Data Mining and Applications - 7th International Conference, ADMA 2011, Proceedings

T2 - 7th International Conference on Advanced Data Mining and Applications, ADMA 2011

Y2 - 17 December 2011 through 19 December 2011

ER -

Hussain SF. Bi-clustering gene expression data using co-similarity. In Advanced Data Mining and Applications - 7th International Conference, ADMA 2011, Proceedings. PART 1 ed. 2011. p. 190-200. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); PART 1). doi: 10.1007/978-3-642-25853-4_15

Bi-clustering gene expression data using co-similarity

Abstract

Publication series

Conference

Keywords

ASJC Scopus subject areas

UN SDGs

Access to Document

Fingerprint

Cite this