Bi-clustering gene expression data using co-similarity

Syed Fawad Hussain*

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contribution

15 Citations (Scopus)

Abstract

We propose a new framework for bi-clustering gene expression data that is based on the notion of co-similarity between genes and samples. Our work is based on a co-similarity based framework that iteratively learns similarity between rows using similarity between columns and vice-versa in a matrix. The underlying concept, which is usually referred to as bi-clustering in the domain of bioinformatics, aims to find groupings of the feature set that exhibit similar behavior across sample subsets. The algorithm has previously been shown to work well for document clustering in a sparse matrix representation. We propose a variation of the method suited for analyzing data that is represented as a dense matrix and is non-homogenous as is the case in gene expression. Our experiments show that, with the proposed variations, the method is well suited for finding bi-clusters with high degree of homogeneity and we provide empirical results on real world cancer datasets.

Original languageEnglish
Title of host publicationAdvanced Data Mining and Applications - 7th International Conference, ADMA 2011, Proceedings
Pages190-200
Number of pages11
EditionPART 1
DOIs
Publication statusPublished - 2011
Event7th International Conference on Advanced Data Mining and Applications, ADMA 2011 - Beijing, China
Duration: 17 Dec 201119 Dec 2011

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
NumberPART 1
Volume7120 LNAI
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference7th International Conference on Advanced Data Mining and Applications, ADMA 2011
Country/TerritoryChina
CityBeijing
Period17/12/1119/12/11

Keywords

  • Bi-clustering
  • Co-similarity
  • Gene Expression Analysis

ASJC Scopus subject areas

  • Theoretical Computer Science
  • General Computer Science

Fingerprint

Dive into the research topics of 'Bi-clustering gene expression data using co-similarity'. Together they form a unique fingerprint.

Cite this