
One-Shot Learning of Poisson Distributions in Serial Analysis of Gene Expression

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

Traditionally, studies in learning theory have concentrated on situations where a potentially ever-increasing number of training examples is available. However, there are situations where only extremely small samples can be used to perform inference. In such situations it is of utmost importance to analyze theoretically what can be learned, and under what circumstances. One such scenario is the detection of differentially expressed genes. In our previous study (BMC Bioinformatics, 2009) we theoretically analyzed one of the most popular techniques for identifying genes with statistically different expression in SAGE libraries: the Audic-Claverie statistic (Genome Research, 1997). When comparing two libraries in the Audic-Claverie framework, it is assumed that under the null hypothesis their tag counts come from the same underlying (unknown) Poisson distribution. Since each SAGE library represents a single measurement, the inference has to be performed on the smallest sample possible, a sample of size 1. In this contribution we compare the Audic-Claverie approach with a (regularized) maximum likelihood (ML) framework. We analytically approximate the expected K-L divergence from the true unknown Poisson distribution to the model and show that, while the expected K-L divergence to the ML-estimated models seems to be always larger than that to the Audic-Claverie statistic, the greatest divergence appears for true Poisson distributions with a small mean parameter. We also theoretically analyze the effect of regularizing the ML estimates in the case of zero observed counts. Our results constitute a rigorous analysis of a situation of great practical importance, in which the benefits of the Bayesian approach can be clearly demonstrated in a quantitative and principled manner.
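
The comparison described in the abstract can be illustrated numerically. The following is a minimal sketch, not the authors' analytical derivation. It assumes equal library sizes, for which the Audic-Claverie predictive probability is p(y|x) = (x+y)! / (x! y! 2^(x+y+1)). It also assumes that the regularized ML estimate simply replaces a zero count with a small constant `eps`; the abstract does not specify the form of regularization, so this choice and the value 0.5 are illustrative. All function names are hypothetical, and the infinite sums are truncated.

```python
import math

def log_poisson_pmf(k, lam):
    """Log of the Poisson probability of count k with mean lam (lam > 0)."""
    return k * math.log(lam) - lam - math.lgamma(k + 1)

def log_audic_claverie_pmf(y, x):
    """Log of the Audic-Claverie predictive p(y | x), assuming equal library sizes."""
    return (math.lgamma(x + y + 1) - math.lgamma(x + 1) - math.lgamma(y + 1)
            - (x + y + 1) * math.log(2.0))

def log_ml_pmf(y, x, eps=0.5):
    """Log of the Poisson model with mean set to the single observed count x.

    Assumption: a zero count is regularized by replacing it with eps.
    """
    lam_hat = x if x > 0 else eps
    return log_poisson_pmf(y, lam_hat)

def kl_from_true(lam, log_model_pmf, x, y_max=200):
    """K-L divergence from Poisson(lam) to the model fitted on one count x.

    The sum over y is truncated at y_max.
    """
    total = 0.0
    for y in range(y_max + 1):
        log_p = log_poisson_pmf(y, lam)
        total += math.exp(log_p) * (log_p - log_model_pmf(y, x))
    return total

def expected_kl(lam, log_model_pmf, x_max=200, y_max=200):
    """Average the divergence over the single observation x ~ Poisson(lam)."""
    return sum(
        math.exp(log_poisson_pmf(x, lam)) * kl_from_true(lam, log_model_pmf, x, y_max)
        for x in range(x_max + 1)
    )

if __name__ == "__main__":
    # Compare the two models for a few small true means.
    for lam in (0.5, 1.0, 2.0, 5.0):
        e_ac = expected_kl(lam, log_audic_claverie_pmf)
        e_ml = expected_kl(lam, log_ml_pmf)
        print(f"lam={lam}: Audic-Claverie {e_ac:.4f}, regularized ML {e_ml:.4f}")
```

Working in log space avoids underflow in the tails of the truncated sums. The truncation limits are adequate only for small true means such as those in the loop above; larger means would need larger limits.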
Original language: English
Title of host publication: Advances in Neural Networks -- ISNN 2011
Subtitle of host publication: 8th International Symposium on Neural Networks, ISNN 2011, Guilin, China, May 29--June 1, 2011, Proceedings, Part II
Editors: Derong Liu, Huaguang Zhang, Marios Polycarpou, Cesare Alippi, Haibo He
Publisher: Springer
Pages: 37-46
Number of pages: 10
Edition: 1
ISBN (Electronic): 9783642210907
ISBN (Print): 9783642210891
Publication status: Published - 10 May 2011

Publication series

Name: Lecture Notes in Computer Science
Publisher: Springer
Volume: 6676
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349

Keywords

  • Audic-Claverie statistic
  • Bayesian averaging
  • Poisson distribution
  • Kullback-Leibler divergence
  • differential gene expression
