Robust object detection with interleaved categorization and segmentation

B. Leibe; A. Leonardis; B. Schiele

doi:10.1007/s11263-007-0095-3

Robust object detection with interleaved categorization and segmentation

B. Leibe, A. Leonardis, B. Schiele

Computer Science

Research output: Contribution to journal › Article › peer-review

1060 Citations (Scopus)

Abstract

This paper presents a novel method for detecting and localizing objects of a visual category in cluttered real-world scenes. Our approach considers object categorization and figure-ground segmentation as two interleaved processes that closely collaborate towards a common goal. As shown in our work, the tight coupling between those two processes allows them to benefit from each other and improve the combined performance. The core part of our approach is a highly flexible learned representation for object shape that can combine the information observed on different training examples in a probabilistic extension of the Generalized Hough Transform. The resulting approach can detect categorical objects in novel images and automatically infer a probabilistic segmentation from the recognition result. This segmentation is then in turn used to again improve recognition by allowing the system to focus its efforts on object pixels and to discard misleading influences from the background. Moreover, the information from where in the image a hypothesis draws its support is employed in an MDL based hypothesis verification stage to resolve ambiguities between overlapping hypotheses and factor out the effects of partial occlusion. An extensive evaluation on several large data sets shows that the proposed system is applicable to a range of different object categories, including both rigid and articulated objects. In addition, its flexible representation allows it to achieve competitive object detection performance already from training sets that are between one and two orders of magnitude smaller than those used in comparable systems.

Original language	English
Pages (from-to)	259-289
Number of pages	31
Journal	International Journal of Computer Vision
Volume	77
Issue number	1-3
DOIs	https://doi.org/10.1007/s11263-007-0095-3
Publication status	Published - 1 May 2008

Bibliographical note

Access to Document

10.1007/s11263-007-0095-3

Cite this

@article{c246e006bd654017a7a272f5d798b364,

title = "Robust object detection with interleaved categorization and segmentation",

abstract = "This paper presents a novel method for detecting and localizing objects of a visual category in cluttered real-world scenes. Our approach considers object categorization and figure-ground segmentation as two interleaved processes that closely collaborate towards a common goal. As shown in our work, the tight coupling between those two processes allows them to benefit from each other and improve the combined performance. The core part of our approach is a highly flexible learned representation for object shape that can combine the information observed on different training examples in a probabilistic extension of the Generalized Hough Transform. The resulting approach can detect categorical objects in novel images and automatically infer a probabilistic segmentation from the recognition result. This segmentation is then in turn used to again improve recognition by allowing the system to focus its efforts on object pixels and to discard misleading influences from the background. Moreover, the information from where in the image a hypothesis draws its support is employed in an MDL based hypothesis verification stage to resolve ambiguities between overlapping hypotheses and factor out the effects of partial occlusion. An extensive evaluation on several large data sets shows that the proposed system is applicable to a range of different object categories, including both rigid and articulated objects. In addition, its flexible representation allows it to achieve competitive object detection performance already from training sets that are between one and two orders of magnitude smaller than those used in comparable systems.",

author = "B. Leibe and A. Leonardis and B. Schiele",

year = "2008",

month = may,

day = "1",

doi = "10.1007/s11263-007-0095-3",

language = "English",

volume = "77",

pages = "259--289",

journal = "International Journal of Computer Vision",

issn = "0920-5691",

publisher = "Springer",

number = "1-3",

}

TY - JOUR

T1 - Robust object detection with interleaved categorization and segmentation

AU - Leibe, B.

AU - Leonardis, A.

AU - Schiele, B.

PY - 2008/5/1

Y1 - 2008/5/1

N2 - This paper presents a novel method for detecting and localizing objects of a visual category in cluttered real-world scenes. Our approach considers object categorization and figure-ground segmentation as two interleaved processes that closely collaborate towards a common goal. As shown in our work, the tight coupling between those two processes allows them to benefit from each other and improve the combined performance. The core part of our approach is a highly flexible learned representation for object shape that can combine the information observed on different training examples in a probabilistic extension of the Generalized Hough Transform. The resulting approach can detect categorical objects in novel images and automatically infer a probabilistic segmentation from the recognition result. This segmentation is then in turn used to again improve recognition by allowing the system to focus its efforts on object pixels and to discard misleading influences from the background. Moreover, the information from where in the image a hypothesis draws its support is employed in an MDL based hypothesis verification stage to resolve ambiguities between overlapping hypotheses and factor out the effects of partial occlusion. An extensive evaluation on several large data sets shows that the proposed system is applicable to a range of different object categories, including both rigid and articulated objects. In addition, its flexible representation allows it to achieve competitive object detection performance already from training sets that are between one and two orders of magnitude smaller than those used in comparable systems.

AB - This paper presents a novel method for detecting and localizing objects of a visual category in cluttered real-world scenes. Our approach considers object categorization and figure-ground segmentation as two interleaved processes that closely collaborate towards a common goal. As shown in our work, the tight coupling between those two processes allows them to benefit from each other and improve the combined performance. The core part of our approach is a highly flexible learned representation for object shape that can combine the information observed on different training examples in a probabilistic extension of the Generalized Hough Transform. The resulting approach can detect categorical objects in novel images and automatically infer a probabilistic segmentation from the recognition result. This segmentation is then in turn used to again improve recognition by allowing the system to focus its efforts on object pixels and to discard misleading influences from the background. Moreover, the information from where in the image a hypothesis draws its support is employed in an MDL based hypothesis verification stage to resolve ambiguities between overlapping hypotheses and factor out the effects of partial occlusion. An extensive evaluation on several large data sets shows that the proposed system is applicable to a range of different object categories, including both rigid and articulated objects. In addition, its flexible representation allows it to achieve competitive object detection performance already from training sets that are between one and two orders of magnitude smaller than those used in comparable systems.

UR - http://www.scopus.com/inward/record.url?partnerID=yv4JPVwI&eid=2-s2.0-39749124915&md5=32d2efa6cf9946c35059339d560ad3c6

U2 - 10.1007/s11263-007-0095-3

DO - 10.1007/s11263-007-0095-3

M3 - Article

AN - SCOPUS:39749124915

SN - 0920-5691

VL - 77

SP - 259

EP - 289

JO - International Journal of Computer Vision

JF - International Journal of Computer Vision

IS - 1-3

ER -

Robust object detection with interleaved categorization and segmentation

Abstract

Bibliographical note

Access to Document

Fingerprint

Cite this