Dynamic multi-level appearance models and adaptive clustered decision trees for single target tracking

Jingjing Xiao; Rustam Stolkin; Ales Leonardis

doi:10.1016/j.patcog.2017.04.001

Dynamic multi-level appearance models and adaptive clustered decision trees for single target tracking

Jingjing Xiao, Rustam Stolkin, Ales Leonardis

Research output: Contribution to journal › Article › peer-review

6 Citations (Scopus)

225 Downloads (Pure)

Abstract

This paper presents a tracking algorithm for arbitrary objects in challenging video sequences. Targets are modelled at three different levels of granularity (pixel, parts and bounding box levels), which are cross-constrained to enable robust model relearning. The main contribution is an adaptive clustered decision tree method which dynamically selects the minimum combination of features necessary to sufficiently represent each target part at each frame, thereby providing robustness with computational efficiency. The adaptive clustered decision tree is used in two separate ways: firstly for parts level matching between successive frames; secondly to select the best candidate image regions for learning new parts of the target. We test the tracker using two different tracking benchmarks (VOT2013-2014 and CVPR2013 tracking challenges), based on two different test methodologies, and show it to be more robust than the state-of-the-art methods from both of those tracking challenges, while also offering competitive tracking precision. Additionally, we evaluate the contribution of each key component of the tracker to overall performance; test the sensitivity of the tracker under different initialization conditions; investigate the effect of using features in different orders within the decision trees; illustrate the flexibility of the method for handling arbitrary kinds of features, by showing how it easily extends to handle RGB-D data.

Original language	English
Pages (from-to)	169-183
Journal	Pattern Recognition
Volume	69
Early online date	14 Apr 2017
DOIs	https://doi.org/10.1016/j.patcog.2017.04.001
Publication status	Published - 1 Sept 2017

Keywords

Single target tracking
Adaptive clustered decision trees
Multi-level appearance models

Access to Document

10.1016/j.patcog.2017.04.001Licence: None: All rights reserved

Xiao_et_al_Dynamic_multi-level_Pattern_Recognition_2017Accepted author manuscript, 1.69 MBLicence: Creative Commons: Attribution-NonCommercial-NoDerivs (CC BY-NC-ND)

http://www.sciencedirect.com/science/article/pii/S0031320317301486Licence: None: All rights reserved

Cite this

@article{6944abfa37f94e01afe1be5c19544833,

title = "Dynamic multi-level appearance models and adaptive clustered decision trees for single target tracking",

abstract = "This paper presents a tracking algorithm for arbitrary objects in challenging video sequences. Targets are modelled at three different levels of granularity (pixel, parts and bounding box levels), which are cross-constrained to enable robust model relearning. The main contribution is an adaptive clustered decision tree method which dynamically selects the minimum combination of features necessary to sufficiently represent each target part at each frame, thereby providing robustness with computational efficiency. The adaptive clustered decision tree is used in two separate ways: firstly for parts level matching between successive frames; secondly to select the best candidate image regions for learning new parts of the target. We test the tracker using two different tracking benchmarks (VOT2013-2014 and CVPR2013 tracking challenges), based on two different test methodologies, and show it to be more robust than the state-of-the-art methods from both of those tracking challenges, while also offering competitive tracking precision. Additionally, we evaluate the contribution of each key component of the tracker to overall performance; test the sensitivity of the tracker under different initialization conditions; investigate the effect of using features in different orders within the decision trees; illustrate the flexibility of the method for handling arbitrary kinds of features, by showing how it easily extends to handle RGB-D data.",

keywords = "Single target tracking, Adaptive clustered decision trees, Multi-level appearance models",

author = "Jingjing Xiao and Rustam Stolkin and Ales Leonardis",

year = "2017",

month = sep,

day = "1",

doi = "10.1016/j.patcog.2017.04.001",

language = "English",

volume = "69",

pages = "169--183",

journal = "Pattern Recognition",

issn = "0031-3203",

publisher = "Elsevier",

}

TY - JOUR

T1 - Dynamic multi-level appearance models and adaptive clustered decision trees for single target tracking

AU - Xiao, Jingjing

AU - Stolkin, Rustam

AU - Leonardis, Ales

PY - 2017/9/1

Y1 - 2017/9/1

N2 - This paper presents a tracking algorithm for arbitrary objects in challenging video sequences. Targets are modelled at three different levels of granularity (pixel, parts and bounding box levels), which are cross-constrained to enable robust model relearning. The main contribution is an adaptive clustered decision tree method which dynamically selects the minimum combination of features necessary to sufficiently represent each target part at each frame, thereby providing robustness with computational efficiency. The adaptive clustered decision tree is used in two separate ways: firstly for parts level matching between successive frames; secondly to select the best candidate image regions for learning new parts of the target. We test the tracker using two different tracking benchmarks (VOT2013-2014 and CVPR2013 tracking challenges), based on two different test methodologies, and show it to be more robust than the state-of-the-art methods from both of those tracking challenges, while also offering competitive tracking precision. Additionally, we evaluate the contribution of each key component of the tracker to overall performance; test the sensitivity of the tracker under different initialization conditions; investigate the effect of using features in different orders within the decision trees; illustrate the flexibility of the method for handling arbitrary kinds of features, by showing how it easily extends to handle RGB-D data.

AB - This paper presents a tracking algorithm for arbitrary objects in challenging video sequences. Targets are modelled at three different levels of granularity (pixel, parts and bounding box levels), which are cross-constrained to enable robust model relearning. The main contribution is an adaptive clustered decision tree method which dynamically selects the minimum combination of features necessary to sufficiently represent each target part at each frame, thereby providing robustness with computational efficiency. The adaptive clustered decision tree is used in two separate ways: firstly for parts level matching between successive frames; secondly to select the best candidate image regions for learning new parts of the target. We test the tracker using two different tracking benchmarks (VOT2013-2014 and CVPR2013 tracking challenges), based on two different test methodologies, and show it to be more robust than the state-of-the-art methods from both of those tracking challenges, while also offering competitive tracking precision. Additionally, we evaluate the contribution of each key component of the tracker to overall performance; test the sensitivity of the tracker under different initialization conditions; investigate the effect of using features in different orders within the decision trees; illustrate the flexibility of the method for handling arbitrary kinds of features, by showing how it easily extends to handle RGB-D data.

KW - Single target tracking

KW - Adaptive clustered decision trees

KW - Multi-level appearance models

U2 - 10.1016/j.patcog.2017.04.001

DO - 10.1016/j.patcog.2017.04.001

M3 - Article

SN - 0031-3203

VL - 69

SP - 169

EP - 183

JO - Pattern Recognition

JF - Pattern Recognition

ER -

Dynamic multi-level appearance models and adaptive clustered decision trees for single target tracking

Abstract

Keywords

Access to Document

Fingerprint

Cite this