VaB-AL: incorporating class imbalance and difficulty with variational Bayes for active learning

Jongwon Choi; Kwang Moo Yi; Jihoon Kim; Jinho Choo; Byoungjip Kim; Jinyeop Chang; Youngjune Gwon; Hyung Jin Chang

doi:10.1109/CVPR46437.2021.00668

Computer Science

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

Active Learning for discriminative models has largely been studied with a focus on individual samples, with less emphasis on how classes are distributed or which classes are hard to deal with. In this work, we show that this is harmful. We propose a method based on Bayes’ rule that naturally incorporates class imbalance into the Active Learning framework. We derive that three terms should be considered together when estimating the probability of a classifier making a mistake for a given sample: i) the probability of mislabelling a class, ii) the likelihood of the data given a predicted class, and iii) the prior probability on the abundance of a predicted class. Implementing these terms requires a generative model and an intractable likelihood estimation. Therefore, we train a Variational Auto Encoder (VAE) for this purpose. To further tie the VAE with the classifier and facilitate VAE training, we use the classifier’s deep feature representations as input to the VAE. By considering all three probabilities, and especially the data imbalance, we can substantially improve the potential of existing methods under a limited data budget. We show that our method can be applied to classification tasks on multiple different datasets, including one that is a real-world dataset with heavy data imbalance, significantly outperforming the state of the art.
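As a rough illustration of how the three terms named in the abstract could combine, the sketch below applies Bayes' rule to turn a per-class likelihood and a class prior into a posterior over predicted classes, then weights it by a per-class error rate. This is a minimal sketch, not the paper's implementation: it assumes the mistake probability depends on the sample only through the predicted class, and the function names `mistake_probability` and `select_for_labelling` are hypothetical. In the paper the likelihood is estimated with a VAE on the classifier's features; here it is simply passed in as an array.

```python
import numpy as np


def mistake_probability(p_wrong_given_class, log_lik_given_class, class_prior):
    """Score each sample by an estimated probability of misclassification.

    Assumption (not taken from the paper): P(mistake | y, x) ~= P(mistake | y),
    so that P(mistake | x) = sum_y P(mistake | y) * P(y | x).
    """
    p_wrong = np.asarray(p_wrong_given_class, dtype=float)   # shape (C,)
    log_lik = np.asarray(log_lik_given_class, dtype=float)   # shape (N, C)
    prior = np.asarray(class_prior, dtype=float)             # shape (C,)

    # Bayes' rule in log space: log p(x | y) + log P(y), normalised over classes.
    log_joint = log_lik + np.log(prior)
    log_joint -= log_joint.max(axis=1, keepdims=True)  # numerical stability
    posterior = np.exp(log_joint)
    posterior /= posterior.sum(axis=1, keepdims=True)

    # Expected per-class error rate under the posterior, one score per sample.
    return posterior @ p_wrong


def select_for_labelling(scores, budget):
    """Return indices of the `budget` samples with the highest scores."""
    return np.argsort(-np.asarray(scores))[:budget]


if __name__ == "__main__":
    # Two classes: a common, easy one and a rare, hard one (made-up numbers).
    p_wrong = [0.1, 0.5]
    prior = [0.9, 0.1]
    log_lik = np.log([[0.2, 0.2], [0.9, 0.1]])
    scores = mistake_probability(p_wrong, log_lik, prior)
    print(scores, select_for_labelling(scores, budget=1))
```

Working in log space keeps the normalisation stable when likelihoods are very small, which is typical for density estimates over high-dimensional features.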

Original language	English
Title of host publication	2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Publisher	IEEE
Pages	6745-6754
Number of pages	10
ISBN (Electronic)	9781665445092
ISBN (Print)	9781665445108
DOIs	https://doi.org/10.1109/CVPR46437.2021.00668
Publication status	Published - 2 Nov 2021
Event	2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition - Nashville, United States Duration: 20 Jun 2021 → 25 Jun 2021

Publication series

Name	Proceedings. IEEE Computer Society Conference on Computer Vision and Pattern Recognition
Publisher	IEEE
ISSN (Print)	1063-6919
ISSN (Electronic)	2575-7075

Conference

Conference	2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition
Abbreviated title	CVPR 2021
Country/Territory	United States
City	Nashville
Period	20/06/21 → 25/06/21

Keywords

training
learning systems
computer vision
estimation
object detection
pattern recognition
task analysis

Access to Document

10.1109/CVPR46437.2021.00668
Licence: None: All rights reserved

ChoiJ2021VaBAL
This is the accepted manuscript of J. Choi et al., "VaB-AL: Incorporating Class Imbalance and Difficulty with Variational Bayes for Active Learning," 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2021, pp. 6745-6754, doi: 10.1109/CVPR46437.2021.00668. © 2021 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
Accepted author manuscript, 710 KB
Licence: Other (please specify with Rights Statement)

Cite this

Choi, J., Yi, K. M., Kim, J., Choo, J., Kim, B., Chang, J., Gwon, Y., & Chang, H. J. (2021). VaB-AL: incorporating class imbalance and difficulty with variational Bayes for active learning. In 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (pp. 6745-6754). Article 9577648 (Proceedings. IEEE Computer Society Conference on Computer Vision and Pattern Recognition). IEEE. https://doi.org/10.1109/CVPR46437.2021.00668

@inproceedings{2108b3856a164a16aa4203413809f20e,
title = "VaB-AL: incorporating class imbalance and difficulty with variational Bayes for active learning",
keywords = "training, learning systems, computer vision, estimation, object detection, pattern recognition, task analysis",
author = "Jongwon Choi and Yi, {Kwang Moo} and Jihoon Kim and Jinho Choo and Byoungjip Kim and Jinyeop Chang and Youngjune Gwon and Chang, {Hyung Jin}",
year = "2021",
month = nov,
day = "2",
doi = "10.1109/CVPR46437.2021.00668",
language = "English",
isbn = "9781665445108",
series = "Proceedings. IEEE Computer Society Conference on Computer Vision and Pattern Recognition",
publisher = "IEEE",
pages = "6745--6754",
booktitle = "2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)",
note = "2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2021 ; Conference date: 20-06-2021 Through 25-06-2021",
}

Choi, J, Yi, KM, Kim, J, Choo, J, Kim, B, Chang, J, Gwon, Y & Chang, HJ 2021, VaB-AL: incorporating class imbalance and difficulty with variational Bayes for active learning. in 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)., 9577648, Proceedings. IEEE Computer Society Conference on Computer Vision and Pattern Recognition, IEEE, pp. 6745-6754, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Nashville, Tennessee, United States, 20/06/21. https://doi.org/10.1109/CVPR46437.2021.00668

VaB-AL: incorporating class imbalance and difficulty with variational Bayes for active learning. / Choi, Jongwon; Yi, Kwang Moo; Kim, Jihoon et al.
2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). IEEE, 2021. p. 6745-6754 9577648 (Proceedings. IEEE Computer Society Conference on Computer Vision and Pattern Recognition).

TY - GEN
T1 - VaB-AL
T2 - 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition
AU - Choi, Jongwon
AU - Yi, Kwang Moo
AU - Kim, Jihoon
AU - Choo, Jinho
AU - Kim, Byoungjip
AU - Chang, Jinyeop
AU - Gwon, Youngjune
AU - Chang, Hyung Jin
PY - 2021/11/2
Y1 - 2021/11/2
AB - Active Learning for discriminative models has largely been studied with the focus on individual samples, with less emphasis on how classes are distributed or which classes are hard to deal with. In this work, we show that this is harmful. We propose a method based on the Bayes’ rule, that can naturally incorporate class imbalance into the Active Learning framework. We derive that three terms should be considered together when estimating the probability of a classifier making a mistake for a given sample; i) probability of mislabelling a class, ii) likelihood of the data given a predicted class, and iii) the prior probability on the abundance of a predicted class. Implementing these terms requires a generative model and an intractable likelihood estimation. Therefore, we train a Variational Auto Encoder (VAE) for this purpose. To further tie the VAE with the classifier and facilitate VAE training, we use the classifiers’ deep feature representations as input to the VAE. By considering all three probabilities, among them, especially the data imbalance, we can substantially improve the potential of existing methods under limited data budget. We show that our method can be applied to classification tasks on multiple different datasets – including one that is a real-world dataset with heavy data imbalance – significantly outperforming the state of the art.

KW - training
KW - learning systems
KW - computer vision
KW - estimation
KW - object detection
KW - pattern recognition
KW - task analysis
UR - https://ieeexplore.ieee.org/xpl/conhome/1000147/all-proceedings
U2 - 10.1109/CVPR46437.2021.00668
DO - 10.1109/CVPR46437.2021.00668
M3 - Conference contribution
SN - 9781665445108
T3 - Proceedings. IEEE Computer Society Conference on Computer Vision and Pattern Recognition
SP - 6745
EP - 6754
BT - 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
PB - IEEE
Y2 - 20 June 2021 through 25 June 2021
ER -

Choi J, Yi KM, Kim J, Choo J, Kim B, Chang J et al. VaB-AL: incorporating class imbalance and difficulty with variational Bayes for active learning. In 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). IEEE. 2021. p. 6745-6754. 9577648. (Proceedings. IEEE Computer Society Conference on Computer Vision and Pattern Recognition). doi: 10.1109/CVPR46437.2021.00668
