Stochastic proximal AUC maximization

Yunwen Lei; Yiming Ying

Stochastic proximal AUC maximization

Yunwen Lei, Yiming Ying

Computer Science

Research output: Contribution to journal › Article › peer-review

73 Downloads (Pure)

Abstract

In this paper we consider the problem of maximizing the Area under the ROC curve (AUC) which is a widely used performance metric in imbalanced classification and anomaly detection. Due to the pairwise nonlinearity of the objective function, classical SGD algorithms do not apply to the task of AUC maximization. We propose a novel stochastic proximal algorithm for AUC maximization which is scalable to large scale streaming data. Our algorithm can accommodate general penalty terms and is easy to implement with favorable O(d) space and per-iteration time complexities. We establish a high-probability convergence rate O(1/√T) for the general convex setting, and improve it to a fast convergence rate O(1/T) for the cases of strongly convex regularizers and no regularization term (without strong convexity). Our proof does not need the uniform boundedness assumption on the loss function or the iterates which is more fidelity to the practice. Finally, we perform extensive experiments over various benchmark data sets from real-world application domains which show the superior performance of our algorithm over the existing AUC maximization algorithms.

Original language	English
Article number	61
Number of pages	45
Journal	Journal of Machine Learning Research
Volume	22
Publication status	Published - 28 Feb 2021

Keywords

AUC maximization
imbalanced classification
proximal operator
stochastic gradient descent

Access to Document

LeiY2021StochasticFinal published version, 1.21 MBLicence: Creative Commons: Attribution (CC BY)

https://jmlr.org/papers/volume22/19-418/19-418.pdfLicence: Creative Commons: Attribution (CC BY)

Cite this

@article{a4c07eba82e742f09d984f9426f6824b,

title = "Stochastic proximal AUC maximization",

abstract = "In this paper we consider the problem of maximizing the Area under the ROC curve (AUC) which is a widely used performance metric in imbalanced classification and anomaly detection. Due to the pairwise nonlinearity of the objective function, classical SGD algorithms do not apply to the task of AUC maximization. We propose a novel stochastic proximal algorithm for AUC maximization which is scalable to large scale streaming data. Our algorithm can accommodate general penalty terms and is easy to implement with favorable O(d) space and per-iteration time complexities. We establish a high-probability convergence rate O(1/√T) for the general convex setting, and improve it to a fast convergence rate O(1/T) for the cases of strongly convex regularizers and no regularization term (without strong convexity). Our proof does not need the uniform boundedness assumption on the loss function or the iterates which is more fidelity to the practice. Finally, we perform extensive experiments over various benchmark data sets from real-world application domains which show the superior performance of our algorithm over the existing AUC maximization algorithms. ",

keywords = "AUC maximization, imbalanced classification, proximal operator, stochastic gradient descent",

author = "Yunwen Lei and Yiming Ying",

year = "2021",

month = feb,

day = "28",

language = "English",

volume = "22",

journal = "Journal of Machine Learning Research",

issn = "1532-4435",

publisher = "Journal of Machine Learning Research",

}

TY - JOUR

T1 - Stochastic proximal AUC maximization

AU - Lei, Yunwen

AU - Ying, Yiming

PY - 2021/2/28

Y1 - 2021/2/28

N2 - In this paper we consider the problem of maximizing the Area under the ROC curve (AUC) which is a widely used performance metric in imbalanced classification and anomaly detection. Due to the pairwise nonlinearity of the objective function, classical SGD algorithms do not apply to the task of AUC maximization. We propose a novel stochastic proximal algorithm for AUC maximization which is scalable to large scale streaming data. Our algorithm can accommodate general penalty terms and is easy to implement with favorable O(d) space and per-iteration time complexities. We establish a high-probability convergence rate O(1/√T) for the general convex setting, and improve it to a fast convergence rate O(1/T) for the cases of strongly convex regularizers and no regularization term (without strong convexity). Our proof does not need the uniform boundedness assumption on the loss function or the iterates which is more fidelity to the practice. Finally, we perform extensive experiments over various benchmark data sets from real-world application domains which show the superior performance of our algorithm over the existing AUC maximization algorithms.

AB - In this paper we consider the problem of maximizing the Area under the ROC curve (AUC) which is a widely used performance metric in imbalanced classification and anomaly detection. Due to the pairwise nonlinearity of the objective function, classical SGD algorithms do not apply to the task of AUC maximization. We propose a novel stochastic proximal algorithm for AUC maximization which is scalable to large scale streaming data. Our algorithm can accommodate general penalty terms and is easy to implement with favorable O(d) space and per-iteration time complexities. We establish a high-probability convergence rate O(1/√T) for the general convex setting, and improve it to a fast convergence rate O(1/T) for the cases of strongly convex regularizers and no regularization term (without strong convexity). Our proof does not need the uniform boundedness assumption on the loss function or the iterates which is more fidelity to the practice. Finally, we perform extensive experiments over various benchmark data sets from real-world application domains which show the superior performance of our algorithm over the existing AUC maximization algorithms.

KW - AUC maximization

KW - imbalanced classification

KW - proximal operator

KW - stochastic gradient descent

UR - http://www.scopus.com/inward/record.url?scp=85105893722&partnerID=8YFLogxK

M3 - Article

SN - 1532-4435

VL - 22

JO - Journal of Machine Learning Research

JF - Journal of Machine Learning Research

M1 - 61

ER -

Stochastic proximal AUC maximization

Abstract

Keywords

Access to Document

Fingerprint

Cite this