Stability and generalization of stochastic gradient methods for minimax problems

Yunwen Lei; Zhenhuan Yang; Tianbao Yang; Yiming Ying

Stability and generalization of stochastic gradient methods for minimax problems

Yunwen Lei, Zhenhuan Yang, Tianbao Yang, Yiming Ying

Computer Science

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

73 Downloads (Pure)

Abstract

Many machine learning problems can be formulated as minimax problems such as Generative Adversarial Networks (GANs), AUC maximization and robust estimation, to mention but a few. A substantial amount of studies are devoted to studying the convergence behavior of their stochastic gradient-type algorithms. In contrast, there is relatively little work on their generalization, i.e., how the learning models built from training examples would behave on test examples. In this paper, we provide a comprehensive generalization analysis of stochastic gradient methods for minimax problems under both convex-concave and nonconvex-nonconcave cases through the lens of algorithmic stability. We establish a quantitative connection between stability and several generalization measures both in expectation and with high probability. For the convex-concave setting, our stability analysis shows that stochastic gradient descent ascent attains optimal generalization bounds for both smooth and nonsmooth minimax problems. We also establish generalization bounds for both weakly-convex-weakly-concave and gradient-dominated problems.

Original language	English
Title of host publication	Proceedings of ICML 2021
Editors	Marina Meila, Tong Zhang
Publisher	JMLR
Pages	6175-6186
Number of pages	12
Publication status	Published - 18 Jul 2021
Event	The Thirty-eighth International Conference on Machine Learning - Virtual Duration: 18 Jul 2021 → 24 Jul 2021 https://icml.cc/

Publication series

Name	Proceedings of Machine Learning Research
Volume	139
ISSN (Electronic)	2640-3498

Conference

Conference	The Thirty-eighth International Conference on Machine Learning
Abbreviated title	ICML 2021
Period	18/07/21 → 24/07/21
Internet address	https://icml.cc/

Access to Document

LeiY2021StabilityFinal published version, 904 KBLicence: Creative Commons: Attribution (CC BY)

http://proceedings.mlr.press/v139/lei21a.htmlLicence: Creative Commons: Attribution (CC BY)

Cite this

@inproceedings{2be997cc7d344b7ab7c61d0d73176e5d,

title = "Stability and generalization of stochastic gradient methods for minimax problems",

abstract = "Many machine learning problems can be formulated as minimax problems such as Generative Adversarial Networks (GANs), AUC maximization and robust estimation, to mention but a few. A substantial amount of studies are devoted to studying the convergence behavior of their stochastic gradient-type algorithms. In contrast, there is relatively little work on their generalization, i.e., how the learning models built from training examples would behave on test examples. In this paper, we provide a comprehensive generalization analysis of stochastic gradient methods for minimax problems under both convex-concave and nonconvex-nonconcave cases through the lens of algorithmic stability. We establish a quantitative connection between stability and several generalization measures both in expectation and with high probability. For the convex-concave setting, our stability analysis shows that stochastic gradient descent ascent attains optimal generalization bounds for both smooth and nonsmooth minimax problems. We also establish generalization bounds for both weakly-convex-weakly-concave and gradient-dominated problems.",

author = "Yunwen Lei and Zhenhuan Yang and Tianbao Yang and Yiming Ying",

year = "2021",

month = jul,

day = "18",

language = "English",

series = "Proceedings of Machine Learning Research",

publisher = "JMLR ",

pages = "6175--6186",

editor = "Meila, {Marina } and Zhang, {Tong }",

booktitle = "Proceedings of ICML 2021",

note = "The Thirty-eighth International Conference on Machine Learning , ICML 2021 ; Conference date: 18-07-2021 Through 24-07-2021",

url = "https://icml.cc/",

}

Lei, Y, Yang, Z, Yang, T & Ying, Y 2021, Stability and generalization of stochastic gradient methods for minimax problems. in M Meila & T Zhang (eds), Proceedings of ICML 2021. Proceedings of Machine Learning Research, vol. 139, JMLR , pp. 6175-6186, The Thirty-eighth International Conference on Machine Learning , 18/07/21. <http://proceedings.mlr.press/v139/lei21a.html>

TY - GEN

T1 - Stability and generalization of stochastic gradient methods for minimax problems

AU - Lei, Yunwen

AU - Yang, Zhenhuan

AU - Yang, Tianbao

AU - Ying, Yiming

PY - 2021/7/18

Y1 - 2021/7/18

N2 - Many machine learning problems can be formulated as minimax problems such as Generative Adversarial Networks (GANs), AUC maximization and robust estimation, to mention but a few. A substantial amount of studies are devoted to studying the convergence behavior of their stochastic gradient-type algorithms. In contrast, there is relatively little work on their generalization, i.e., how the learning models built from training examples would behave on test examples. In this paper, we provide a comprehensive generalization analysis of stochastic gradient methods for minimax problems under both convex-concave and nonconvex-nonconcave cases through the lens of algorithmic stability. We establish a quantitative connection between stability and several generalization measures both in expectation and with high probability. For the convex-concave setting, our stability analysis shows that stochastic gradient descent ascent attains optimal generalization bounds for both smooth and nonsmooth minimax problems. We also establish generalization bounds for both weakly-convex-weakly-concave and gradient-dominated problems.

AB - Many machine learning problems can be formulated as minimax problems such as Generative Adversarial Networks (GANs), AUC maximization and robust estimation, to mention but a few. A substantial amount of studies are devoted to studying the convergence behavior of their stochastic gradient-type algorithms. In contrast, there is relatively little work on their generalization, i.e., how the learning models built from training examples would behave on test examples. In this paper, we provide a comprehensive generalization analysis of stochastic gradient methods for minimax problems under both convex-concave and nonconvex-nonconcave cases through the lens of algorithmic stability. We establish a quantitative connection between stability and several generalization measures both in expectation and with high probability. For the convex-concave setting, our stability analysis shows that stochastic gradient descent ascent attains optimal generalization bounds for both smooth and nonsmooth minimax problems. We also establish generalization bounds for both weakly-convex-weakly-concave and gradient-dominated problems.

UR - http://proceedings.mlr.press/pmlr-license-agreement.pdf

UR - https://arxiv.org/abs/2105.03793

M3 - Conference contribution

T3 - Proceedings of Machine Learning Research

SP - 6175

EP - 6186

BT - Proceedings of ICML 2021

A2 - Meila, Marina

A2 - Zhang, Tong

PB - JMLR

T2 - The Thirty-eighth International Conference on Machine Learning

Y2 - 18 July 2021 through 24 July 2021

ER -

Stability and generalization of stochastic gradient methods for minimax problems

Abstract

Publication series

Conference

Access to Document

Fingerprint

Cite this