An experimental study of class imbalance in federated learning

Chenguang Xiao; Shuo Wang

doi:10.1109/SSCI50451.2021.9660072

Computer Science

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

Federated learning is a distributed machine learning paradigm that trains a global model for prediction from several local models held at clients, while preserving the privacy of local data. Class imbalance is believed to be one of the factors that degrade global model performance. However, there has been very little research on whether and how class imbalance affects global performance across different imbalance scenarios. Class imbalance in federated learning is much more complex than in traditional non-distributed machine learning, because the class imbalance situation can differ from one local client to another. Class imbalance therefore needs to be redefined for distributed learning environments so that corresponding solutions can be proposed. In this paper, we first propose two new metrics to define class imbalance: the global class imbalance degree (MID) and the local difference of class imbalance among clients (WCS). Under this definition, class imbalance is categorized into four scenarios. We then conduct extensive experiments to analyze the impact of class imbalance on global performance in these scenarios. Our results show that a higher MID and a larger WCS both degrade the performance of the global model more severely. In addition, WCS is shown to slow the convergence of the global model by misdirecting the optimization.
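
The abstract distinguishes a global view of class imbalance (over the pooled data of all clients) from a local view (how far individual clients deviate from that pooled picture), but it does not give the formulas for MID or WCS. The Python sketch below illustrates only the general idea, using stand-in measures that are assumptions and not the paper's definitions: one minus the normalized entropy of the pooled labels for the global view, and one minus the size-weighted mean cosine similarity between each client's class counts and the pooled counts for the local view. The function names `pooled_counts`, `global_imbalance`, and `local_difference` are hypothetical.

```python
import math


def pooled_counts(client_counts):
    """Sum per-class label counts over all clients."""
    return [sum(col) for col in zip(*client_counts)]


def global_imbalance(client_counts):
    """Stand-in global measure (assumption, not the paper's MID).

    Returns 1 - normalized entropy of the pooled class distribution:
    0 when the pooled classes are perfectly balanced, 1 when only one
    class is present. Requires at least two classes.
    """
    pooled = pooled_counts(client_counts)
    total = sum(pooled)
    probs = [c / total for c in pooled if c > 0]
    entropy = -sum(p * math.log(p) for p in probs)
    return 1.0 - entropy / math.log(len(pooled))


def cosine(u, v):
    """Cosine similarity of two non-zero count vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def local_difference(client_counts):
    """Stand-in local measure (assumption, not the paper's WCS).

    Returns 1 - the size-weighted mean cosine similarity between each
    client's class counts and the pooled counts: 0 when every client
    mirrors the pooled distribution. Assumes no client is empty.
    """
    pooled = pooled_counts(client_counts)
    total = sum(pooled)
    weighted = sum(sum(c) / total * cosine(c, pooled) for c in client_counts)
    return 1.0 - weighted


if __name__ == "__main__":
    # Each inner list holds one client's label counts per class.
    examples = {
        "balanced, identical clients": [[50, 50], [50, 50]],
        "balanced pool, disjoint clients": [[100, 0], [0, 100]],
        "imbalanced pool, identical clients": [[90, 10], [90, 10]],
    }
    for name, counts in examples.items():
        print(name, global_imbalance(counts), local_difference(counts))
```

The second example shows why the two views need to be measured separately: the pooled data is perfectly balanced, yet each client sees only one class, so the global measure is zero while the local measure is not.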

Original language	English
Title of host publication	2021 IEEE Symposium Series on Computational Intelligence (SSCI)
Publisher	Institute of Electrical and Electronics Engineers (IEEE)
Number of pages	7
ISBN (Electronic)	9781728190488
ISBN (Print)	9781728190495 (PoD)
DOIs	https://doi.org/10.1109/SSCI50451.2021.9660072
Publication status	Published - 24 Jan 2022
Event	IEEE Symposium Series on Computational Intelligence (IEEE SSCI 2021) - Orlando, United States. Duration: 5 Dec 2021 → 7 Dec 2021

Publication series

Name	IEEE Symposium Series on Computational Intelligence
Publisher	IEEE
ISSN (Electronic)	2770-0097

Conference

Conference	IEEE Symposium Series on Computational Intelligence (IEEE SSCI 2021)
Abbreviated title	IEEE SSCI 2021
Country/Territory	United States
City	Orlando
Period	5 Dec 2021 → 7 Dec 2021

Bibliographical note

Publisher Copyright:
© 2021 IEEE.

Keywords

class imbalance
federated learning
multiclass classification

ASJC Scopus subject areas

Artificial Intelligence
Decision Sciences (miscellaneous)
Control and Optimization
Safety, Risk, Reliability and Quality
Computer Science Applications

Access to Document

10.1109/SSCI50451.2021.9660072

C. Xiao and S. Wang, "An Experimental Study of Class Imbalance in Federated Learning," 2021 IEEE Symposium Series on Computational Intelligence (SSCI), 2021, pp. 1-7, doi: 10.1109/SSCI50451.2021.9660072. © 2021 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
Accepted author manuscript, 255 KB. Licence: None (all rights reserved)

Cite this

@inproceedings{d8b7d6581ce64254bf75d766b5176877,

title = "An experimental study of class imbalance in federated learning",

abstract = "Federated learning is a distributed machine learning paradigm that trains a global model for prediction based on several local models at clients while local data privacy is preserved. Class imbalance is believed to be one of the factors that degrades the global model performance. However, there has been very little research on if and how class imbalance can affect the global performance in various imbalance scenarios. Class imbalance in federated learning is much more complex than that in traditional non-distributed machine learning, due to different class imbalance situations at local clients. Class imbalance needs to be re-defined in distributed learning environments, so that corresponding solutions can be proposed. In this paper, first, we propose two new metrics to define class imbalance – the global class imbalance degree (MID) and the local difference of class imbalance among clients (WCS). Class imbalance is categorized into four scenarios under the definition. Then, we conduct extensive experiments to analyze the impact of class imbalance on the global performance in various scenarios. Our results show that a higher MID and a larger WCS degrade more the performance of the global model. Besides, WCS is shown to slow down the convergence of the global model by misdirecting the optimization.",

keywords = "class imbalance, federated learning, multiclass classification",

author = "Chenguang Xiao and Shuo Wang",

note = "Publisher Copyright: {\textcopyright} 2021 IEEE.; IEEE Symposium Series on Computational Intelligence (IEEE SSCI 2021), IEEE SSCI 2021 ; Conference date: 05-12-2021 Through 07-12-2021",

year = "2022",

month = jan,

day = "24",

doi = "10.1109/SSCI50451.2021.9660072",

language = "English",

isbn = "9781728190495 (PoD)",

series = "IEEE Symposium Series on Computational Intelligence",

publisher = "Institute of Electrical and Electronics Engineers (IEEE)",

booktitle = "2021 IEEE Symposium Series on Computational Intelligence (SSCI)",

}

Xiao, C & Wang, S 2022, An experimental study of class imbalance in federated learning. in 2021 IEEE Symposium Series on Computational Intelligence (SSCI)., 9660072, IEEE Symposium Series on Computational Intelligence, Institute of Electrical and Electronics Engineers (IEEE), IEEE Symposium Series on Computational Intelligence (IEEE SSCI 2021), Orlando, Florida, United States, 5/12/21. https://doi.org/10.1109/SSCI50451.2021.9660072

An experimental study of class imbalance in federated learning. / Xiao, Chenguang; Wang, Shuo.
2021 IEEE Symposium Series on Computational Intelligence (SSCI). Institute of Electrical and Electronics Engineers (IEEE), 2022. 9660072 (IEEE Symposium Series on Computational Intelligence).

TY - GEN

T1 - An experimental study of class imbalance in federated learning

AU - Xiao, Chenguang

AU - Wang, Shuo

PY - 2022/1/24

Y1 - 2022/1/24

AB - Federated learning is a distributed machine learning paradigm that trains a global model for prediction based on several local models at clients while local data privacy is preserved. Class imbalance is believed to be one of the factors that degrades the global model performance. However, there has been very little research on if and how class imbalance can affect the global performance in various imbalance scenarios. Class imbalance in federated learning is much more complex than that in traditional non-distributed machine learning, due to different class imbalance situations at local clients. Class imbalance needs to be re-defined in distributed learning environments, so that corresponding solutions can be proposed. In this paper, first, we propose two new metrics to define class imbalance – the global class imbalance degree (MID) and the local difference of class imbalance among clients (WCS). Class imbalance is categorized into four scenarios under the definition. Then, we conduct extensive experiments to analyze the impact of class imbalance on the global performance in various scenarios. Our results show that a higher MID and a larger WCS degrade more the performance of the global model. Besides, WCS is shown to slow down the convergence of the global model by misdirecting the optimization.

KW - class imbalance

KW - federated learning

KW - multiclass classification

UR - https://arxiv.org/abs/2109.04094

UR - http://www.scopus.com/inward/record.url?scp=85125790648&partnerID=8YFLogxK

U2 - 10.1109/SSCI50451.2021.9660072

DO - 10.1109/SSCI50451.2021.9660072

M3 - Conference contribution

SN - 9781728190495 (PoD)

T3 - IEEE Symposium Series on Computational Intelligence

BT - 2021 IEEE Symposium Series on Computational Intelligence (SSCI)

PB - Institute of Electrical and Electronics Engineers (IEEE)

T2 - IEEE Symposium Series on Computational Intelligence (IEEE SSCI 2021)

Y2 - 5 December 2021 through 7 December 2021

ER -
