On the Importance and Applicability of Pre-Training for Federated Learning

Chen, Hong-You; Tu, Cheng-Hao; Li, Ziwei; Shen, Han-Wei; Chao, Wei-Lun

Computer Science > Machine Learning

arXiv:2206.11488 (cs)

[Submitted on 23 Jun 2022 (v1), last revised 23 Mar 2023 (this version, v3)]

Title:On the Importance and Applicability of Pre-Training for Federated Learning

Authors:Hong-You Chen, Cheng-Hao Tu, Ziwei Li, Han-Wei Shen, Wei-Lun Chao

View PDF

Abstract:Pre-training is prevalent in nowadays deep learning to improve the learned model's performance. However, in the literature on federated learning (FL), neural networks are mostly initialized with random weights. These attract our interest in conducting a systematic study to explore pre-training for FL. Across multiple visual recognition benchmarks, we found that pre-training can not only improve FL, but also close its accuracy gap to the counterpart centralized learning, especially in the challenging cases of non-IID clients' data. To make our findings applicable to situations where pre-trained models are not directly available, we explore pre-training with synthetic data or even with clients' data in a decentralized manner, and found that they can already improve FL notably. Interestingly, many of the techniques we explore are complementary to each other to further boost the performance, and we view this as a critical result toward scaling up deep FL for real-world applications. We conclude our paper with an attempt to understand the effect of pre-training on FL. We found that pre-training enables the learned global models under different clients' data conditions to converge to the same loss basin, and makes global aggregation in FL more stable. Nevertheless, pre-training seems to not alleviate local model drifting, a fundamental problem in FL under non-IID data.

Comments:	Accepted to ICLR 2023
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2206.11488 [cs.LG]
	(or arXiv:2206.11488v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2206.11488

Submission history

From: Hong-You Chen [view email]
[v1] Thu, 23 Jun 2022 06:02:33 UTC (12,759 KB)
[v2] Thu, 27 Oct 2022 17:30:15 UTC (15,963 KB)
[v3] Thu, 23 Mar 2023 03:27:40 UTC (13,778 KB)

Computer Science > Machine Learning

Title:On the Importance and Applicability of Pre-Training for Federated Learning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:On the Importance and Applicability of Pre-Training for Federated Learning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators