MadFed: enhancing federated learning with marginal-data model fusion
2023 (English)In: IEEE Access, E-ISSN 2169-3536, Vol. 11, p. 102669-102680
Article in journal (Refereed) Published
Abstract [en]
As the demand for intelligent applications at the network edge grows, so does the need for effective federated learning (FL) techniques. However, FL often relies on non-identically and non-independently distributed local datasets across end devices, which could result in considerable performance degradation. Prior solutions, such as model-driven approaches based on knowledge distillation, meta-learning, and transfer learning, have provided some reprieve. However, their performance suffers under heterogeneous local datasets and highly skewed data distributions. To address these challenges, this study introduces the MArginal Data fusion FEDerated Learning (MadFed) approach, a groundbreaking fusion of model- and data-driven methodologies. By utilizing marginal data, MadFed mitigates data distribution skewness, improves the maximum achievable accuracy, and reduces communication costs. Furthermore, the study demonstrates that the fusion of marginal data can significantly improve performance even with minimal data entries, such as a single entry. For instance, it provides up to a 15.4% accuracy increase and 70.4% communication cost savings when combined with established model-driven methodologies. Conversely, relying solely on these model-driven methodologies can result in poor performance, especially with highly skewed datasets. Significantly, MadFed extends its effectiveness across various FL algorithms and offers a unique method to augment label sets of end devices, thereby enhancing the utility and applicability of federated learning in real-world scenarios. The proposed approach is not only efficient but also adaptable and versatile, promising broader application and potential for widespread adoption in the field.
Place, publisher, year, edition, pages
IEEE, 2023. Vol. 11, p. 102669-102680
Keywords [en]
Computational modeling, Costs, Data integration, Data models, Edge computing, Edge Computing, Federated learning, Federated learning, Performance evaluation, Performance evaluation, Training
National Category
Computer Sciences
Identifiers
URN: urn:nbn:se:umu:diva-214778DOI: 10.1109/ACCESS.2023.3315654Scopus ID: 2-s2.0-85171574845OAI: oai:DiVA.org:umu-214778DiVA, id: diva2:1801568
Funder
Wallenberg AI, Autonomous Systems and Software Program (WASP)Swedish National Infrastructure for Computing (SNIC)Knut and Alice Wallenberg Foundation2023-10-022023-10-022023-10-02Bibliographically approved