垂直联合学习的公平有效贡献估值

论文标题

垂直联合学习的公平有效贡献估值

Fair and efficient contribution valuation for vertical federated learning

论文作者

Fan, Zhenan, Fang, Huang, Wang, Xinglu, Zhou, Zirui, Pei, Jian, Friedlander, Michael P., Zhang, Yong

论文摘要

联合学习是一种新兴技术，用于跨分散数据源培训机器学习模型，而无需共享数据。垂直联合学习，也称为基于特征的联合学习，适用于数据源具有相同示例ID但功能集相同的方案。为了确保数据所有者之间的公平性，至关重要的是，客观地评估来自不同数据源的贡献并相应地补偿相应的数据所有者。 Shapley的价值是源自合作游戏理论的公平贡献估值公制。但是，它的直接计算需要在数据源的每种潜在组合上进行广泛的检验模型，从而导致由于多回合联合学习而导致高度高的沟通和计算开销。为了应对这一挑战，我们提出了一个基于经典沙普利价值的贡献评估公制，称为垂直联合沙普利价值（Verfedsv）。我们表明，VerfedSV不仅满足了许多理想的公平特性，而且还可以有效计算。此外，VERFEDSV可以适应同步和异步垂直联合学习算法。理论分析和广泛的实验结果都证明了VerfedSV的公平性，效率，适应性和有效性。

Federated learning is an emerging technology for training machine learning models across decentralized data sources without sharing data. Vertical federated learning, also known as feature-based federated learning, applies to scenarios where data sources have the same sample IDs but different feature sets. To ensure fairness among data owners, it is critical to objectively assess the contributions from different data sources and compensate the corresponding data owners accordingly. The Shapley value is a provably fair contribution valuation metric originating from cooperative game theory. However, its straight-forward computation requires extensively retraining a model on each potential combination of data sources, leading to prohibitively high communication and computation overheads due to multiple rounds of federated learning. To tackle this challenge, we propose a contribution valuation metric called vertical federated Shapley value (VerFedSV) based on the classic Shapley value. We show that VerFedSV not only satisfies many desirable properties of fairness but is also efficient to compute. Moreover, VerFedSV can be adapted to both synchronous and asynchronous vertical federated learning algorithms. Both theoretical analysis and extensive experimental results demonstrate the fairness, efficiency, adaptability, and effectiveness of VerFedSV.

下载PDF全文

下载文献需遵守相关版权规定

论文标题