与有偏见的委员会学习辩护的分类器

论文标题

与有偏见的委员会学习辩护的分类器

Learning Debiased Classifier with Biased Committee

论文作者

Kim, Nayeong, Hwang, Sehyun, Ahn, Sungsoo, Park, Jaesik, Kwak, Suha

论文摘要

神经网络倾向于在训练数据的主要部分中表现出的类和潜在属性之间的虚假相关性，这破坏了它们的概括能力。我们提出了一种新的方法，用于培训没有虚假属性标签的伪造分类器。关键思想是将分类器委员会作为一个辅助模块，该模块可以识别出偏见的数据，即没有伪造相关性的数据，并在训练主要分类器时向它们分配了很大的权重。该委员会被学到了一个自举的合奏，因此大多数分类器都具有偏见和多样化，并且故意无法相应地预测偏见的偏见。因此，预测难度委员会的共识为识别和加权偏见冲突数据提供了可靠的提示。此外，该委员会还接受了从主要分类器转移的知识的培训，因此随着培训的进展，它逐渐与主要分类器一起变得偏见，并强调更困难的数据。在五个现实世界数据集上，我们的方法在没有像我们这样的虚假属性标签上优于先前的艺术，甚至超过偶尔依靠偏见标签的艺术。

Neural networks are prone to be biased towards spurious correlations between classes and latent attributes exhibited in a major portion of training data, which ruins their generalization capability. We propose a new method for training debiased classifiers with no spurious attribute label. The key idea is to employ a committee of classifiers as an auxiliary module that identifies bias-conflicting data, i.e., data without spurious correlation, and assigns large weights to them when training the main classifier. The committee is learned as a bootstrapped ensemble so that a majority of its classifiers are biased as well as being diverse, and intentionally fail to predict classes of bias-conflicting data accordingly. The consensus within the committee on prediction difficulty thus provides a reliable cue for identifying and weighting bias-conflicting data. Moreover, the committee is also trained with knowledge transferred from the main classifier so that it gradually becomes debiased along with the main classifier and emphasizes more difficult data as training progresses. On five real-world datasets, our method outperforms prior arts using no spurious attribute label like ours and even surpasses those relying on bias labels occasionally.

下载PDF全文

下载文献需遵守相关版权规定

论文标题