证明是一致的部分标签学习

论文标题

证明是一致的部分标签学习

Provably Consistent Partial-Label Learning

论文作者

Feng, Lei, Lv, Jiaqi, Han, Bo, Xu, Miao, Niu, Gang, Geng, Xin, An, Bo, Sugiyama, Masashi

论文摘要

部分标签学习（PLL）是一个多类分类问题，每个培训示例都与一组候选标签相关联。尽管在过去的二十年中提出了许多实用的PLL方法，但仍缺乏对这些方法的一致性的理论理解，迄今为止，迄今为止，没有PLL方法的一致性具有候选标签集的生成过程，然后尚不清楚为什么这种方法在特定数据集上可在特定数据集上起作用，并且何时可以通过不同的数据集进行失败。在本文中，我们提出了候选标签集的第一代模型，并开发了两种新颖的PLL方法，这些方法可以证明是一致的，即一种是风险一致的，另一种是分类器的持续性。我们的方法是有利的，因为它们与任何深网或随机优化器都兼容。此外，由于生成模型，我们可以通过测试生成模型是否匹配给定的候选标签集来回答上面的两个问题。基准和现实世界数据集的实验验证了提出的生成模型和两种PLL方法的有效性。

Partial-label learning (PLL) is a multi-class classification problem, where each training example is associated with a set of candidate labels. Even though many practical PLL methods have been proposed in the last two decades, there lacks a theoretical understanding of the consistency of those methods-none of the PLL methods hitherto possesses a generation process of candidate label sets, and then it is still unclear why such a method works on a specific dataset and when it may fail given a different dataset. In this paper, we propose the first generation model of candidate label sets, and develop two novel PLL methods that are guaranteed to be provably consistent, i.e., one is risk-consistent and the other is classifier-consistent. Our methods are advantageous, since they are compatible with any deep network or stochastic optimizer. Furthermore, thanks to the generation model, we would be able to answer the two questions above by testing if the generation model matches given candidate label sets. Experiments on benchmark and real-world datasets validate the effectiveness of the proposed generation model and two PLL methods.

下载PDF全文

下载文献需遵守相关版权规定

论文标题