论文标题
学习预测半监督的持续学习
Learning to Predict Gradients for Semi-Supervised Continual Learning
论文作者
论文摘要
机器智能的关键挑战是学习新的视觉概念,而不会忘记先前获得的知识。持续学习旨在应对这一挑战。但是,现有的监督持续学习与类似人类的智力之间存在差距,人类能够从标记和未标记的数据中学习。未标记的数据如何影响持续学习过程中的学习和灾难性遗忘仍然未知。为了探索这些问题,我们制定了一种新的半监督连续学习方法,该方法可以通常应用于现有的持续学习模型。具体而言,一个新颖的梯度学习者从标记的数据中学习,以预测未标记数据的梯度。因此,未标记的数据可以适合监督的持续学习方法。与传统的半监督设置不同,我们没有假设学习过程已知与未标记数据相关的基础类别。换句话说,未标记的数据可能与标记的数据非常不同。我们评估了主流持续学习,对抗性持续学习和半监督学习任务的建议方法。所提出的方法在连续学习环境中实现了分类精度和向后转移的最新性能,同时在半手定学习设置中实现了分类准确性的期望性能。这意味着未标记的图像可以增强对看不见数据的预测能力的连续学习模型的普遍性,并显着减轻灾难性的遗忘。该代码可在\ url {https://github.com/luoyan407/grad_prediction.git}中获得。
A key challenge for machine intelligence is to learn new visual concepts without forgetting the previously acquired knowledge. Continual learning is aimed towards addressing this challenge. However, there is a gap between existing supervised continual learning and human-like intelligence, where human is able to learn from both labeled and unlabeled data. How unlabeled data affects learning and catastrophic forgetting in the continual learning process remains unknown. To explore these issues, we formulate a new semi-supervised continual learning method, which can be generically applied to existing continual learning models. Specifically, a novel gradient learner learns from labeled data to predict gradients on unlabeled data. Hence, the unlabeled data could fit into the supervised continual learning method. Different from conventional semi-supervised settings, we do not hypothesize that the underlying classes, which are associated to the unlabeled data, are known to the learning process. In other words, the unlabeled data could be very distinct from the labeled data. We evaluate the proposed method on mainstream continual learning, adversarial continual learning, and semi-supervised learning tasks. The proposed method achieves state-of-the-art performance on classification accuracy and backward transfer in the continual learning setting while achieving desired performance on classification accuracy in the semi-supervised learning setting. This implies that the unlabeled images can enhance the generalizability of continual learning models on the predictive ability on unseen data and significantly alleviate catastrophic forgetting. The code is available at \url{https://github.com/luoyan407/grad_prediction.git}.