头：异质对象探测器的异助助剂蒸馏

论文标题

头：异质对象探测器的异助助剂蒸馏

HEAD: HEtero-Assists Distillation for Heterogeneous Object Detectors

论文作者

Wang, Luting, Li, Xiaojie, Liao, Yue, Jiang, Zeren, Wu, Jianlong, Wang, Fei, Qian, Chen, Liu, Si

论文摘要

用于对象检测的常规知识蒸馏（KD）方法主要集中于同质的教师学生探测器。但是，用于部署的轻质检测器的设计通常与高容量探测器显着不同。因此，我们研究了非均质教师对之间的KD，以进行广泛的应用。我们观察到，异质KD（异质KD）的核心难度是由于不同优化的方式，异质探测器的主链特征之间的显着语义差距。常规的同质KD（HOMO-KD）方法遭受了这种差距的影响，并且很难直接获得异质KD的令人满意的性能。在本文中，我们提出了异助剂蒸馏（头）框架，利用异质检测头作为助手来指导学生探测器的优化以减少这一差距。在头上，助手是一个额外的检测头，其建筑与学生骨干的老师负责人同质性。因此，将异源KD转变为同性恋，从而可以有效地从老师到学生的知识转移。此外，当训练有素的教师探测器不可用时，我们将头部扩展到一个无教师的头（TF-Head）框架。与当前检测KD方法相比，我们的方法已取得了显着改善。例如，在MS-COCO数据集上，TF-Head可帮助R18视网膜实现33.9 MAP（+2.2），而Head将极限进一步推到36.2 MAP（+4.5）。

Conventional knowledge distillation (KD) methods for object detection mainly concentrate on homogeneous teacher-student detectors. However, the design of a lightweight detector for deployment is often significantly different from a high-capacity detector. Thus, we investigate KD among heterogeneous teacher-student pairs for a wide application. We observe that the core difficulty for heterogeneous KD (hetero-KD) is the significant semantic gap between the backbone features of heterogeneous detectors due to the different optimization manners. Conventional homogeneous KD (homo-KD) methods suffer from such a gap and are hard to directly obtain satisfactory performance for hetero-KD. In this paper, we propose the HEtero-Assists Distillation (HEAD) framework, leveraging heterogeneous detection heads as assistants to guide the optimization of the student detector to reduce this gap. In HEAD, the assistant is an additional detection head with the architecture homogeneous to the teacher head attached to the student backbone. Thus, a hetero-KD is transformed into a homo-KD, allowing efficient knowledge transfer from the teacher to the student. Moreover, we extend HEAD into a Teacher-Free HEAD (TF-HEAD) framework when a well-trained teacher detector is unavailable. Our method has achieved significant improvement compared to current detection KD methods. For example, on the MS-COCO dataset, TF-HEAD helps R18 RetinaNet achieve 33.9 mAP (+2.2), while HEAD further pushes the limit to 36.2 mAP (+4.5).

下载PDF全文

下载文献需遵守相关版权规定

论文标题