论文标题
M $^5 $ L:用于RGBT跟踪的多模式多修理度量学习
M$^5$L: Multi-Modal Multi-Margin Metric Learning for RGBT Tracking
论文作者
论文摘要
在RGBT跟踪过程中对令人困惑的样本进行分类是一个非常具有挑战性的问题,这并不满意。现有方法仅着重于扩大正面样本和负面样本之间的边界,但是,样本的结构化信息可能会受到损害,例如,混淆的正样品比正常的正样品更接近锚点。要处理这个问题,我们提出了一个新颖的多模式多模式的多模式度量度量学习框架,名为M $^5 $^5 $ lgbt Tracking in Chip in Chip in Perpers in Chip in Chips in Perpers in Chip in Perpers中。特别是,我们设计了多余的结构化损失,以区分令人困惑的样本,这些样本在跟踪性能提升中起着最关键的作用。为了减轻这个问题,我们还通过利用每个模态中所有样本中所有样本的结构化信息来降低样本的结构化信息,从而扩大混淆积极样本和正常样本之间的界限,而正常样本与预定的边缘之间的界限。更重要的是,采用交叉模式约束来降低模态的差异,并降低了差异的差异。融合,我们介绍了模式的注意,并使用网络中的功能融合模块来学习它们。大规模数据集的广泛实验证明,我们的框架显然改善了跟踪性能,并表现优于最先进的RGBT跟踪器。
Classifying the confusing samples in the course of RGBT tracking is a quite challenging problem, which hasn't got satisfied solution. Existing methods only focus on enlarging the boundary between positive and negative samples, however, the structured information of samples might be harmed, e.g., confusing positive samples are closer to the anchor than normal positive samples.To handle this problem, we propose a novel Multi-Modal Multi-Margin Metric Learning framework, named M$^5$L for RGBT tracking in this paper. In particular, we design a multi-margin structured loss to distinguish the confusing samples which play a most critical role in tracking performance boosting. To alleviate this problem, we additionally enlarge the boundaries between confusing positive samples and normal ones, between confusing negative samples and normal ones with predefined margins, by exploiting the structured information of all samples in each modality.Moreover, a cross-modality constraint is employed to reduce the difference between modalities and push positive samples closer to the anchor than negative ones from two modalities.In addition, to achieve quality-aware RGB and thermal feature fusion, we introduce the modality attentions and learn them using a feature fusion module in our network. Extensive experiments on large-scale datasets testify that our framework clearly improves the tracking performance and outperforms the state-of-the-art RGBT trackers.