论文标题
视觉场所识别的分层多进程融合
Hierarchical Multi-Process Fusion for Visual Place Recognition
论文作者
论文摘要
长期以来,将多种互补技术组合在一起,一直被视为提高性能的一种方式。在视觉定位,多传感器融合,单个感应方式的多进程融合,甚至不同定位技术的组合都会导致性能提高。但是,仅将不同的本地化技术融合在一起并不能说明不同定位技术的不同性能特征。在本文中,我们提出了一个新颖的,分层的定位系统,该系统明确受益于三个不同特征的定位技术:其本地化假设的分布,外观和视点不变的属性以及每个系统在环境中的差异良好。我们展示了如何在层次上部署的两种技术比并行融合更好,即即使单个技术具有较高的个体性能,并开发了两个和三层的层次结构,从而逐步提高了本地化性能。最后,我们开发了一个堆叠的层次结构框架,其中将来自具有互补特性的技术的定位假设在每一层都串联,从而显着提高了正确假设到最终定位阶段的保留。使用两个具有挑战性的数据集,我们显示所提出的系统优于最先进的技术。
Combining multiple complementary techniques together has long been regarded as a way to improve performance. In visual localization, multi-sensor fusion, multi-process fusion of a single sensing modality, and even combinations of different localization techniques have been shown to result in improved performance. However, merely fusing together different localization techniques does not account for the varying performance characteristics of different localization techniques. In this paper we present a novel, hierarchical localization system that explicitly benefits from three varying characteristics of localization techniques: the distribution of their localization hypotheses, their appearance- and viewpoint-invariant properties, and the resulting differences in where in an environment each system works well and fails. We show how two techniques deployed hierarchically work better than in parallel fusion, how combining two different techniques works better than two levels of a single technique, even when the single technique has superior individual performance, and develop two and three-tier hierarchical structures that progressively improve localization performance. Finally, we develop a stacked hierarchical framework where localization hypotheses from techniques with complementary characteristics are concatenated at each layer, significantly improving retention of the correct hypothesis through to the final localization stage. Using two challenging datasets, we show the proposed system outperforming state-of-the-art techniques.