通过学习的控制策略对移动目标进行主动分类

论文标题

通过学习的控制策略对移动目标进行主动分类

Active Classification of Moving Targets with Learned Control Policies

论文作者

Serra-Gómez, Álvaro, Montijano, Eduardo, Böhmer, Wendelin, Alonso-Mora, Javier

论文摘要

在本文中，我们考虑了无人机必须收集语义信息以对多个移动目标进行分类的问题。特别是，当使用“ Black-Box”分类器（例如深度学习神经网络）提取信息时，我们将解决将无人机转移到信息视点，位置和方向的计算控制输入的挑战。这些算法通常缺乏观点及其相关输出之间的分析关系，从而阻止了它们在信息收集方案中的使用。为了填补这一空白，我们提出了一种通过强化学习（RL）训练的新型基于注意力的建筑，该体系结构为无人机的下一个观点输出，而有利于从尽可能多的未分类目标中获取证据，同时推理其运动，方向和遮挡。然后，我们使用低级MPC控制器将无人机移至所需的观点，考虑到其实际动态。我们表明，我们的方法不仅表现出色，而且还概括了在训练过程中看不见的场景。此外，我们表明网络缩放到大量目标，并将其推广到目标的不同运动动力学。

In this paper, we consider the problem where a drone has to collect semantic information to classify multiple moving targets. In particular, we address the challenge of computing control inputs that move the drone to informative viewpoints, position and orientation, when the information is extracted using a "black-box" classifier, e.g., a deep learning neural network. These algorithms typically lack of analytical relationships between the viewpoints and their associated outputs, preventing their use in information-gathering schemes. To fill this gap, we propose a novel attention-based architecture, trained via Reinforcement Learning (RL), that outputs the next viewpoint for the drone favoring the acquisition of evidence from as many unclassified targets as possible while reasoning about their movement, orientation, and occlusions. Then, we use a low-level MPC controller to move the drone to the desired viewpoint taking into account its actual dynamics. We show that our approach not only outperforms a variety of baselines but also generalizes to scenarios unseen during training. Additionally, we show that the network scales to large numbers of targets and generalizes well to different movement dynamics of the targets.

下载PDF全文

下载文献需遵守相关版权规定

论文标题