幼稚的几次学习：发现机器的流体智能

论文标题

幼稚的几次学习：发现机器的流体智能

Naive Few-Shot Learning: Uncovering the fluid intelligence of machines

论文作者

Barak, Tomer, Loewenstein, Yonatan

论文摘要

在本文中，我们旨在帮助弥合人体流体智能之间的差距 - 人体流体智能的差距 - 无需事先培训即可解决新任务的能力 - 以及深度神经网络的表现，这通常需要大量的先前培训。解决智力测试的基本认知成分，在人类中用于测量流体智能，是识别序列规律性的能力。这促使我们构建了一个基准任务，我们将其称为\ textit {semence一致性评估}（SCE），其解决方案需要能够识别序列中的规律性。鉴于深层网络的可靠能力，他们在大规模培训后可以解决此类任务的能力。但是，令人惊讶的是，我们表明，在\ textIt {single {single} sce上培训的幼稚（随机初始化的）深度学习模型，具有\ textit {single {single}优化步骤仍然可以求解任务的非琐事版本。我们扩展了我们的发现以解决视觉和听觉方式中的任何事先培训，现实世界中的异常检测任务。这些结果证明了深网的流体智能计算能力。我们讨论我们工作对构建流体智能机器的含义。

In this paper, we aimed to help bridge the gap between human fluid intelligence - the ability to solve novel tasks without prior training - and the performance of deep neural networks, which typically require extensive prior training. An essential cognitive component for solving intelligence tests, which in humans are used to measure fluid intelligence, is the ability to identify regularities in sequences. This motivated us to construct a benchmark task, which we term \textit{sequence consistency evaluation} (SCE), whose solution requires the ability to identify regularities in sequences. Given the proven capabilities of deep networks, their ability to solve such tasks after extensive training is expected. Surprisingly, however, we show that naive (randomly initialized) deep learning models that are trained on a \textit{single} SCE with a \textit{single} optimization step can still solve non-trivial versions of the task relatively well. We extend our findings to solve, without any prior training, real-world anomaly detection tasks in the visual and auditory modalities. These results demonstrate the fluid-intelligent computational capabilities of deep networks. We discuss the implications of our work for constructing fluid-intelligent machines.

下载PDF全文

下载文献需遵守相关版权规定

论文标题