利用频率分析，以进行深度假图像识别

论文标题

利用频率分析，以进行深度假图像识别

Leveraging Frequency Analysis for Deep Fake Image Recognition

论文作者

Frank, Joel, Eisenhofer, Thorsten, Schönherr, Lea, Fischer, Asja, Kolossa, Dorothea, Holz, Thorsten

论文摘要

深度神经网络可以产生惊人的现实图像，以至于人类通常很难将它们与实际照片区分开。这些成就在很大程度上是通过生成对抗网络（GAN）实现的。尽管图像域（来自图像取证领域的经典方法）在图像域中进行了彻底研究，但到目前为止，频域的分析已经缺失。在本文中，我们解决了这一缺点，结果表明，在频率空间中，gan生成的图像表现出可轻松识别的严重伪影。我们进行了全面的分析，表明这些伪像在不同的神经网络架构，数据集和决议之间是一致的。在进一步的研究中，我们证明了这些伪像是由当前所有gan体系结构中发现的上采样操作引起的，这表明通过gan产生图像的结构和基本问题。基于此分析，我们演示了如何使用频率表示以自动化的方式识别深层假图像，从而超过了最新方法。

Deep neural networks can generate images that are astonishingly realistic, so much so that it is often hard for humans to distinguish them from actual photos. These achievements have been largely made possible by Generative Adversarial Networks (GANs). While deep fake images have been thoroughly investigated in the image domain - a classical approach from the area of image forensics - an analysis in the frequency domain has been missing so far. In this paper, we address this shortcoming and our results reveal that in frequency space, GAN-generated images exhibit severe artifacts that can be easily identified. We perform a comprehensive analysis, showing that these artifacts are consistent across different neural network architectures, data sets, and resolutions. In a further investigation, we demonstrate that these artifacts are caused by upsampling operations found in all current GAN architectures, indicating a structural and fundamental problem in the way images are generated via GANs. Based on this analysis, we demonstrate how the frequency representation can be used to identify deep fake images in an automated way, surpassing state-of-the-art methods.

下载PDF全文

下载文献需遵守相关版权规定

论文标题