论文标题
高分辨率深图像垫子
High-Resolution Deep Image Matting
论文作者
论文摘要
图像垫是图像和视频编辑和组成的关键技术。从传统上讲,深度学习方法采用了整个输入图像和相关的三张图,可以使用卷积神经网络推断alpha哑光。这种方法在图像垫子中设置了最新的方法;但是,由于硬件限制,它们可能会在现实世界中的应用程序中失败,因为用于垫子的现实世界输入图像大多是非常高的分辨率。在本文中,我们提出了HDMATT,这是用于高分辨率输入的第一个基于深度学习的图像垫方法。更具体地说,HDMATT以基于补丁的作物和缝隙方式运行,以使用新型模块设计,以解决不同斑块之间的上下文依赖性和一致性问题。与基于香草补丁的推理相比,我们独立地计算每个贴片的推断,我们将通过给定的Trimap指导的新填充的交叉点上下文模块(CPC)明确对交叉点上下文依赖性进行建模。广泛的实验证明了该方法的有效性及其对高分辨率输入的必要性。我们的HDMATT方法还为Adobe Image Matting和Alphamatting Benchmarks设置了新的最新性能,并在更真实的高分辨率图像上产生令人印象深刻的视觉结果。
Image matting is a key technique for image and video editing and composition. Conventionally, deep learning approaches take the whole input image and an associated trimap to infer the alpha matte using convolutional neural networks. Such approaches set state-of-the-arts in image matting; however, they may fail in real-world matting applications due to hardware limitations, since real-world input images for matting are mostly of very high resolution. In this paper, we propose HDMatt, a first deep learning based image matting approach for high-resolution inputs. More concretely, HDMatt runs matting in a patch-based crop-and-stitch manner for high-resolution inputs with a novel module design to address the contextual dependency and consistency issues between different patches. Compared with vanilla patch-based inference which computes each patch independently, we explicitly model the cross-patch contextual dependency with a newly-proposed Cross-Patch Contextual module (CPC) guided by the given trimap. Extensive experiments demonstrate the effectiveness of the proposed method and its necessity for high-resolution inputs. Our HDMatt approach also sets new state-of-the-art performance on Adobe Image Matting and AlphaMatting benchmarks and produce impressive visual results on more real-world high-resolution images.