论文标题
NASB:神经建筑搜索二进制卷积神经网络
NASB: Neural Architecture Search for Binary Convolutional Neural Networks
论文作者
论文摘要
二进制卷积神经网络(CNN)大大减少了CNN所需的算术操作数量和内存存储的大小,这使得它们在移动和嵌入式系统上的部署更加可行。但是,由于两个原因,二进制后的CNN架构需要重新设计和精制。尽管已经在为单个和多个二进制CNN设计架构上投入了大量努力,但仍然很难找到用于二进制CNN的最佳体系结构。在本文中,我们提出了一种名为NASB的策略,该策略采用神经体系结构搜索(NAS)来找到用于CNNS二进制的最佳体系结构。由于这种自动化策略的灵活性,所获得的体系结构不仅适用于二进制化,而且开销较低,在手工优化二进制CNN的准确性和计算复杂性之间取决于更好的权衡。在ImageNet数据集上评估了NASB策略的实施,与现有量化的CNN相比,它被证明是更好的解决方案。随着无关紧要的间接费用,NASB的表现分别优于现有的单个和多个二进制CNN,高达4.0%和1.0%的Top-1精度,使它们更接近其完整的精度对应物的精度。代码和预估计的模型将公开可用。
Binary Convolutional Neural Networks (CNNs) have significantly reduced the number of arithmetic operations and the size of memory storage needed for CNNs, which makes their deployment on mobile and embedded systems more feasible. However, the CNN architecture after binarizing requires to be redesigned and refined significantly due to two reasons: 1. the large accumulation error of binarization in the forward propagation, and 2. the severe gradient mismatch problem of binarization in the backward propagation. Even though the substantial effort has been invested in designing architectures for single and multiple binary CNNs, it is still difficult to find an optimal architecture for binary CNNs. In this paper, we propose a strategy, named NASB, which adopts Neural Architecture Search (NAS) to find an optimal architecture for the binarization of CNNs. Due to the flexibility of this automated strategy, the obtained architecture is not only suitable for binarization but also has low overhead, achieving a better trade-off between the accuracy and computational complexity of hand-optimized binary CNNs. The implementation of NASB strategy is evaluated on the ImageNet dataset and demonstrated as a better solution compared to existing quantized CNNs. With the insignificant overhead increase, NASB outperforms existing single and multiple binary CNNs by up to 4.0% and 1.0% Top-1 accuracy respectively, bringing them closer to the precision of their full precision counterpart. The code and pretrained models will be publicly available.