论文标题
Joslim:可轻微神经网络的联合宽度和权重优化
Joslim: Joint Widths and Weights Optimization for Slimmable Neural Networks
论文作者
论文摘要
微弱的神经网络在预测错误和计算要求(例如浮点操作或拖船的数量)之间提供了与单个模型相同的存储要求之间的灵活权衡方面。它们可用于减少将模型部署到具有不同内存限制的设备的维护开销,并且对于优化许多CNN的系统效率很有用。但是,现有的微小网络方法要么不会优化层的宽度,要么独立地优化共享权重和层的宽度,从而为关节宽度和重量优化留下了很大的改进空间。在这项工作中,我们提出了一个通用框架,以实现对宽度配置和轻巧网络的权重的关节优化。我们的框架将传统和基于NAS的可靠方法作为特殊情况,并为改进现有方法提供了灵活性。从实际的角度来看,我们提出了Joslim,这是一种算法,可以共同优化可靠网的宽度和权重,该算法的宽度和权重优于现有方法,以优化各种网络,数据集和目标的可靠网络。从数量上讲,对于MobileNetV2,可以在拖台和内存足迹上分别获得ImabiLenetV2的TOP-1准确性的改善。我们的结果突出了与可靠网络的权重共同优化不同层的通道计数的潜力。代码可在https://github.com/cmu-enyac/joslim上找到。
Slimmable neural networks provide a flexible trade-off front between prediction error and computational requirement (such as the number of floating-point operations or FLOPs) with the same storage requirement as a single model. They are useful for reducing maintenance overhead for deploying models to devices with different memory constraints and are useful for optimizing the efficiency of a system with many CNNs. However, existing slimmable network approaches either do not optimize layer-wise widths or optimize the shared-weights and layer-wise widths independently, thereby leaving significant room for improvement by joint width and weight optimization. In this work, we propose a general framework to enable joint optimization for both width configurations and weights of slimmable networks. Our framework subsumes conventional and NAS-based slimmable methods as special cases and provides flexibility to improve over existing methods. From a practical standpoint, we propose Joslim, an algorithm that jointly optimizes both the widths and weights for slimmable nets, which outperforms existing methods for optimizing slimmable networks across various networks, datasets, and objectives. Quantitatively, improvements up to 1.7% and 8% in top-1 accuracy on the ImageNet dataset can be attained for MobileNetV2 considering FLOPs and memory footprint, respectively. Our results highlight the potential of optimizing the channel counts for different layers jointly with the weights for slimmable networks. Code available at https://github.com/cmu-enyac/Joslim.