论文标题
DSA:通过可区分的稀疏分配更有效的预算修剪
DSA: More Efficient Budgeted Pruning via Differentiable Sparsity Allocation
论文作者
论文摘要
预算修剪是在资源限制下修剪的问题。在预算的修剪中,如何跨层(即稀疏分配)分配资源是关键问题。传统方法通过离散地搜索缺乏效率的层修剪比来解决它。在本文中,我们提出了可区分的稀疏分配(DSA),这是一种有效的端到端预算修剪流。 DSA利用新型的可区分修剪过程,通过基于梯度的优化找到了层的修剪比。它可以在连续空间中分配稀疏性,这比基于离散评估和搜索的方法更有效。此外,DSA可以从划伤的方式上进行修剪,而传统的预算修剪方法则应用于预训练的模型。 CIFAR-10和ImageNet的实验结果表明,DSA比当前的迭代预算修剪方法可以取得更高的性能,并且在此期间,整个修剪过程的时间成本缩短了至少1.5倍。
Budgeted pruning is the problem of pruning under resource constraints. In budgeted pruning, how to distribute the resources across layers (i.e., sparsity allocation) is the key problem. Traditional methods solve it by discretely searching for the layer-wise pruning ratios, which lacks efficiency. In this paper, we propose Differentiable Sparsity Allocation (DSA), an efficient end-to-end budgeted pruning flow. Utilizing a novel differentiable pruning process, DSA finds the layer-wise pruning ratios with gradient-based optimization. It allocates sparsity in continuous space, which is more efficient than methods based on discrete evaluation and search. Furthermore, DSA could work in a pruning-from-scratch manner, whereas traditional budgeted pruning methods are applied to pre-trained models. Experimental results on CIFAR-10 and ImageNet show that DSA could achieve superior performance than current iterative budgeted pruning methods, and shorten the time cost of the overall pruning process by at least 1.5x in the meantime.