论文标题

具有非负数据的非阴性最小二乘的快速量表不变算法

A Fast Scale-Invariant Algorithm for Non-negative Least Squares with Non-negative Data

论文作者

Diakonikolas, Jelena, Li, Chenghui, Padmanabhan, Swati, Song, Chaobing

论文摘要

非负(线性)最小平方问题是统计学习中经过充分研究的基本问题,并且在机器学习社区中使用的许多标准编程语言中已经实施了求解器。现有的现成的求解器将这些问题的非阴性限制视为障碍,并且与不受限制的最小二乘正方形相比,要付出额外的努力来解决它。但是,在许多典型应用程序中,数据本身也不负,我们表明在这种情况下的非负性使问题更容易。特别是,虽然不受约束的最小二乘问题的甲骨文复杂性必定与数据矩阵常数之一(通常是光谱规范)缩放,并且这些问题可以求解为添加误差,但我们表明,非负性最小二乘问题与非阴性数据可溶于多种误差,并且具有与任何矩阵常数相关的复杂性。我们引入的算法是加速的,并基于原始偶的观点。我们进一步展示了如何使用自适应重新启动与我们的方法相结合,并通过数值实验证明了其在大规模数据上的有效性。

Nonnegative (linear) least square problems are a fundamental class of problems that is well-studied in statistical learning and for which solvers have been implemented in many of the standard programming languages used within the machine learning community. The existing off-the-shelf solvers view the non-negativity constraint in these problems as an obstacle and, compared to unconstrained least squares, perform additional effort to address it. However, in many of the typical applications, the data itself is nonnegative as well, and we show that the nonnegativity in this case makes the problem easier. In particular, while the oracle complexity of unconstrained least squares problems necessarily scales with one of the data matrix constants (typically the spectral norm) and these problems are solved to additive error, we show that nonnegative least squares problems with nonnegative data are solvable to multiplicative error and with complexity that is independent of any matrix constants. The algorithm we introduce is accelerated and based on a primal-dual perspective. We further show how to provably obtain linear convergence using adaptive restart coupled with our method and demonstrate its effectiveness on large-scale data via numerical experiments.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源