有效的非参数优化器搜索各种任务

论文标题

有效的非参数优化器搜索各种任务

Efficient Non-Parametric Optimizer Search for Diverse Tasks

论文作者

Wang, Ruochen, Xiong, Yuanhao, Cheng, Minhao, Hsieh, Cho-Jui

论文摘要

优化器的高效和自动化设计在全栈自动系统中起着至关重要的作用。但是，优化器搜索中的先前方法通常受其可扩展性，生成性或样本效率的限制。为了将优化器搜索的研究和应用民主化，我们提出了可以直接搜索感兴趣的任务的第一个高效，可扩展和可推广的框架。我们首先观察到优化器更新从根本上是数学表达式应用于梯度。受到基础数学表达式的先天树结构的启发，我们将优化器的空间重新安排到一个超树中，每个路径都编码优化器。这样，优化器搜索可以自然地作为路径找到问题，从而可以将各种良好的树遍历方法用作搜索算法。我们采用蒙特卡洛方法的改编来进行树木搜索，配备了拒绝采样和等效形式检测，从而利用优化器更新规则的特征来进一步提高样本效率。我们提供了一套多种任务，以基准我们的算法，并证明只有128个评估，提出的框架可以发现超过人类设计的对应方和先前的优化器搜索方法的优化器。

Efficient and automated design of optimizers plays a crucial role in full-stack AutoML systems. However, prior methods in optimizer search are often limited by their scalability, generability, or sample efficiency. With the goal of democratizing research and application of optimizer search, we present the first efficient, scalable and generalizable framework that can directly search on the tasks of interest. We first observe that optimizer updates are fundamentally mathematical expressions applied to the gradient. Inspired by the innate tree structure of the underlying math expressions, we re-arrange the space of optimizers into a super-tree, where each path encodes an optimizer. This way, optimizer search can be naturally formulated as a path-finding problem, allowing a variety of well-established tree traversal methods to be used as the search algorithm. We adopt an adaptation of the Monte Carlo method to tree search, equipped with rejection sampling and equivalent-form detection that leverage the characteristics of optimizer update rules to further boost the sample efficiency. We provide a diverse set of tasks to benchmark our algorithm and demonstrate that, with only 128 evaluations, the proposed framework can discover optimizers that surpass both human-designed counterparts and prior optimizer search methods.

下载PDF全文

下载文献需遵守相关版权规定

论文标题