论文标题
部分可观测时空混沌系统的无模型预测
Adaptive Self-supervision Algorithms for Physics-informed Neural Networks
论文作者
论文摘要
物理知识的神经网络(PINN)将问题领域的物理知识作为对损失函数的软限制,但最近的工作表明,这可能导致优化困难。在这里,我们研究了搭配点的位置对这些模型训练性的影响。我们发现,随着训练的进行,可以通过适应搭配点的位置来显着提高香草Pinn的性能。具体而言,我们提出了一种新型的自适应搭配方案,该方案逐渐将更多的搭配点(没有增加数量)分配给模型造成更高误差的区域(基于域中损耗函数的梯度)。加上在任何优化失速过程中对训练的明智重新启动(通过简单地重新采样搭配点以调整损失格局)会导致预测误差的更好估计。我们提出了几个问题的结果,包括具有不同强迫函数的2D泊松和扩散 - 辅助系统。我们发现,针对这些问题的训练香草PINN可能会导致解决方案中的预测误差高达70%,尤其是在低搭配点的状态下。相反,我们的自适应方案可以达到较小误差的顺序,其计算复杂性与基线相似。此外,我们发现自适应方法始终如一地执行PAR或比香草Pinn方法稍好一些,即使对于大型搭配点方案也是如此。所有实验的代码都是开源的。
Physics-informed neural networks (PINNs) incorporate physical knowledge from the problem domain as a soft constraint on the loss function, but recent work has shown that this can lead to optimization difficulties. Here, we study the impact of the location of the collocation points on the trainability of these models. We find that the vanilla PINN performance can be significantly boosted by adapting the location of the collocation points as training proceeds. Specifically, we propose a novel adaptive collocation scheme which progressively allocates more collocation points (without increasing their number) to areas where the model is making higher errors (based on the gradient of the loss function in the domain). This, coupled with a judicious restarting of the training during any optimization stalls (by simply resampling the collocation points in order to adjust the loss landscape) leads to better estimates for the prediction error. We present results for several problems, including a 2D Poisson and diffusion-advection system with different forcing functions. We find that training vanilla PINNs for these problems can result in up to 70% prediction error in the solution, especially in the regime of low collocation points. In contrast, our adaptive schemes can achieve up to an order of magnitude smaller error, with similar computational complexity as the baseline. Furthermore, we find that the adaptive methods consistently perform on-par or slightly better than vanilla PINN method, even for large collocation point regimes. The code for all the experiments has been open sourced.