论文标题
通过整体平均学习对未知动态系统的强大建模
Robust Modeling of Unknown Dynamical Systems via Ensemble Averaged Learning
论文作者
论文摘要
最近的工作集中在通过深度神经网络(DNN)的数据驱动的学习过程中,其目的是对未知系统的演变进行长时间的预测。在这种情况下,训练一个具有低概括错误的DNN是一项特别重要的任务,因为随着时间的推移会累积错误。由于DNN训练中的固有随机性,主要是在随机优化中,因此产生的预测存在不确定性,因此在概括误差中存在不确定性。因此,可以将概括误差视为具有一定概率分布的随机变量。训练有素的DNN,特别是具有许多超参数的DNN,通常会导致概率分布,其概率误差较低,但差异很高。高方差会导致变异性,并且在训练有素的DNN的结果中不可预测。本文提出了一种计算技术,该技术降低了概括误差的方差,从而提高了DNN模型的可靠性以始终如一地概括。在提出的整体平均方法中,多个模型是独立训练的,并且在每个时间步骤中进行平均模型预测。提出了该方法的数学基础,包括有关局部截断误差的分布的结果。此外,三个时间依赖性的微分方程问题被认为是数值示例,这证明了该方法通常降低DNN预测方差的有效性。
Recent work has focused on data-driven learning of the evolution of unknown systems via deep neural networks (DNNs), with the goal of conducting long time prediction of the evolution of the unknown system. Training a DNN with low generalization error is a particularly important task in this case as error is accumulated over time. Because of the inherent randomness in DNN training, chiefly in stochastic optimization, there is uncertainty in the resulting prediction, and therefore in the generalization error. Hence, the generalization error can be viewed as a random variable with some probability distribution. Well-trained DNNs, particularly those with many hyperparameters, typically result in probability distributions for generalization error with low bias but high variance. High variance causes variability and unpredictably in the results of a trained DNN. This paper presents a computational technique which decreases the variance of the generalization error, thereby improving the reliability of the DNN model to generalize consistently. In the proposed ensemble averaging method, multiple models are independently trained and model predictions are averaged at each time step. A mathematical foundation for the method is presented, including results regarding the distribution of the local truncation error. In addition, three time-dependent differential equation problems are considered as numerical examples, demonstrating the effectiveness of the method to decrease variance of DNN predictions generally.