贝叶斯深度学习图

论文标题

贝叶斯深度学习图

Bayesian Deep Learning for Graphs

论文作者

Errica, Federico

论文摘要

结构化数据的自适应处理是机器学习中的一个长期研究主题，它研究了如何自动从结构化输入到各种性质的输出来自动学习映射。最近，对图的自适应处理引起了人们的兴趣，这导致了不同的基于神经网络的方法的发展。在本文中，我们采用了不同的路线，并为图形学习开发了贝叶斯深度学习框架。论文始于对本领域中大多数方法的原理的审查，然后是有关图形分类可重复性问题的研究。然后，我们通过以渐进的方式建立我们的深度体系结构来弥合图形深度学习的基本思想。该框架使我们能够考虑具有离散和连续边缘功能的图形，从而产生足够丰富的无监督的嵌入，以便在多个分类任务上达到最新技术。我们的方法也适合贝叶斯非参数扩展，该扩展几乎可以自动选择所有模型的超参数。两个现实世界的应用显示了深度学习对图的功效。首先涉及使用监督神经模型的分子模拟信息理论量的预测。之后，我们利用贝叶斯模型来解决一项恶意软件分类任务，同时又对术内代码混淆技术进行了强大的态度。我们结束论文，试图将神经和贝叶斯世界的最好的世界融合在一起。所得的混合模型能够预测以输入图为条件的多模式分布，因此具有比大多数作品更好地建模随机性和不确定性的能力。总体而言，我们旨在为图形深度学习的清晰研究领域提供贝叶斯的观点。

The adaptive processing of structured data is a long-standing research topic in machine learning that investigates how to automatically learn a mapping from a structured input to outputs of various nature. Recently, there has been an increasing interest in the adaptive processing of graphs, which led to the development of different neural network-based methodologies. In this thesis, we take a different route and develop a Bayesian Deep Learning framework for graph learning. The dissertation begins with a review of the principles over which most of the methods in the field are built, followed by a study on graph classification reproducibility issues. We then proceed to bridge the basic ideas of deep learning for graphs with the Bayesian world, by building our deep architectures in an incremental fashion. This framework allows us to consider graphs with discrete and continuous edge features, producing unsupervised embeddings rich enough to reach the state of the art on several classification tasks. Our approach is also amenable to a Bayesian nonparametric extension that automatizes the choice of almost all model's hyper-parameters. Two real-world applications demonstrate the efficacy of deep learning for graphs. The first concerns the prediction of information-theoretic quantities for molecular simulations with supervised neural models. After that, we exploit our Bayesian models to solve a malware-classification task while being robust to intra-procedural code obfuscation techniques. We conclude the dissertation with an attempt to blend the best of the neural and Bayesian worlds together. The resulting hybrid model is able to predict multimodal distributions conditioned on input graphs, with the consequent ability to model stochasticity and uncertainty better than most works. Overall, we aim to provide a Bayesian perspective into the articulated research field of deep learning for graphs.

下载PDF全文

下载文献需遵守相关版权规定

论文标题