Paper Title

MIPS: Instance Placement for Stream Processing Systems based on Monte Carlo Tree Search

Authors

Xi Huang, Ziyu Shao, Yang Yang

Abstract

Stream processing engines enable modern systems to conduct large-scale analytics over unbounded data streams in real time. They often model an application as a directed acyclic graph in which streams flow through pipelined instances of various processing units. One key challenge that emerges is instance placement, i.e., deciding where to place instances across servers so as to minimize cross-server traffic and maximize resource utilization. The challenge stems not only from its intrinsic complexity but also from the interplay between successive application deployments. Most recent engines, such as Apache Heron, adopt a more modularized scheduler design that decomposes the task into two stages: one decides the instance-to-container mapping, while the other handles the container-to-server mapping, which is delegated to standalone resource managers. The unaligned objectives and scheduler designs of the two stages may lead to long response times or low utilization. So far, however, little work has addressed this challenge. Inspired by the recent success of Monte Carlo Tree Search (MCTS) methods in various fields, we develop a novel model to characterize such systems, formulate the problem, and cast each stage of the mapping as a sequential decision process. By adopting MCTS methods, we propose MIPS, an MCTS-based Instance Placement Scheme that decides the two-stage mapping in a timely yet efficient manner. In addition, we discuss practical issues and refine MIPS to further improve its performance. Results from extensive simulations show that, given a moderate number of samples, MIPS outperforms existing schemes with significant traffic reduction and utilization improvement. To the best of our knowledge, this paper is the first to study the two-stage mapping problem and to apply MCTS to it.
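To make the idea of casting placement as a sequential decision process concrete, here is a minimal Python sketch of a generic UCT-style MCTS applied to a single mapping stage (instance-to-container). It is not the MIPS algorithm from the paper. The function names, the equal-capacity containers, the cross-container traffic cost, and the toy pipeline are all assumptions made for illustration.

```python
import math
import random

def cross_traffic(edges, placement):
    """Total weight of edges whose endpoints sit in different containers."""
    return sum(w for u, v, w in edges if placement[u] != placement[v])

def legal_moves(placement, n_containers, capacity):
    """Containers that still have room for the next instance (assumed model)."""
    return [c for c in range(n_containers) if placement.count(c) < capacity]

class Node:
    def __init__(self, placement, parent=None):
        self.placement = placement   # tuple: container chosen for instances 0..k-1
        self.parent = parent
        self.children = {}           # container index -> child Node
        self.visits = 0
        self.total_reward = 0.0

def mcts_place(n_instances, edges, n_containers, capacity,
               n_samples=2000, c_uct=1.4, seed=0):
    """Hypothetical helper: place instances one by one, guided by UCT."""
    if n_containers * capacity < n_instances:
        raise ValueError("not enough container slots for all instances")
    rng = random.Random(seed)
    total = sum(w for _, _, w in edges) or 1.0
    root = Node(())
    best_placement, best_cost = None, float("inf")
    for _ in range(n_samples):
        node = root
        # Selection: follow UCT until a node with an untried move is reached.
        while len(node.placement) < n_instances:
            moves = legal_moves(node.placement, n_containers, capacity)
            untried = [m for m in moves if m not in node.children]
            if untried:
                # Expansion: add one new child for a randomly chosen untried move.
                move = rng.choice(untried)
                child = Node(node.placement + (move,), node)
                node.children[move] = child
                node = child
                break
            parent = node
            node = max(
                parent.children.values(),
                key=lambda ch: ch.total_reward / ch.visits
                + c_uct * math.sqrt(math.log(parent.visits) / ch.visits),
            )
        # Simulation: finish the placement with uniformly random legal moves.
        placement = list(node.placement)
        while len(placement) < n_instances:
            placement.append(
                rng.choice(legal_moves(placement, n_containers, capacity)))
        cost = cross_traffic(edges, placement)
        if cost < best_cost:
            best_placement, best_cost = tuple(placement), cost
        reward = 1.0 - cost / total  # less cross-container traffic is better
        # Backpropagation: update statistics along the path back to the root.
        while node is not None:
            node.visits += 1
            node.total_reward += reward
            node = node.parent
    return best_placement, best_cost

# Toy pipeline (made up): two heavy chains joined by one light edge.
EDGES = [(0, 1, 10), (1, 2, 10), (2, 3, 1), (3, 4, 10), (4, 5, 10)]
placement, cost = mcts_place(6, EDGES, n_containers=2, capacity=3)
print(placement, cost)
```

The sketch keeps the best complete placement seen in any rollout instead of reading the answer off the most-visited path, which is a simplification chosen for brevity. A second stage (container-to-server) could reuse the same search with a different cost function, but how the paper couples the two stages is not described in the abstract.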
