用于在瞬态服务器上运行交互式应用程序的云规模VM通缩

论文标题

用于在瞬态服务器上运行交互式应用程序的云规模VM通缩

Cloud-scale VM Deflation for Running Interactive Applications On Transient Servers

论文作者

Fuerst, Alexander, Ali-Eldin, Ahmed, Shenoy, Prashant, Sharma, Prateek

论文摘要

瞬态计算已在公共云环境中流行，以低成本运行延迟不敏感的批处理和数据处理应用程序。由于云提供商可以随时撤销瞬态云服务器，因此它们被认为不适合运行交互式应用程序（例如Web服务）。在本文中，我们将VM放气作为服务器抢占的替代机制，用于在资源压力下从瞬态云服务器中收回资源。使用来自顶级云提供商的真实痕迹，我们展示了将VM通气用作公共云中交互式应用的资源回收机制的可行性。我们展示了当前的管理程序机制如何用于实现VM通气，并呈现群集通缩策略，用于瞬态和点播云VM的资源管理。在Linux群集上对我们的通气系统的实验评估表明，基于微服务的应用程序可以通过高达50 \％的速度，而效果可忽略不计。我们的集群级通态政策允许高达50 \％的过度承诺水平，而应用程序吞吐量降低了1 \％，并且可以使云平台能够将收入提高30 \％。

Transient computing has become popular in public cloud environments for running delay-insensitive batch and data processing applications at low cost. Since transient cloud servers can be revoked at any time by the cloud provider, they are considered unsuitable for running interactive application such as web services. In this paper, we present VM deflation as an alternative mechanism to server preemption for reclaiming resources from transient cloud servers under resource pressure. Using real traces from top-tier cloud providers, we show the feasibility of using VM deflation as a resource reclamation mechanism for interactive applications in public clouds. We show how current hypervisor mechanisms can be used to implement VM deflation and present cluster deflation policies for resource management of transient and on-demand cloud VMs. Experimental evaluation of our deflation system on a Linux cluster shows that microservice-based applications can be deflated by up to 50\% with negligible performance overhead. Our cluster-level deflation policies allow overcommitment levels as high as 50\%, with less than a 1\% decrease in application throughput, and can enable cloud platforms to increase revenue by 30\%.

下载PDF全文

下载文献需遵守相关版权规定

论文标题