论文标题
现金:公共云平台的信用意识安排
CASH: A Credit Aware Scheduling for Public Cloud Platforms
论文作者
论文摘要
公共云提供了无数的服务,使其租户可以灵活,简单且具有成本效益的方式处理大型大数据。租户通常使用大型数据处理框架,例如MapReduce,Tez,Spark等来处理其数据。租户可以将其框架配置为通过框架本身运行单个任务,或者拥有像纱线或Mesos这样的中间件群集管理器以在其公共云集群中仲裁资源调度。集群经理需要认识到工作负载要求以及群集中的CPU和磁盘等各个资源的状态。云提供商为其单个硬件资源使用代币的存储桶机制,作为单个硬件资源可以提供的服务质量的指标。在本文中,通过我们对纱线,Hadoop和Tez的更改,我们展示了如何使中间件集群管理人员了解集群中各个硬件资源的预期服务质量。我们优化的集群管理器具有对任务需求的粗糙知识知识,以及群集中硬件资源服务质量的细粒度知识可执行高度最佳的任务位置。我们的优化实验表明,基于CPU信用的实例(例如Amazon T3实例)是运行BigData工作负载的可行成本效益选项。我们还表明,在蜂巢仓库上流式SQL查询可以加速31%,从而使公共云成本节省高达22%。
The public cloud offers a myriad of services which allows its tenants to process large scale big data in a flexible, easy and cost effective manner. Tenants generally use large scale data processing frameworks such as MapReduce, Tez, Spark etc. to process their data. Tenants can configure their frameworks to run individual tasks by the framework itself or have a middleware cluster manager like YARN or Mesos to arbitrate resource scheduling in their public-cloud cluster. Cluster managers need to be cognizant about the workload requirement along with the state of the individual resource such as CPU and disk in the cluster. Cloud providers use a token bucket mechanism for their individual hardware resources as an indicator of the quality-of-service that individual hardware resource can provide. In this paper, through our changes in YARN, Hadoop and Tez, we show how middleware cluster managers can be made cognizant about the expected quality-of-service of individual hardware resources in the cluster. Our optimized cluster manager with a coarse grained knowledge of task requirement and fine grained knowledge of expected quality-of-service of hardware resources in the cluster performs highly optimal task placements. Our experiments with our optimizations show CPU credit based instances like the Amazon T3 instances as a viable cost effective option for running bigdata workloads. We also show that streaming SQL queries on a Hive warehouse can be accelerated by up to 31% leading to public cloud cost savings of up to 22%.