Abstract
Workflow is an important model for big data processing and resource provisioning is crucial to the performance of workflows. Recently, system variations in the cloud and large-scale clusters, such as those in I/O and network performances, have been observed to greatly affect the performance of workflows. Traditional resource provisioning methods, which overlook these variations, can lead to suboptimal resource provisioning results. In this paper, we provide a general solution for workflow performance optimizations considering system variations. Specifically, we model system variations as time-dependent random variables and take their probability distributions as optimization input. Despite its effectiveness, this solution involves heavy computation overhead. Thus, we propose three pruning techniques to simplify workflow structure and reduce the probability evaluation overhead. We implement our techniques in a runtime library, which allows users to incorporate efficient probabilistic optimization into existing resource provisioning methods. Experiments show that probabilistic solutions can improve the performance by 51% compared to state-of-the-art static solutions while guaranteeing budget constraint, and our pruning techniques can greatly reduce the overhead of probabilistic optimization.
| Original language | English |
|---|---|
| Title of host publication | Proceedings of the 48th International Conference on Parallel Processing, ICPP 2019 |
| Place of Publication | New York |
| Publisher | Association for Computing Machinery (ACM) |
| Number of pages | 10 |
| ISBN (Electronic) | 9781450362955 |
| DOIs | |
| Publication status | Published - 5 Aug 2019 |
| Event | 48th International Conference on Parallel Processing, ICPP 2019 - Kyoto, Japan Duration: 5 Aug 2019 → 8 Aug 2019 https://dl.acm.org/doi/proceedings/10.1145/3337821 (Conference Proceedings) |
Publication series
| Name | ACM International Conference Proceeding Series |
|---|
Conference
| Conference | 48th International Conference on Parallel Processing, ICPP 2019 |
|---|---|
| Country/Territory | Japan |
| City | Kyoto |
| Period | 5/08/19 → 8/08/19 |
| Internet address |
|
UN SDGs
This output contributes to the following UN Sustainable Development Goals (SDGs)
-
SDG 9 Industry, Innovation, and Infrastructure
User-Defined Keywords
- Resource provisioning
- Cloud dynamics
- Workflows
Fingerprint
Dive into the research topics of 'Incorporating probabilistic optimizations for resource provisioning of data processing workflows'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver