Back to overview

SiL: An Approach for Adjusting Applications to Heterogeneous Systems Under Perturbations

Type of publication Peer-reviewed
Publikationsform Proceedings (peer-reviewed)
Author Mohammed Ali, Ciorba Florina M.,
Project Multilevel Scheduling in Large Scale High Performance Computers
Show all

Proceedings (peer-reviewed)

ISBN 978-3-030-10548-8
Title of proceedings Euro-Par 2018: Parallel Processing WorkshopsEuro-Par 2018 International Workshops
DOI 10.1007/978-3-030-10549-5

Open Access

Type of Open Access Website


Scientific applications consist of large and computationally-intensive loops. Dynamic loop scheduling (DLS) techniques are used to load balance the execution of such applications. Load imbalance can be caused by variations in loop iteration execution times due to problem, algorithmic, or systemic characteristics (also perturbations). The following question motivates this work: “Given an application, a high-performance computing (HPC) system, and their characteristics and interplay, which DLS technique will achieve improved performance under unpredictable perturbations?” Existing work only considers perturbations caused by variations in the HPC system delivered computational speeds. However, perturbations in available network bandwidth or latency are inevitable on production HPC systems. Simulator in the loop (SiL) is introduced, herein, as a new control-theoretic inspired approach to dynamically select DLS techniques that improve the performance of applications on heterogeneous HPC systems under perturbations. The present work examines the performance of six applications on a heterogeneous system under all above system perturbations. The SiL proof of concept is evaluated using simulation. The performance results confirm the initial hypothesis that no single DLS technique can deliver best performance in all scenarios, whereas the SiL-based DLS selection achieved improved application performance in most experiments.