Back to overview

Dynamic Loop Scheduling Using MPI Passive-Target Remote Memory Access

Type of publication Peer-reviewed
Publikationsform Proceedings (peer-reviewed)
Author Eleliemy Ahmed, Ciorba Florina M.,
Project Multilevel Scheduling in Large Scale High Performance Computers
Show all

Proceedings (peer-reviewed)

Title of proceedings The Euromicro International Conference on Parallel, Distributed, and Network-Based Processing (PDP)
DOI 10.1109/pdp.2019.00019

Open Access

Type of Open Access Repository (Green Open Access)


Scientific applications often contain large computationally-intensive parallel loops. Loop scheduling techniques aim to achieve load balanced executions of such applications. For distributed-memory systems, existing dynamic loop scheduling (DLS) libraries are typically MPI-based, and employ a master-worker execution model to assign variably-sized chunks of loop iterations. The master-worker execution model may adversely impact performance due to the master-level contention. This work proposes a distributed chunk-calculation approach that does not require the master-worker execution scheme. Moreover, it considers the novel features in the latest MPI standards, such as passive-target remote memory access, shared-memory window creation, and atomic read-modify-write operations. To evaluate the proposed approach, five well-known DLS techniques, two applications, and two heterogeneous hardware setups have been considered. The DLS techniques implemented using the proposed approach outperformed their counterparts implemented using the traditional master-worker execution model.