TY - GEN
T1 - Composition of Algorithmic Building Blocks in Template Task Graphs
AU - Herault, Thomas
AU - Schuchart, Joseph
AU - Valeev, Edward F.
AU - Bosilca, George
N1 - Publisher Copyright:
© 2022 IEEE.
PY - 2022
Y1 - 2022
N2 - In this paper, we explore the composition capabilities of the Template Task Graph (TTG) programming model. We show how fine-grain composition of tasks is possible in TTG between DAGs belonging to different libraries, even in a distributed setup. We illustrate the benefits of this fine-grain composition on a linear algebra operation, the matrix inversion via the Cholesky method, which consists of three operations that need to be applied in sequence.Evaluation on a cluster of many core shows that the transparent fine-grain composition implements the complex operation without introducing unnecessary synchronizations, increasing the overlap of communication and computation, and thus improving significantly the performance of the entire composed operation.
AB - In this paper, we explore the composition capabilities of the Template Task Graph (TTG) programming model. We show how fine-grain composition of tasks is possible in TTG between DAGs belonging to different libraries, even in a distributed setup. We illustrate the benefits of this fine-grain composition on a linear algebra operation, the matrix inversion via the Cholesky method, which consists of three operations that need to be applied in sequence.Evaluation on a cluster of many core shows that the transparent fine-grain composition implements the complex operation without introducing unnecessary synchronizations, increasing the overlap of communication and computation, and thus improving significantly the performance of the entire composed operation.
KW - Layered software design
KW - Library composition
KW - Task-Based Runtime System
KW - Template Task Graph
UR - https://www.scopus.com/pages/publications/85148041049
U2 - 10.1109/PAW-ATM56565.2022.00008
DO - 10.1109/PAW-ATM56565.2022.00008
M3 - Conference contribution
AN - SCOPUS:85148041049
T3 - Proceedings of PAW-ATM 2022: Parallel Applications Workshop, Alternatives to MPI+X, Held in conjunction with SC 2022: The International Conference for High Performance Computing, Networking, Storage and Analysis
SP - 26
EP - 38
BT - Proceedings of PAW-ATM 2022
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 5th Annual IEEE/ACM Parallel Applications Workshop, Alternatives to MPI+X, PAW-ATM 2022
Y2 - 13 November 2022 through 18 November 2022
ER -