TY - GEN
T1 - SHCOLL - A standalone implementation of OpenSHMEM-style collectives API
AU - Milaković, Srđan
AU - Budimlić, Zoran
AU - Pritchard, Howard
AU - Curtis, Anthony
AU - Chapman, Barbara
AU - Sarkar, Vivek
N1 - Publisher Copyright:
© 2019, Springer Nature Switzerland AG.
PY - 2019
Y1 - 2019
N2 - The performance of collective operations has a large impact on overall performance in many HPC applications. Implementing multiple algorithms and selecting optimal one depending on message size and the number of processes involved in the operation is essential to achieve good performance. In this paper, we will present SHCOLL, a collective routines library that was developed on top of OpenSHMEM API point to point operations: puts, gets, atomic memory update, and memory synchronization routines. The library is designed to serve as a plug-in to OpenSHMEM implementations and will be used by the OSSS OpenSHMEM reference implementation to support OpenSHMEM collective operations. In this paper, we describe the algorithms that have been incorporated in the implementation of each OpenSHMEM API collective routine and evaluate them on a Cray XC30 system. For long messages, SHCOLL shows an improvement by up to a factor of 12 compared to the vendor’s implementation. We also discuss future development of the library, as well as how it will be incorporated into the OSSS OpenSHMEM reference implementation.
AB - The performance of collective operations has a large impact on overall performance in many HPC applications. Implementing multiple algorithms and selecting optimal one depending on message size and the number of processes involved in the operation is essential to achieve good performance. In this paper, we will present SHCOLL, a collective routines library that was developed on top of OpenSHMEM API point to point operations: puts, gets, atomic memory update, and memory synchronization routines. The library is designed to serve as a plug-in to OpenSHMEM implementations and will be used by the OSSS OpenSHMEM reference implementation to support OpenSHMEM collective operations. In this paper, we describe the algorithms that have been incorporated in the implementation of each OpenSHMEM API collective routine and evaluate them on a Cray XC30 system. For long messages, SHCOLL shows an improvement by up to a factor of 12 compared to the vendor’s implementation. We also discuss future development of the library, as well as how it will be incorporated into the OSSS OpenSHMEM reference implementation.
UR - https://www.scopus.com/pages/publications/85064315615
U2 - 10.1007/978-3-030-04918-8_6
DO - 10.1007/978-3-030-04918-8_6
M3 - Conference contribution
AN - SCOPUS:85064315615
SN - 9783030049171
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 90
EP - 106
BT - OpenSHMEM and Related Technologies. OpenSHMEM in the Era of Extreme Heterogeneity - 5th Workshop, OpenSHMEM 2018, Revised Selected Papers
A2 - Pophale, Swaroop
A2 - Imam, Neena
A2 - Aderholdt, Ferrol
A2 - Gorentla Venkata, Manjunath
PB - Springer Verlag
T2 - 5th Workshop on OpenSHMEM and Related Technologies, 2018
Y2 - 21 August 2018 through 23 August 2018
ER -