Abstract
In this paper we present an efficient dense matrix multiplication algorithm for distributed-memory computers with a hypercube topology. The proposed algorithm outperforms all previously proposed algorithms over a wide range of matrix sizes and processor counts, especially for large matrices. We analyze the performance of the algorithms on two types of hypercube architecture: one in which each node can use at most one communication link at a time (to send and receive), and one in which each node can use all of its communication links simultaneously.
| Original language | English |
|---|---|
| Pages (from-to) | 75-99 |
| Number of pages | 25 |
| Journal | Parallel Computing |
| Volume | 22 |
| Issue number | 1 |
| State | Published - Jan 1996 |
Keywords
- 3-D grids
- Distributed algorithms
- Hypercubes
- Interprocessor communication
- Matrix multiplication
Title: Communication-efficient matrix multiplication on hypercubes