Skip to main navigation Skip to search Skip to main content

Adapting irregular computations to large CPU-GPU clusters in the MADNESS framework

  • Northeastern University

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Scopus citations

Abstract

Graphics Processing Units (GPUs) are becoming the workhorse of scalable computations. MADNESS is a scientific framework used especially for computational chemistry. Most MADNESS applications use operators that involve many small tensor computations, resulting in a less regular organization of computations on GPUs. A single GPU kernel may have to multiply by hundreds of small square matrices (with fixed dimension ranging from 10 to 28). We demonstrate a scalable CPU-GPU implementation of the MADNESS framework over a 500-node partition on the Titan supercomputer. For this hybrid CPU-GPU implementation, we observe up to a 2.3-times speedup compared to an equivalent CPU-only implementation with 16 cores per node. For smaller matrices, we demonstrate a speedup of 2.2-times by using a custom CUDA kernel rather than a cuBLAS-based kernel.

Original languageEnglish
Title of host publicationProceedings - 2012 IEEE International Conference on Cluster Computing, CLUSTER 2012
PublisherIEEE Computer Society
Pages1-9
Number of pages9
ISBN (Print)9780768548074
DOIs
StatePublished - 2012
Event2012 IEEE International Conference on Cluster Computing, CLUSTER 2012 - Beijing, China
Duration: Sep 24 2012Sep 28 2012

Publication series

NameProceedings - 2012 IEEE International Conference on Cluster Computing, CLUSTER 2012

Conference

Conference2012 IEEE International Conference on Cluster Computing, CLUSTER 2012
Country/TerritoryChina
CityBeijing
Period09/24/1209/28/12

Keywords

  • GPU
  • heterogeneous CPU-GPU
  • hybrid CPU-GPU
  • irregular computation
  • supercomputing

Fingerprint

Dive into the research topics of 'Adapting irregular computations to large CPU-GPU clusters in the MADNESS framework'. Together they form a unique fingerprint.

Cite this