Skip to main navigation Skip to search Skip to main content

Performance Study on CPU-based Machine Learning with PyTorch

  • Stony Brook University

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

7 Scopus citations

Abstract

Over the past decade we have seen a surge in research in Machine Learning. Deep neural networks represent a subclass of machine learning and are computationally intensive. Traditionally, GPUs have been leveraged to accelerate the training of such deep networks by taking advantage of parallelization and the many core architecture. As the datasets and models grow larger, scaling the training or inference task can help reduce the time to solution for research or production purposes. The Supercomputer Fugaku established state of the art results in multiple benchmarks in machine learning by scaling ARM based CPU technology. To that end, we study and present the performance of machine learning training and inference tasks on 64-bit ARM CPU architecture by exploiting its features namely the Scalable Vector Extensions (SVE) in the ARMv8-A.

Original languageEnglish
Title of host publicationProceedings of International Conference on High Performance Computing in Asia-Pacific Region Workshops, HPC Asia 2023
PublisherAssociation for Computing Machinery
Pages24-34
Number of pages11
ISBN (Electronic)9781450399890
DOIs
StatePublished - Feb 27 2023
Event2023 International Conference on High Performance Computing in Asia-Pacific Region Workshops, HPC Asia 2023 - Singapore, Singapore
Duration: Feb 27 2023Mar 2 2023

Publication series

NameACM International Conference Proceeding Series

Conference

Conference2023 International Conference on High Performance Computing in Asia-Pacific Region Workshops, HPC Asia 2023
Country/TerritorySingapore
CitySingapore
Period02/27/2303/2/23

Keywords

  • Distributed Learning
  • High Performance Computing
  • Machine Learning
  • Scalability

Fingerprint

Dive into the research topics of 'Performance Study on CPU-based Machine Learning with PyTorch'. Together they form a unique fingerprint.

Cite this