Skip to main navigation Skip to search Skip to main content

Parallel sparse FFT

  • Cheng Wang
  • , Mauricio Araya-Polo
  • , Sunita Chandrasekaran
  • , Amik St-Cyr
  • , Barbara Chapman
  • , Detlef Hohl
  • University of Houston
  • Royal Dutch Shell PLC

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

10 Scopus citations

Abstract

The Fast Fourier Transform (FFT) is a widely used numerical algorithm. When N input data points lead to only k ≪ N non-zero coefficients in the transformed domain, the algorithm is clearly inefficient: the FFT performs O(NlogN) operations on N input data points in order to calculate only k non-zero or large coefficients, and N - k zero or negligibly small ones. The recently developed sparse FFT (sFFT) algorithm provides a solution to this problem. As are those for the FFT, sFFT algorithms are complex and still computationally challenging. The computational difficulties are mainly due to memory access patterns that are irregular and dynamically changing. Modern compute platforms are exclusively based on multi-core processors, therefore a natural path to enhance the sFFT's performance is to exploit parallelism. This is the approach chosen in this work. We have analyzed in detail and parallelized the most time consuming segments of the algorithm. Our parallel sFFT (PsFFT) implementation achieves approximately 60% parallel efficiency on a single 8-core Intel Sandy Bridge socket for relevant test cases. In addition, we apply several techniques such as index coalescing, data affiliated loops and multi-level blocking techniques to alleviate memory access congestion and increase performance.

Original languageEnglish
Title of host publicationProc. of IA3 2013 - 3rd Workshop on Irregular Appl.
Subtitle of host publicationArchitectures and Algorithms, Held in Conjunction with SC 2013: The Int. Conf. for High Performance Computing, Networking, Storage and Analysis
DOIs
StatePublished - 2013
Event3rd Workshop on Irregular Applications: Architectures and Algorithms, IA3 2013 - Held in Conjunction with the International Conference for High Performance Computing, Networking, Storage and Analysis, SC 2013 - Denver, CO, United States
Duration: Nov 17 2013Nov 22 2013

Publication series

NameProc. of IA3 2013 - 3rd Workshop on Irregular Appl.: Architectures and Algorithms, Held in Conjunction with SC 2013: The Int. Conf. for High Performance Computing, Networking, Storage and Analysis

Conference

Conference3rd Workshop on Irregular Applications: Architectures and Algorithms, IA3 2013 - Held in Conjunction with the International Conference for High Performance Computing, Networking, Storage and Analysis, SC 2013
Country/TerritoryUnited States
CityDenver, CO
Period11/17/1311/22/13

Keywords

  • irregular algorithms
  • OpenMP
  • sparse FFT

Fingerprint

Dive into the research topics of 'Parallel sparse FFT'. Together they form a unique fingerprint.

Cite this