Skip to main navigation Skip to search Skip to main content

Optimizing latency and throughput of application workflows on clusters

  • Ohio State University

Research output: Contribution to journalArticlepeer-review

12 Scopus citations

Abstract

Scheduling, in many application domains, involves optimization of multiple performance metrics. For example, application workflows with real-time constraints have strict throughput requirements and also desire a low latency or response time. In this paper, we present a novel algorithm for the scheduling of workflows that act on a stream of input data. Our algorithm focuses on the two performance metrics, latency and throughput, and minimizes the latency of workflows while satisfying strict throughput requirements. We also describe steps to use the above approach to solve the problem of meeting latency requirements while maximizing throughput. We leverage pipelined, task and data parallelism in a coordinated manner to meet these objectives and investigate the benefit of task duplication in alleviating communication overheads in the pipelined schedule for different workflow characteristics. The proposed algorithm is designed for a realistic bounded multi-port communication model, where each processor can simultaneously communicate with at most k distinct processors. Experimental evaluation using synthetic benchmarks as well as those derived from real applications shows that our algorithm consistently produces lower latency schedules that meet throughput requirements, even when previously proposed schemes fail.

Original languageEnglish
Pages (from-to)694-712
Number of pages19
JournalParallel Computing
Volume37
Issue number10-11
DOIs
StatePublished - Oct 2011

Keywords

  • Data-parallelism
  • Latency
  • Pipelined-parallelism
  • Streaming applications
  • Task-parallelism
  • Throughput

Fingerprint

Dive into the research topics of 'Optimizing latency and throughput of application workflows on clusters'. Together they form a unique fingerprint.

Cite this