Skip to main navigation Skip to search Skip to main content

Evolving toward the perfect schedule: Co-scheduling job assignments and data replication in wide-area systems using a genetic algorithm

  • IBM

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

21 Scopus citations

Abstract

Traditional job schedulers for grid or cluster systems are responsible for assigning incoming jobs to compute nodes in such a way that some evaluative condition is met. Such systems generally take into consideration the availability of compute cycles, queue lengths, and expected job execution times, but they typically do not account directly for data staging and thus miss significant associated opportunities for optimisation. Intuitively, a tighter integration of job scheduling and automated data replication can yield significant advantages due to the potential for optimised, faster access to data and decreased overall execution time. In this paper we consider data placement as a first-class citizen in scheduling and use an optimisation heuristic for generating schedules. We make the following two contributions. First, we identify the necessity for co-scheduling job dispatching and data replication assignments and posit that simultaneously scheduling both is critical for achieving good makespans. Second, we show that deploying a genetic search algorithm to solve the optimal allocation problem has the potential to achieve significant speed-up results versus traditional allocation mechanisms. Through simulation, we show that our algorithm provides on average an approximately 20-45% faster makespan than greedy schedulers.

Original languageEnglish
Title of host publicationJob Scheduling Strategies for Parallel Processing - 11th International Workshop, JSSPP 2005, Revised Selected Papers
Pages173-193
Number of pages21
StatePublished - 2006
Event11th International Workshop on Job Scheduling Strategies for Parallel Processing, JSSPP 2005 - Cambridge, MA, United States
Duration: Jun 19 2005Jun 19 2005

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume3834 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference11th International Workshop on Job Scheduling Strategies for Parallel Processing, JSSPP 2005
Country/TerritoryUnited States
CityCambridge, MA
Period06/19/0506/19/05

Fingerprint

Dive into the research topics of 'Evolving toward the perfect schedule: Co-scheduling job assignments and data replication in wide-area systems using a genetic algorithm'. Together they form a unique fingerprint.

Cite this