TY - GEN
T1 - Task scheduling and file replication for data-intensive jobs with batch-shared I/O
AU - Khanna, Gaurav
AU - Vydyanathan, Nagavijayalakshmi
AU - Catalyurek, Umit
AU - Kurc, Tahsin
AU - Krishnamoorthy, Sriram
AU - Sadayappan, P.
AU - Saltz, Joel
PY - 2006
Y1 - 2006
N2 - This paper addresses the problem of efficient execution of a batch of data-intensive tasks with batch-shared I/O behavior on coupled storage and compute clusters. Two scheduling schemes are proposed: 1) a 0-1 Integer Programming (IP) based approach, which couples task scheduling and data replication, and 2) a bi-level hypergraph partitioning based heuristic approach (BiPartition), which decouples task scheduling and data replication. The experimental results show that: 1) the IP scheme achieves the best batch execution time but has significant scheduling overhead, thereby restricting its application to small-scale workloads, and 2) the BiPartition scheme is a better fit for larger workloads and systems: it has very low scheduling overhead and no more than 5-10% degradation in solution quality when compared with the IP based approach.
UR - https://www.scopus.com/pages/publications/33845869918
M3 - Conference contribution
AN - SCOPUS:33845869918
SN - 1424403073
SN - 9781424403073
T3 - Proceedings of the IEEE International Symposium on High Performance Distributed Computing
SP - 241
EP - 252
BT - Proceedings of the 15th IEEE International Symposium on High Performance Distributed Computing, HPDC-15
T2 - 15th IEEE International Symposium on High Performance Distributed Computing, HPDC-15
Y2 - 19 June 2006 through 23 June 2006
ER -