Skip to main navigation Skip to search Skip to main content

An efficient, nonintrusive, Log-Based I/O mechanism for scientific simulations on clusters

  • Soumyadeb Mitra
  • , Rishi Rakesh Sinha
  • , Marianne Winslett
  • , Xiangmin Jiao
  • UIUC

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

4 Scopus citations

Abstract

Scientific simulations are often very I/O intensive, requiring high HO bandwidth to store the data generated by the simulation. Traditional supercomputers have specialized HO systems with multiple I/O nodes and specialized interconnects to handle such high I/O loads. However, with the increased availability of inexpensive clusters of work-stations, more and more simulations are now run on clusters. Unfortunately, cluster supercomputers are usually not very well equipped for HO, making I/O a serious bottleneck for such applications. To address this problem, we propose Log-Based I/O (LBIO), an approach that can substantially increase the I/O performance of simulations on clusters by utilizing free space on the cluster's local disks to stage data on its way to remote storage. LBIO uses local disks to create a log of all I/O calls, and uses a background thread to replay the log at the rate that best utilizes the server and network resources. LBIO is implemented as an easy-to-use, non-intrusive library - a user can turn on LBIO by adding a single initialization call to the simulation code. LBIO also works with existing scientific I/O libraries like HDF, as well as collective libraries like ROMIO. Our performance studies on microbenchmarks and a real-world scientific simulation code show that LBIO can provide upto 35% improvement in I/O performance for raw I/O and over 50% for I/O through libraries like ROMIO or HDF.

Original languageEnglish
Title of host publication2005 IEEE International Conference on Cluster Computing, CLUSTER
DOIs
StatePublished - 2005
Event2005 IEEE International Conference on Cluster Computing, CLUSTER - Burlington, MA, United States
Duration: Sep 27 2005Sep 30 2005

Publication series

NameProceedings - IEEE International Conference on Cluster Computing, ICCC
ISSN (Print)1552-5244

Conference

Conference2005 IEEE International Conference on Cluster Computing, CLUSTER
Country/TerritoryUnited States
CityBurlington, MA
Period09/27/0509/30/05

Fingerprint

Dive into the research topics of 'An efficient, nonintrusive, Log-Based I/O mechanism for scientific simulations on clusters'. Together they form a unique fingerprint.

Cite this