TY - GEN
T1 - An efficient, nonintrusive, Log-Based I/O mechanism for scientific simulations on clusters
AU - Mitra, Soumyadeb
AU - Sinha, Rishi Rakesh
AU - Winslett, Marianne
AU - Jiao, Xiangmin
PY - 2005
Y1 - 2005
N2 - Scientific simulations are often very I/O intensive, requiring high HO bandwidth to store the data generated by the simulation. Traditional supercomputers have specialized HO systems with multiple I/O nodes and specialized interconnects to handle such high I/O loads. However, with the increased availability of inexpensive clusters of work-stations, more and more simulations are now run on clusters. Unfortunately, cluster supercomputers are usually not very well equipped for HO, making I/O a serious bottleneck for such applications. To address this problem, we propose Log-Based I/O (LBIO), an approach that can substantially increase the I/O performance of simulations on clusters by utilizing free space on the cluster's local disks to stage data on its way to remote storage. LBIO uses local disks to create a log of all I/O calls, and uses a background thread to replay the log at the rate that best utilizes the server and network resources. LBIO is implemented as an easy-to-use, non-intrusive library - a user can turn on LBIO by adding a single initialization call to the simulation code. LBIO also works with existing scientific I/O libraries like HDF, as well as collective libraries like ROMIO. Our performance studies on microbenchmarks and a real-world scientific simulation code show that LBIO can provide upto 35% improvement in I/O performance for raw I/O and over 50% for I/O through libraries like ROMIO or HDF.
AB - Scientific simulations are often very I/O intensive, requiring high HO bandwidth to store the data generated by the simulation. Traditional supercomputers have specialized HO systems with multiple I/O nodes and specialized interconnects to handle such high I/O loads. However, with the increased availability of inexpensive clusters of work-stations, more and more simulations are now run on clusters. Unfortunately, cluster supercomputers are usually not very well equipped for HO, making I/O a serious bottleneck for such applications. To address this problem, we propose Log-Based I/O (LBIO), an approach that can substantially increase the I/O performance of simulations on clusters by utilizing free space on the cluster's local disks to stage data on its way to remote storage. LBIO uses local disks to create a log of all I/O calls, and uses a background thread to replay the log at the rate that best utilizes the server and network resources. LBIO is implemented as an easy-to-use, non-intrusive library - a user can turn on LBIO by adding a single initialization call to the simulation code. LBIO also works with existing scientific I/O libraries like HDF, as well as collective libraries like ROMIO. Our performance studies on microbenchmarks and a real-world scientific simulation code show that LBIO can provide upto 35% improvement in I/O performance for raw I/O and over 50% for I/O through libraries like ROMIO or HDF.
UR - https://www.scopus.com/pages/publications/50149108019
U2 - 10.1109/CLUSTR.2005.347041
DO - 10.1109/CLUSTR.2005.347041
M3 - Conference contribution
AN - SCOPUS:50149108019
SN - 0780394852
SN - 9780780394858
T3 - Proceedings - IEEE International Conference on Cluster Computing, ICCC
BT - 2005 IEEE International Conference on Cluster Computing, CLUSTER
T2 - 2005 IEEE International Conference on Cluster Computing, CLUSTER
Y2 - 27 September 2005 through 30 September 2005
ER -