TY - GEN
T1 - A machine learning framework to improve storage system performance
AU - Akgun, Ibrahim Umit
AU - Aydin, Ali Selman
AU - Shaikh, Aadil
AU - Velikov, Lukas
AU - Zadok, Erez
N1 - Publisher Copyright:
© 2021 Association for Computing Machinery.
PY - 2021/7/20
Y1 - 2021/7/20
N2 - Storage systems and their OS components are designed to accommodate a wide variety of applications and dynamic workloads. Storage components inside the OS contain various heuristic algorithms to provide high performance and adaptability for different workloads. These heuristics may be tunable via parameters, and some system calls allow users to optimize their system performance. These parameters are often predetermined based on experiments with limited applications and hardware. Thus, storage systems often run with these predetermined and possibly suboptimal values. Tuning these parameters manually is impractical: one needs an adaptive, intelligent system to handle dynamic and complex workloads. Machine learning (ML) techniques are capable of recognizing patterns, abstracting them, and making predictions on new data. ML can be a key component to optimize and adapt storage systems. In this position paper, we propose KML, an ML framework for storage systems. We implemented a prototype and demonstrated its capabilities on the well-known problem of tuning optimal readahead values. Our results show that KML has a small memory footprint, introduces negligible overhead, and yet enhances throughput by as much as 2.3x.
AB - Storage systems and their OS components are designed to accommodate a wide variety of applications and dynamic workloads. Storage components inside the OS contain various heuristic algorithms to provide high performance and adaptability for different workloads. These heuristics may be tunable via parameters, and some system calls allow users to optimize their system performance. These parameters are often predetermined based on experiments with limited applications and hardware. Thus, storage systems often run with these predetermined and possibly suboptimal values. Tuning these parameters manually is impractical: one needs an adaptive, intelligent system to handle dynamic and complex workloads. Machine learning (ML) techniques are capable of recognizing patterns, abstracting them, and making predictions on new data. ML can be a key component to optimize and adapt storage systems. In this position paper, we propose KML, an ML framework for storage systems. We implemented a prototype and demonstrated its capabilities on the well-known problem of tuning optimal readahead values. Our results show that KML has a small memory footprint, introduces negligible overhead, and yet enhances throughput by as much as 2.3x.
KW - Machine learning
KW - Operating systems
KW - Storage performance optimization
KW - Storage systems
UR - https://www.scopus.com/pages/publications/85112029989
U2 - 10.1145/3465332.3470875
DO - 10.1145/3465332.3470875
M3 - Conference contribution
AN - SCOPUS:85112029989
T3 - HotStorage 2021 - Proceedings of the 13th ACM Workshop on Hot Topics in Storage and File Systems
SP - 94
EP - 102
BT - HotStorage 2021 - Proceedings of the 13th ACM Workshop on Hot Topics in Storage and File Systems
PB - Association for Computing Machinery, Inc
T2 - 13th ACM Workshop on Hot Topics In Storage and File Systems, HotStorage 2021
Y2 - 27 July 2021 through 28 July 2021
ER -