Algorithms & Analysis / Storage & Data (13) Scalable I/O for Extreme Performance

Name:Algorithms & Analysis / Storage & Data (13) Scalable I/O for Extreme Performance
Time:Monday, June 18, 2012
03:00 PM - 08:00 PM
Room:Hall H, #911
Speakers:Alvaro Aguilera, Technische Universität Dresden
Description: In order for the I/O operations not to become the scalability bottleneck of HPC applications, the file system and I/O infrastructure must keep pace with the increasing performance and number of computing cores present on HPC systems. In this context, a global optimization of the file system turns out to be next to impossible. In part due to the disparate nature of the requirements and expectations of different user groups, and in part because currently there is no way to identify abnormal I/O behavior and trace it back to its source. SIOX main goal is to gain an overview of all the I/O activity taking place on a HPC system, and to use this information to optimize it. The project`s scope spans the development of standardized interfaces to collect, reduce, and store performance data from all relevant layers. This information will then be analyzed and correlated with previously observed access patterns in order to gain an understanding of the characteristics of the system. This knowledge will be the starting point for subsequent performance optimizations aimed at specific users and applications, carried out through e.g. the automatic tuning of Open MPI or file system parameters. Such use-profiles are going to be continuously created and not only helpful for optimization, but also when diagnosing acute performance problems, or when planning new acquisitions. In the course of the project, an holistic approach for I/O analysis should be conceived, implemented and applied.