1-4 December 2024
Boardwalk Convention Centre
Africa/Johannesburg timezone
Keynote starting now at 19:00.

® XIO: Towards eXplainable I/O using multi-layer monitoring solution for HPC Systems.

3 Dec 2024, 16:30
20m
BICC.G-D1 - D1 Tsitsikamma (Boardwalk Convention Centre)

BICC.G-D1 - D1 Tsitsikamma

Boardwalk Convention Centre

120
Talk Storage and IO HPC Technology

Speaker

Hari Hariharan Devarajan (LLNL)

Description

Modern HPC workloads exchange vast amounts of data to drive scientific discoveries While HPC systems employ diverse storage devices and tiers to support efficient data access, current monitoring infrastructures, such as Darshan and Score-P, only provide enough information to show what they see, but lack the visibility needed to fully explain observed I/O performance.

In this talk, I will present our latest survey of state-of-the-art monitoring tools deployed on modern HPC system using lists such as TOP500, Green500, IO500, and the Comprehensive Data Center List (CDCL). Then, we will introduce our latest efforts to tackle the opaque monitoring infrastructure that explores the user and kernel I/O stack to uncover causality relationships for achieving eXplainable I/O (XIO).

Primary author

Hari Hariharan Devarajan (LLNL)

Presentation Materials

There are no materials yet.