Description
Deep learning has proven successful for a wide variety of tasks, and its popularity has produced numerous open-source deep learning software tools. It has been applied to a broad spectrum of scientific domains, including cosmology, particle physics, computer vision, fusion, and astrophysics. Scientists have devoted a great deal of effort to optimizing the computational performance of deep learning frameworks, but the same cannot be said for I/O performance. Because deep learning algorithms rely on large volumes and varieties of data to train neural networks accurately, I/O is a significant bottleneck in large-scale distributed deep learning training.
In this talk, I present a detailed investigation of the I/O behavior of various scientific deep learning workloads running on the Theta supercomputer at the Argonne Leadership Computing Facility. I then introduce DLIO, a novel representative benchmark suite built on the I/O profiling of the selected workloads. DLIO accurately emulates the I/O behavior of modern scientific deep learning applications. Using DLIO, application developers and system software architects can identify potential I/O bottlenecks in their applications and guide optimizations that boost I/O performance, reducing training time by up to 6.7x.
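As a rough illustration of what emulating the I/O behavior of a data-intensive training loop can look like (this is not DLIO's actual implementation or API; all file names, sizes, and parameters below are hypothetical), the sketch below replays a shuffled, batched sample-read pattern against synthetic files and reports the time spent in I/O.

    import os
    import random
    import time

    # Hypothetical parameters standing in for a profiled workload's access pattern;
    # these are illustrative values, not numbers taken from the DLIO study.
    NUM_FILES = 4               # files in the synthetic dataset
    SAMPLES_PER_FILE = 32       # samples stored contiguously in each file
    SAMPLE_BYTES = 256 * 1024   # bytes per sample
    BATCH_SIZE = 8
    COMPUTE_TIME_S = 0.05       # emulated per-batch compute; no actual training happens

    def generate_dataset(root="synthetic_dl_data"):
        """Write synthetic files so the read pattern can be replayed."""
        os.makedirs(root, exist_ok=True)
        paths = []
        for i in range(NUM_FILES):
            path = os.path.join(root, f"part_{i}.bin")
            with open(path, "wb") as f:
                f.write(os.urandom(SAMPLES_PER_FILE * SAMPLE_BYTES))
            paths.append(path)
        return paths

    def emulate_epoch(paths):
        """Replay a data-loader-like pattern: shuffled per-sample reads, batched."""
        index = [(p, s) for p in paths for s in range(SAMPLES_PER_FILE)]
        random.shuffle(index)
        io_seconds = 0.0
        for start in range(0, len(index), BATCH_SIZE):
            t0 = time.perf_counter()
            for path, sample in index[start:start + BATCH_SIZE]:
                with open(path, "rb") as f:
                    f.seek(sample * SAMPLE_BYTES)
                    f.read(SAMPLE_BYTES)      # read one sample's worth of bytes
            io_seconds += time.perf_counter() - t0
            time.sleep(COMPUTE_TIME_S)        # stand-in for the compute phase
        return io_seconds

    if __name__ == "__main__":
        files = generate_dataset()
        spent = emulate_epoch(files)
        print(f"Emulated one epoch; time spent in I/O: {spent:.2f} s")

Substituting the read sizes, file counts, and access order observed during profiling is what lets an emulator of this kind approximate a real workload's I/O behavior without running the full training pipeline.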