Centre for High Performance Computing 2022 National Conference

Name: Centre for High Performance Computing 2022 National Conference
Start: 2022-11-30T08:00:00+02:00
End: 2022-12-02T21:00:00+02:00
Location: CSIR ICC

30 November 2022 to 2 December 2022

CSIR ICC

Africa/Johannesburg timezone

The conference is now live. Late registrations for the physical conference can be made at the conference venue in Pretoria.

Info

helpdesk@chpc.ac.za

Time to Revisit Erasure Codes in Data-intensive Clusters

2 Dec 2022, 12:30

30m

ICC-G-Emerald - Emerald Auditorium (CSIR ICC)

ICC-G-Emerald - Emerald Auditorium

CSIR ICC

Talk Storage and IO HPC

Shadi Ibrahim (Inria)

Replication has been successfully employed and practiced to ensure high data availability in large-scale distributed storage systems. However, with the relentless growth of generated and collected data, replication has become expensive not only in terms of storage cost but also in terms of network cost and hardware cost. Traditionally, erasure coding (EC) is employed as a cost-efficient alternative to replication when high access latency to the data can be tolerated. However, with the continuous reduction in its CPU overhead, EC is performed on the critical path of data access. For instance, EC has been integrated into the last major release of Hadoop Distributed File System (HDFS) which is the primary storage backend for data analytic frameworks such as Hadoop and Spark. This talk explores some of the potential benefits of erasure coding in data-intensive clusters and discusses aspects that can help to realize EC effectively for data-intensive applications.

Shadi Ibrahim (Inria)

There are no materials yet.

Centre for High Performance Computing 2022 National Conference

Info

Time to Revisit Erasure Codes in Data-intensive Clusters

ICC-G-Emerald - Emerald Auditorium

CSIR ICC

Speaker

Description

Primary author

Presentation Materials