Centre for High Performance Computing 2021 National Conference

Name: Centre for High Performance Computing 2021 National Conference
Start: 2021-12-01T10:00:00+02:00
End: 2021-12-03T20:00:00+02:00
Location: No location set

1-3 December 2021

Africa/Johannesburg timezone

Conference Videos Available

Info

helpdesk@chpc.ac.za

Processing longitudinal population data using CHPC

1 Dec 2021, 15:15

30m

Talk NICIS Cloud Projects NICIS Cloud Projects

Dr Kobus Herbst (SAPRIN)

The South African Population Research Infrastructure Network (SAPRIN) curates longitudinal population data collected by four nodes from a total population of more than 400 000 individuals. Due to the dynamic nature of these study populations data representing episodes of individual surveillance needs to be combined in a way that maintains data integrity and takes into account variations between data collection sites.

We need to deconstruct 4,5 million person years of observation into a day level dataset, requiring the kind of processing and storage capacity provided by a high performance computing environment such as CHPC.

We will describe a data processing pipeline, originally developed in Pentaho and recently converted to the julia programming language which scales well on the CHPC environment.

Dr Kobus Herbst (SAPRIN) Molulaqhooa Maoyi (SAPRIN) Tinofa Mutevedzi (SAPRIN) Mark Collinson (SAPRIN)

There are no materials yet.

Centre for High Performance Computing 2021 National Conference

Info

Processing longitudinal population data using CHPC

Speaker

Description

Primary authors

Presentation Materials