30 November 2025 to 3 December 2025
Century City Convention Centre
Africa/Johannesburg timezone
Invoice request deadline is this Friday 31 October 2025.

Federated Computing for Health Data Science in Africa

Not scheduled
20m
BICC.G-D1 - D1 Tsitsikamma (Boardwalk Convention Centre)

BICC.G-D1 - D1 Tsitsikamma

Boardwalk Convention Centre

120
Talk HPC Technology HPC Technology

Speaker

Mr Takudzwa Nyasha Musarurwa (University of Cape Town - eLwazi ODSP)

Description

There is a great need to develop computing infrastructure to support the increased application of data science and health informatics across Africa which includes robust data sharing and federated computing, whilst fostering research collaboration. The Global Alliance for Genomics and Health (GA4GH; https://www.ga4gh.org/) aims to promote responsible data sharing standards through the use of open, community derived standards and APIs such as Data Repository Service (DRS), Workflow Execution Service (WES), Data Connect, Passports and Tool Registry Service (TRS), amongst others. The DRS API provides a generic interface to access data in repositories. The data is discovered through the Data Connect API which supports federated search of different kinds of data. The WES API provides a standardized approach for accessing computing resources with use of reproducible workflows, usually housed in a tools registry service such as Dockstore (https://dockstore.org/). The eLwazi Open Data Science Platform (ODSP) has undertaken a pilot implementation of the GA4GH standards with the aim of delivering a federated framework for data discovery and analysis within Africa for the DS-I Africa consortium. The eLwazi GA4GH pilot project was started in June 2023 as an outcome of a training hackathon by the eLwazi ODSP Infrastructure work group in collaboration with the GA4GH. The main goal of the GA4GH pilot project is to enable the findable, accessible, interoperable and reusable (FAIR) principles for data discovery and analysis. Four sites within Africa (Ilifu - South Africa, ACE Lab - Mali, ACE Lab - Uganda and UVRI - Uganda) are currently hosting the different API endpoints for authorized data discovery and analysis. From within the project we can locate DRS datasets using the Data Connect API, use workflows from Dockstore via the TRS API for reproducible analysis, and submit it to the WES API for analysis without the data leaving the actual location, which provides a technical solution for data analysis within legislative data protection constraints. We are now in the process of developing a federated approach for the imputation of African genomics data as a GA4GH implementation forum (GIF) project collaboration based on the lessons from the pilot GA4GH implementation project.

Institute University of Cape Town
Presenting Author Takudzwa N Musarurwa
Registered for the conference? No

Primary authors

Mr Takudzwa Nyasha Musarurwa (University of Cape Town - eLwazi ODSP) Mr Aboubacar Maiga (ACE Labs - Mali) Henry Mutegeki (ACE Labs - Uganda) Mr Timothy Wamala (Uganda Virus Research Onstitute) Mr Sidy Soumare (NIH ICER Mali) Rodgers Kimera Dane Kennedy Dr Hocine Bendou (University of Cape Town) Mr Ayton Meintjes (University of Cape Town - eLwazi ODSP) Mr Mamana Mbiyavanga (University of Cape Town - eLwazi ODSP) Amos Bunde Mr Miro Cupak (DNAStack) Ben Vizzier Jonathan Kayondo Dr Daudi Jjingo (ACE Labs - Uganda) Seydou Doumbia Rob Simmonds, Dr Sumir Panji (University of Cape Town - eLwazi ODSP) Brian O'Connor (GA4GH) Mr Gerrit Botha (Univeristy of Cape Town - eLwazi ODSP) Prof. Nicola Mulder (University of Cape Town - eLwazi ODSP)

Presentation Materials

There are no materials yet.