Centre for High Performance Computing 2025 National Conference

Name: Centre for High Performance Computing 2025 National Conference
Start: 2025-11-30T08:00:00+02:00
End: 2025-12-03T22:20:00+02:00
Location: Century City Conference Centre

30 November 2025 to 3 December 2025

Century City Conference Centre

Africa/Johannesburg timezone

PLEASE NOTE: Registrations Have Closed! Contact chpc@csir.co.za for further queries.

Info

helpdesk@chpc.ac.za

Recognising South African Voices: A Multilingual ASR Pipeline

3 Dec 2025, 16:20

15m

1/1-7 - Room 7 (Century City Conference Centre)

1/1-7 - Room 7

Century City Conference Centre

Invited Talk DIRISA DIRISA

Ms Mahlatse Mbooi (CSIR)

South Africa’s rich linguistic diversity poses unique challenges for artificial intelligence systems, particularly in automatic speech recognition (ASR) where multilingual speakers frequently switch languages mid-conversation. This study proposes a robust ASR pipeline tailored for code-switched speech in health settings, addressing practical issues such as overlapping dialogue, background noise, and inconsistent language usage. The pipeline will integrate multilingual acoustic models and language-specific preprocessing techniques, trained on a standardised dataset comprising South African languages including isiZulu, Sepedi and English.
By focusing on pipeline design, dataset standardisation and multilingual integration, this work demonstrates how AI can be built to truly understand South African voices rather than ignoring them. Structured and reproducible approaches to code-switched data lay the foundation for inclusive, fair, and context-aware AI that represents local language communities and highlight the broader opportunities for leveraging multilingual data responsibly.

Institute	CSIR, MRatsoma@csir.co.za
Presenting Author	Mahlatse Mbooi

Ms Mahlatse Mbooi (CSIR)

Recognising_South_African_Voices.docx

Centre for High Performance Computing 2025 National Conference

Info

Recognising South African Voices: A Multilingual ASR Pipeline

1/1-7 - Room 7

Century City Conference Centre

Speaker

Description

Primary author

Presentation Materials