30 November 2025 to 3 December 2025
Century City Conference Centre
Africa/Johannesburg timezone
PLEASE NOTE: Registrations Have Closed! Contact chpc@csir.co.za for further queries.

Recognising South African Voices: A Multilingual ASR Pipeline

3 Dec 2025, 16:20
15m
1/1-7 - Room 7 (Century City Conference Centre)

1/1-7 - Room 7

Century City Conference Centre

50
Invited Talk DIRISA DIRISA

Speaker

Ms Mahlatse Mbooi (CSIR)

Description

South Africa’s rich linguistic diversity poses unique challenges for artificial intelligence systems, particularly in automatic speech recognition (ASR) where multilingual speakers frequently switch languages mid-conversation. This study proposes a robust ASR pipeline tailored for code-switched speech in health settings, addressing practical issues such as overlapping dialogue, background noise, and inconsistent language usage. The pipeline will integrate multilingual acoustic models and language-specific preprocessing techniques, trained on a standardised dataset comprising South African languages including isiZulu, Sepedi and English.
By focusing on pipeline design, dataset standardisation and multilingual integration, this work demonstrates how AI can be built to truly understand South African voices rather than ignoring them. Structured and reproducible approaches to code-switched data lay the foundation for inclusive, fair, and context-aware AI that represents local language communities and highlight the broader opportunities for leveraging multilingual data responsibly.

Institute CSIR, MRatsoma@csir.co.za
Presenting Author Mahlatse Mbooi

Primary author

Ms Mahlatse Mbooi (CSIR)

Presentation Materials