African Next Voices Featured in Nature: Reshaping Speech Technology for Africa
Nature profiles the African Next Voices project, the largest multilingual speech data initiative in Africa β and DSFSI is proud to lead the South African leg. Learn how weβre making African languages AI-ready and inviting partners to build with us.

π° Weβre in Nature!
π
Published: 29 July 2025
π Full article on Nature.com
The African Next Voices (ANV) project β a pan-African initiative to address the deep underrepresentation of African languages in AI β has been profiled in Nature News. With over 9,000 hours of recorded speech in 18 languages across South Africa, Kenya, and Nigeria, this work is the most extensive open dataset creation effort of its kind on the continent.
βAI models are neglecting African languages β scientists want to change that.β β Nature, July 2025
DSFSI Leads the South African Leg πΏπ¦
At the Data Science for Social Impact group (DSFSI) at the University of Pretoria, weβre proud to be coordinating the South African arm of ANV, which focuses on seven languages:
isiZulu, isiXhosa, Sesotho, Setswana, Sepedi, isiNdebele, and Tshivenda.
Our work spans:
- ποΈ Community-driven data collection and transcription
- π§ͺ Aligning data to key social sectors like education, health, and agriculture
- π Supporting downstream use in speech recognition, translation, and language models
Ethically Licensed, Openly Available π
All datasets are being released under open and permissive licensing that ensures public access β while requiring appropriate attribution to support transparency, research recognition, and responsible use. We believe that equitable AI starts with equitable data.
π Explore the project and South African resources here:
π https://www.dsfsi.co.za/za-african-next-voices/
π₯ Call to Action: Letβs Collaborate!
Are you a researcher, developer, civil society organization, or product team looking to work with speech data in African languages?
We want to hear from you!
π© Reach out via dsfsi.info@up.ac.za β letβs explore how you can use, contribute to, or extend this growing multilingual resource for Africaβs AI future.