Asset Publisher

Media Lens King, stock.adobe.com

Facts and Findings

AI in Subsahara-Africa – Afrofuturism without African languages?

by Jan-Ole Voß

The challenges for large language models

As a technology, AI has enormous potential to make many processes simpler, faster and more efficient. AI would therefore be suitable for so-called leap-frogging. If AI were integrated quickly and well, especially the global South could benefit from it. However, large amounts of data are required for the development of large language models (e.g. ChatGPT). This could slow down the spread of AI on the African continent. Liberal and transparent data policy should therefore play a central role in the political discourse there.

Asset Publisher

So-called large language models (LLMs) are designed to recognise, understand and reproduce language. An LLM trained with appropriate data can generate speech based on predictions. The best-known example of LLMs is GPT-3.5 from OpenAi. Sub-Saharan Africa and Europe have a difficult time developing large language models. Common reasons include the diversity of spoken languages and a lack of available data for training LLMs. The ongoing development of AI models promises a far-reaching increase in efficiency and facilitates access to medical diagnoses. 

Initial developments of LLMs in regionally dominant languages are focussing on those languages that cover the largest markets. This could reduce linguistic diversity in the digital space. Especially the African continent could be impacted. Around a third of all languages spoken worldwide originate from sub-Saharan Africa. Estimates put the total number of African languages at between 1,000 and 2,000. There are currently 75 African languages, spoken by one million or more people. Many of the up to 2,000 languages exist only in oral tradition; there are often no standardised written traditions, let alone digitised data sets. As LLM development is very complex and costly, many people are calling for the development of digitalised data sets. As LLM development is very complex and costly, many fear that numerous smaller languages will be increasingly squeezed out of the digital space over time. Innovative, efficiency- and participation-enhancing AI solutions already exist and offer great potential for human and economic development in sub-Saharan Africa, but require good framework conditions. In order to promote the dissemination of AI applications, increased cooperation with data collection institutions is recommended, and consideration should also be given to promoting liberal data protection regulations among political partners.

Read the entire analysis: "KI in Subsahara-Afrika. Ein Afrofuturismus ohne afrikanische Sprachen?" here as PDF.

Asset Publisher

Contact

Jan-Ole Voß

Jan-Ole Voß

Deputy Head of the Kenya Office and Project Manager

jan-ole.voss@kas.de +254 20 2610021/2

comment-portlet

Asset Publisher