🔥 MoTaCon tickets are hot! Get yours today! 🔥

Low-resource Languages

Languages that are significantly underrepresented in computational datasets and NLP research, typically because the volume of digitised text available for training is small relative to dominant languages such as English.

So what? Over 90% of the world's languages are classified as low-resource; this imbalance means AI models perform unevenly across linguistic and cultural contexts, producing representational injustice for speakers of those languages.

Example: A multilingual model trained predominantly on English-language web data may perform well for English speakers but generate unreliable or culturally inappropriate outputs for speakers of languages with limited digital corpora.

Source: https://botpopuli.net/llm-extractivism-and-the-politics-of-the-knowledge-commons/

Rosie Sherry

22nd June 2026

Add Definition

Explore MoT

MoTaCon 2026

Thu, 1 Oct

A tech conference to help you navigate the ever-shifting landscape of Quality Engineering, AI, Leadership, Product, Accessibility and Security.

Prompting for Testers

Unleash the power of generative AI to boost your software testing and day-to-day tech tasks

02 Sep 24

Course

Into The Motaverse

Into the MoTaverse is a podcast by Ministry of Testing, hosted by Rosie Sherry, exploring the people, insights, and systems shaping quality in modern software teams.

Low-resource Languages

Modern Systems Need Modern Testing

MoTaCon 2026