Read The Times Australia

Daily Bulletin

AI systems are built on English – but not the kind most of the world speaks

  • Written by: Celeste Rodriguez Louro, Associate professor, Chair of Linguistics and Director of Language Lab, The University of Western Australia
AI systems are built on English – but not the kind most of the world speaks

An estimated 90% of the training data for current generative AI systems stems from English. However, English is an international lingua franca with about 1.5 billion speakers worldwide, and countless varieties.

So whose English is today’s technology based on? The answer is primarily the English of mainstream America.

This is no accident. Mainstream American English is entrenched in the digital infrastructure of the internet, in Silicon Valley’s corporate priorities, and in the data sets that fuel everything from autocorrect to AI-generated synthetic text.

The consequence? AI models produce a monolithic version of English that erases variation, excludes minoritised and regional voices, and reinforces unequal power dynamics.

The hegemony of mainstream American English

The proliferation of American English online is a result of historical, economic and technological factors. The United States has been a dominant force in the development of the internet, content creation, and the rise of tech giants such as Google, Meta, Microsoft and OpenAI.

Unsurprisingly, the linguistic norms embedded in products by these companies are overwhelmingly mainstream American.

A recent study found that speakers of non-mainstream English were frustrated with the “homogeneity of AI accents” in voice-cloning and speech-generation technologies. One participant noted the predominant mainstream American accents in the voices available, stating the technologies had been built “with some other people in mind”.

Mainstream varieties of English have long reigned as the “standard” against which other varieties are weighed.

To take a single example from the US, linguistics research by John Baugh found that using different accents can determine people’s access to goods and services. When Baugh called different landlords about housing advertised in the local newspaper, using a mainstream accent procured him several housing inspections while using African-American and Latino accents did not.

The prestige of mainstream English also underpins algorithmic decisions. The models behind tools such as autocorrect, voice-to-text, or even AI writing assistants are most often trained on mainstream American-centric data. This is often scraped from the web, where US-based media, forums and platforms dominate.

This means variations in grammar, syntax and vocabulary from other varieties of English are systematically ignored, misinterpreted or outright “corrected”.

Whose English is perceived as adding value?

The stakes of this linguistic bias in favour of mainstream English become even higher when AI systems are deployed around the world.

If an AI tutor fails to understand a Nigerian English construction, who bears the cost? If a job application written in Indian English is marked down by an AI-powered resume scanner, what are the consequences? If an Australian First Nations elder’s oral history is transcribed by voice recognition software and the system fails to capture culturally significant terms, what knowledge is lost or misrepresented?

These questions are unfolding in real time as governments, educational institutions and corporations adopt AI technologies at scale.

Englishes, not English

The idea that there is one “good” or “correct” English is a myth. English is spoken in diverse forms across regions, shaped by local societies, cultures, histories and identities.

As Noongar writer and educator Glenys Collard and I have written, Aboriginal English has “its own structure, rules and the same potential as any other linguistic variety” and the same is true of other forms of English.

Indian English, for example, has lexical innovations such as “prepone” (the opposite of postpone). Singapore English (Singlish) integrates particles and syntactic features from Malay, Hokkien and Tamil.

These are not “broken” forms of English. Each community where English was imposed has gone on to make English its own.

English, and language more generally, is never static. It adapts to meet the needs of an ever-changing society and its speakers.

Yet in AI development, this linguistic diversity is often treated as noise rather than signal. Non-standardised varieties are underrepresented in training datasets, excluded from annotation schemes, and rarely feature in evaluation benchmarks.

This results in an AI ecosystem that is multilingual in theory, but monolingual in practice.

Toward linguistic justice in AI

So, what would it look like to build AI systems that recognise and respect a range of different forms of English?

A shift in mindset is required, from prescribing “correct” language to including many varieties of language. What we need are systems that accommodate linguistic variation.

This may involve supporting community-led efforts to document and digitise linguistic varieties on their own terms, bearing in mind not all linguistic varieties should be digitised or documented.

Collaboration across disciplines is also important. It requires linguists, technologists, educators and community leaders working together to ensure AI development is grounded in principles of linguistic justice.

The goal is not to “fix” language but to create technology that produces just outcomes. The focus should be on changing the technology, not the speaker.

Embracing Englishes

English has been a powerful vehicle of empire, but it has also been a tool of resistance, creativity and solidarity. Around the world, speakers have taken the language and made it their own. AI-enabled systems should be built to be as inclusive of this variability as possible.

So next time your phone tells you to “correct” your spelling, or an AI chatbot misunderstands your phrasing, ask yourself: whose English is it trying to model? And whose English is being left out?

Authors: Celeste Rodriguez Louro, Associate professor, Chair of Linguistics and Director of Language Lab, The University of Western Australia

Read more https://theconversation.com/ai-systems-are-built-on-english-but-not-the-kind-most-of-the-world-speaks-249710

Business News

Reducing Sales Friction Through Centralized Content Delivery

Sales friction appears whenever buyers or sales teams face unnecessary obstacles in the buying journey. It can happen when information is hard to find, when messaging feels inconsistent, when product ...

Daily Bulletin - avatar Daily Bulletin

Why Choosing the Right Bollard Supplier Matters for Australian Businesses and Public Spaces

From busy CBD streetscapes to sprawling warehouse loading docks, bollards have become one of the most essential safety and security fixtures across Australia. Whether protecting pedestrians from veh...

Daily Bulletin - avatar Daily Bulletin

Why Modular Content Is Transforming Modern Marketing Teams

Modern marketing teams are expected to produce more content than ever before. They need to support websites, landing pages, email campaigns, social channels, product pages, sales enablement material...

Daily Bulletin - avatar Daily Bulletin

Everything You Need to Know About Getting Support from Optus

Whether you've been an Optus customer for years or you've just switched over, at some point you'll probably need to contact their support team. Maybe your bill looks different from what you expected. ...

Daily Bulletin - avatar Daily Bulletin

The Marketing Strategy That’s Quietly Draining Sydney Business Owners’ Bank Accounts

Sydney businesses are investing more in digital marketing than ever before. The intention is clear. More visibility should mean more leads, more customers, and steady growth. However, many business ...

Daily Bulletin - avatar Daily Bulletin

Why Mining Hose Solutions Are Essential For High-Performance Industrial Operations

In environments where the ground itself is constantly shifting, breaking, and being reshaped, every component must be built to endure. Mining operations are among the most demanding in the industria...

Daily Bulletin - avatar Daily Bulletin

The Reason Talented Teams Underperform

If you’re in business, you might have seen it before. A team of capable and smart people just suddenly slows down, and things start spiraling out of control. On paper, everything looks perfect, but ...

Daily Bulletin - avatar Daily Bulletin

Why More Aussie Tradies Are Moving Away From Paid Ads

Across Australia, a lot of tradies are busy. There’s no shortage of demand in industries like plumbing, electrical, landscaping, and building. But being busy doesn’t always mean running a smooth or...

Daily Bulletin - avatar Daily Bulletin

Why Careers In The Defence Industry Are Growing Rapidly

The defence sector has evolved far beyond traditional roles, opening doors to a wide range of opportunities across technology, engineering, intelligence, and operations. This is where defense industry...

Daily Bulletin - avatar Daily Bulletin

The Daily Magazine

Australia’s Best Walking Trails and the Shoes You Need to Tackle Them

Australia is not short on spectacular walks. You can follow ocean cliffs in Victoria, cross ancien...

Why Pre-Purchase Building Inspections Are Essential Before Buying a Home in Australia

source Have you ever walked through an open home and started picturing your furniture, family d...

5 Signs Your Car Needs Immediate Attention Before It Breaks Down

Car problems rarely appear without warning. In most cases, your vehicle gives clear signals before...

Ensuring Safety and Efficiency with Professional Electrical Solutions

For businesses in Newcastle, a safe and fully functioning workplace remains a key part of day-to-d...

Choosing The Right Bin Hire Solution For Hassle-Free Waste Management

When it comes to managing waste efficiently, finding the right solution can save both time and eff...

Why Cleanliness Is Critical In Childcare Environments

Children explore the world with curiosity, often touching surfaces, sharing toys, and interacting ...

What to Look for in a Reliable Australian Engineering Partner

Choosing an engineering partner is rarely just about technical capability. Most businesses can fin...

How to Choose a Funeral Home That Supports Families with Care

Choosing a funeral home is rarely something families do under ideal circumstances. It often happen...

Why Premium Coffee Matters in Modern Hospitality Venues

In hospitality, details shape perception long before a guest consciously evaluates them.  Lightin...