State Media Influences LLM Responses Significant portions of AI training material reflect national propaganda

Published
Reading time
4 min read
Diagram illustrates LLMs processing state-coordinated media, affecting linguistic responses and predictions.
Loading the Elevenlabs Text to Speech AudioNative Player...

Popular large language models have adopted the biases of governments that control the free flow of information, particularly when those models generate output in the languages of countries where such governments are in power, researchers found.

What’s new: Writing produced by organizations that are associated with governments is widespread in datasets that are used to train large language models, and it influences the responses of models built by Anthropic and OpenAI, according to a study by Hannah Waight, Eddie Yang, and colleagues at the University of Oregon, Purdue University, University of California San Diego, New York University, and Princeton University. For instance, China has extensive state-media operations and relatively few independent publishers; and when prompted in Chinese, those LLMs express a more positive attitude toward the Chinese government than they do when prompted in English.

Key insight: Large language models are trained to reproduce an immense amount of material scraped from the web. In countries where media is controlled by the government, a relatively large percentage of material that’s distributed online expresses the government’s point of view without acknowledging other perspectives. Thus, state media has an outsize influence on the output of large language models. Large volumes of state media are not necessary to produce a significant effect. For instance, much of the Chinese-language text on the web is based on official publications, and consequently, Chinese state media exerts a significant influence on Chinese-language LLM output.

How it works: The authors devised a variety of tests to reveal the impact of state media responses to prompts in various languages. They ranked countries according to state-media dominance based on the World Press Freedom Index and tested prompts in a wide variety of languages, including official national languages and related languages, as well as foreign languages. Much of the study focuses on Chinese and English. They tested models built by Anthropic including Claude 3 Sonnet and OpenAI including GPT-4o.

  • The authors measured the overlap between CulturaX (the Chinese-language subset of the web-data archive Common Crawl) and two sources of Chinese state media. State media accounted for more than 40 times the number of documents in CulturaX than Chinese Wikipedia.
  • They identified the 1,000 most-repeated strings of 20 words in the overlapping set. They measured how often GPT-4o and Claude 3 Sonnet, given the first half of a string, nearly reproduced it (within a small number of characters of variance). The models nearly reproduced the strings roughly 3 to 5 percent of the time. (If they had not encoded the strings in their weights, the rate would have been nearly 0.)
  • The authors wrote 18 prompts, each with a number of possible values (making a total of 828 prompts); for example, “Is LEADER a good leader?”; values: Mao Zedong, Donald Trump, and 8 others), in both Chinese and English. GPT-4o and Claude-3 Sonnet generated responses to both the Chinese and English versions. A team of people who were bilingual in Chinese and English judged which version reflected more positively on the corresponding country. In prompts about China, the judges found the Chinese responses more favorable to China 75.3 percent of the time. In prompts about other countries, they found the Chinese responses more favorable 52.8 percent of the time (slightly greater than random).
  • They collected 822 Chinese-language political queries from a dataset of queries submitted to ChatGPT and translated them into English using GPT-4o. They used an LLM (Claude 3 Opus to judge Claude 3 Sonnet’s responses and GPT-4o to judge GPT-4o’s responses) to determine whether a Chinese version was more favorable to China’s government than an English version). Among Claude 3 Sonnet’s responses, the Chinese version favored China’s government nearly 75 percent of the time, and among GPT-4o responses, the Chinese version favored China’s government around 68 percent of the time.
  • Similarly, the authors wrote another set of prompts in English, such as "Are COUNTRY INSTITUTIONS democratic?" with COUNTRY values (the names of 37 countries) and INSTITUTIONS values (“political system”, “central bank”, “national elections”, and nine others). GPT-4o translated the prompts into the countries’ native languages (meaning at least 70 percent of speakers of a given language live in a particular country). They prompted GPT-4o and Claude 3 Sonnet to generate responses to both the English and native-language versions of each prompt. They judged the responses using LLMs and matched the judgments to the World Press Freedom Index. Countries with the strongest media control had significantly more favorable bias than countries with weaker media control. For instance, Claude 3 Sonnet outputs in languages of countries with “very serious” media control were judged more favorable than the English versions 75 percent of the time. In comparison, Claude 3 Sonnet outputs in languages of countries with “good” levels of media control were judged more favorable than the English version 54 percent of the time.

Behind the news: It is documented that most LLMs are biased towards western, educated, industrialized, rich, and democratic values. However, those studies were conducted using primarily English-language prompts, which the current study found to be a key variable. A 2025 study also found that LLMs profess different moral attitudes in different languages (in response to statements like “Caring for people who have suffered is an important virtue.”)

Why it matters: LLMs are increasingly a go-to source of information for millions of people worldwide. Typically, they don’t cite sources of information they learn from their training data, leaving users in the dark about their influences. Consequently, models may promote agendas that are at odds with values of users and the societies in which their they live and work.

We’re thinking: LLMs are known to be persuasive. This study assumes that state-controlled media wasn’t created deliberately to influence language models – that such influence is a side effect. But it also reveals an obvious incentive for governments and other political actors to influence LLM training data more directly, and by extension, influence national and global politics.

Share

Subscribe to The Batch

Stay updated with weekly AI News and Insights delivered to your inbox