Estonian Language Institute Releases Benchmark for LLMs on Propaganda Resistance

The Estonian Language Institute has launched a benchmark to assess large language models' ability to resist Russian propaganda. This initiative aims to address concerns about the potential spread of misleading narratives by evaluating LLMs across various categories of Russian influence operations.

The Estonian Language Institute (ELI) has introduced a new benchmark called "Propaganda Resistance" to evaluate the effectiveness of large language models (LLMs) in avoiding narratives associated with Russian propaganda. This initiative comes as concerns grow among state governments regarding the potential for LLMs to disseminate misleading information from foreign adversaries.Estonia, a former Soviet republic, remains vigilant against perceived false narratives from Russia. In collaboration with the volunteer-run defense group Propastop, the ELI identified 14 categories of Russian influence operations that could affect public discourse. These categories include topics related to Crimea, the war in Ukraine, NATO's history, and Russia's annexation of Baltic states during World War II.Researchers formulated questions for each propaganda category, which were designed to be neutral, biased with false assumptions, or intended to provoke misinformation from the LLMs. The questions were presented in English, Estonian, and Russian, and the responses were evaluated by an AI model aligned with Propastop experts to assess the LLMs' ability to counter propaganda narratives without external assistance.

Estonian Language Institute Releases Benchmark for LLMs on Propaganda Resistance

Bias Analysis

Bias Indicators Removed

Original vs. Neutral

Get the neutral brief

Bias Analysis

Bias Indicators Removed

Original vs. Neutral