HALT-PROP: Human-annotated Lithuanian textual corpus for propaganda narratives and techniques

Direct Link:
Collection:
Mokslo publikacijos / Scientific publications
Document Type:
Straipsnis / Article
Language:
Anglų kalba / English
Title:
HALT-PROP: Human-annotated Lithuanian textual corpus for propaganda narratives and techniques
In the Journal:
Scientific data, 2026, 13, 47, 1-13
Summary / Abstract:

ENIn the contemporary technological landscape, propaganda has become one of the most pervasive tools in information warfare. Social media platforms and entire media ecosystems are leveraged to disseminate hostile propaganda aimed at polarizing societies, destabilizing states, and eroding longstanding democratic processes. Malign propaganda is not only common in widely-spoken languages but also targets less-spoken languages to maximize its reach and influence. While progress has been made in developing models capable of detecting propaganda, most advances have focused on high-resource languages. In contrast, low-resource languages continue to face significant limitations, the most critical being the scarcity of annotated datasets. In many regions and countries, such resources are entirely absent. To address this gap, we present the HALT-PROP dataset, the first human-annotated Lithuanian textual propaganda corpus. The corpus comprises two complementary datasets: (1) 2,870 news articles manually labeled by five experts at the article level to identify the presence of propaganda; and (2) a subset of 1,000 articles annotated for specific propaganda techniques and narratives using a cross-annotation approach.

DOI:
10.1038/s41597-025-06367-w
ISSN:
2052-4463
Subject:
Permalink:
https://www.lituanistika.lt/content/88304
Updated:
2026-03-04 11:06:45
Metrics:
Views: 4
Export: