2016
Autores
Rocha, C; Jorge, A; Sinoara, RA; Brito, P; Pimenta, C; Rezende, SO;
Publicação
CoRR
Abstract
2026
Autores
Evans, JP; Cunha, LF; Silvano, P; Jorge, A; Guimarães, N; Nunes, S; Campos, R;
Publicação
CoRR
Abstract
2026
Autores
Isidro, J; Cunha, LF; Silvano, P; Jorge, A; Guimarães, N; Nunes, S; Campos, R;
Publicação
CoRR
Abstract
2025
Autores
Guimarães, N; Silvano, P; Campos, R; Jorge, AM; Pacheco, AF; Dimitrov, DI; Nikolaidis, N; Yangarber, R; Sartori, E; Stefanovitch, N; Nakov, P; Piskorski, J; San Martino, GD;
Publicação
EMNLP (Findings)
Abstract
We present NarratEX, a dataset designed for the task of explaining the choice of the Dominant Narrative in a news article, and intended to support the research community in addressing challenges such as discourse polarization and propaganda detection. Our dataset comprises 1,056 news articles in four languages, Bulgarian, English, Portuguese, and Russian, covering two globally significant topics: the Ukraine-Russia War (URW) and Climate Change (CC). Each article is manually annotated with a dominant narrative and sub-narrative labels, and an explanation justifying the chosen labels. We describe the dataset, the process of its creation, and its characteristics. We present experiments with two new proposed tasks: Explaining Dominant Narrative based on Text, which involves writing a concise paragraph to justify the choice of the dominant narrative and sub-narrative of a given text, and Inferring Dominant Narrative from Explanation, which involves predicting the appropriate dominant narrative category based on an explanatory text. The proposed dataset is a valuable resource for advancing research on detecting and mitigating manipulative content, while promoting a deeper understanding of how narratives influence public discourse.
2026
Autores
Mourthé, A; Mello, CE; Jorge, A;
Publicação
SOCIAL NETWORKS ANALYSIS AND MINING, ASONAM 2025, PT I
Abstract
As recommender systems play an increasingly central role in shaping information exposure on platforms like YouTube, understanding the nature of the content they promote, especially in sensitive contexts, requires scalable and reliable labelling methods. This paper investigates the use of Large Language Models (LLM) to label YouTube videos based solely on their metadata. We propose a committee-based approach that aggregates predictions from an ensemble of seven state-of-the-art LLMs through majority voting. Using a novel dataset collected via simulated user interactions on YouTube, we analyse model agreement, labelling behavior, and the influence of model size. To assess label reliability, we also investigate the semantic coherence of label assignments. Our results show that LLM committees produce highly consistent labels in low-disagreement settings. These findings highlight both the promise and limitations of LLM-based annotation for auditing social networks.
2026
Autores
Campos, R; Jorge, A; Jatowt, A; Bhatia, S; Litvak, M;
Publicação
ECIR (3)
Abstract
For eight years, the Text2Story Workshop series has fostered a vibrant research community dedicated to narrative understanding, advancing shared insights into the challenges of modelling narrative structure in text. While earlier approaches laid important foundations, recent progress in Transformers and Large Language Models (LLMs) has fundamentally reshaped the field. Building on the increasing prominence of LLM-based contributions in recent editions, the ninth edition of Text2Story expands the focus toward agentic AI, where systems plan, reason, and interact over time using narratives as internal representations. Recent advances, including long-context architectures, instruction and preference-tuned models, retrieval-augmented generation, and discourse-aware prompting, have broadened the applicability of LLMs to complex narrative tasks. Nevertheless, reliably capturing fine-grained narrative structures remains challenging, particularly for event chains, temporal and causal relations, character development, and perspective consistency. These challenges are amplified in interactive and agentic settings, where narrative coherence, controllability, and reliability are critical. This edition of Text2Story explores both the opportunities and limitations of LLMs and agentic systems for narrative understanding, including the analysis of narratives generated by LLMs themselves with respect to consistency, hallucination, bias, and control. Through a diverse program of research papers, works in progress, demos, resources, and keynote talks, the workshop continues to advance narrative understanding in the era of foundation and agentic models.
The access to the final selection minute is only available to applicants.
Please check the confirmation e-mail of your application to obtain the access code.