GitHunt
IL

IldikoPilan/llm_irony

Data used for testing LLMs' abilities to detect irony. The experiments are described in a paper accepted at SemDial 2025.

Description

This repository contains the data used in the paper: Margareta Berg, Ildikó Pilán, Ingrid Lossius Falkum and Pierre Lison. Pragmatic Reasoning for Irony Detection with Large Language Models in English and Norwegian. To appear in the Proceedings of the SemDial 2025 Workshop Series on the Semantics and Pragmatics of Dialogue. (The link to the published paper will be added soon.)

Part of this material is adapted from a previous pragmatics experiment on irony and perspective-taking in children (Köder and Falkum, 2021), centered around simple situations involving a child and an adult. In the LLM-adapted version, each task was subdivided into two prompts: one consisting of a short context and a question about the adult speaker’s intent, and another containing the child’s action, the adult’s reaction and a question about the adult’s emotion. We added two follow-up questions for each task for investigating the presence of irony with LLMs: an indirect and a direct one. We complemented the original 12 stories with 24 new unique stories. The final dataset thus comprises 108 items derived from 36 unique stories, each associated with one of three possible outcomes: irony, praise, or criticism – the latter two representing non-ironic reactions.

References

Franziska Köder and Ingrid Lossius Falkum. (2021). Irony and perspective-taking in children: The roles of norm violations and tone of voice. Frontiers in Psychology, 12:624604.

Data structure

Each JSON file consists of a list of dictionaries with the following keys:

  • want: sentences introducing the situation and asking about what the adult wants;
  • emotion: description of an even occurring followed by a question about the adult's emotion;
  • gold_label: the manually annotated label indicating the condition, one of 'irony', 'praise' or 'criticism';
  • item_type: 'original' if from the original study with human participants, 'original_with_new_condition' if the original story was re-written with a new condition, 'new' if the item was added for the LLM-based study;
  • item_id: a running number identifying each item, consistent across the two languages.

Contributors

MIT License
Created August 7, 2025
Updated August 8, 2025