GitHunt
FR

francescobaio/NLP_Assignments

Assignments in the realm of Natural Language Processing for Sexism Detection, developed as part of the NLP course at the University of Bologna.

NLP_Assignments

Overview of the Tasks

This repository addresses two key challenges in sexism detection using Natural Language Processing (NLP):

EXIST 2023 Task 1

🌟 Objective: Detecting and classifying sexism in textual data extracted from tweets.

πŸ” Overview:

Our analysis compared the performance of LSTM and Transformer models, with RoBERTa significantly outperforming the Custom Model. This success is attributed to RoBERTa’s larger number of parameters, Pretraining on a more extensive corpus and Fine-tuning on the specific task.

βœ… Key Takeaways:

  • Pre-trained transformer architectures like RoBERTa are highly effective for complex tasks such as sexism detection.
  • Overassociation of the term β€˜woman’ with sexism, leading to systematic misclassifications.

EDOS Task A

🌟 Objective: Classifying sexist content in textual data extracted from tweets.

πŸ” Overview:

This task focuses on the exploration of Prompting techniques using Large Language Models (LLMs): the goal is to assess their capability to understand nuanced contexts in sexist content and classify complex and ambiguous cases effectively.

βœ… Key Takeaways:

  • Few-shot prompting proves to be the most effective technique for this type of task.
  • The Chain of Thought (CoT) technique positively impacts model performance by enhancing reasoning capabilities.
  • Experiments on a smaller dataset (CLEF EXIST Task 1, 2023) highlight the importance of fine-tuning, which:
    • Enables the use of lightweight models that outperform larger LLMs.
    • Aligns with recent research findings (Bucher and Martini, 2024).

πŸ“‚ Project Structure

Here’s an overview of the repository structure:

.
β”œβ”€β”€ assignment_1/           # Files related to Assignment 1
β”‚   β”œβ”€β”€ html/  # HTML report for Assignment 1
β”‚   β”œβ”€β”€ assignment_1.ipynb                          # Jupyter notebook
β”‚   └── assignment_1.pdf                            # PDF report for Assignment 1
β”‚
β”œβ”€β”€ assignment_2/           # Files related to Assignment 2
β”‚   β”œβ”€β”€ html/  # HTML report for Assignment 2
β”‚   β”œβ”€β”€ assignment_2.ipynb                          # Jupyter notebook
β”‚   └── assignment_2.pdf                            # PDF report for Assignment 2
β”‚
β”œβ”€β”€ docs/                   # Documentation files
β”‚   └── index.html          # Main entry for online documentation
β”‚
β”œβ”€β”€ LICENSE                 # Project license
β”œβ”€β”€ README.md               # This file
β”œβ”€β”€ _quarto.yml             # Quarto configuration file
└── index.qmd               # Source file for the homepage

πŸ“© Contacts

πŸ“– Documentation

Access the full documentation and assignment reports directly from this page: Online Documentation

Languages

Jupyter Notebook97.4%JavaScript2.6%

Contributors

MIT License
Created January 16, 2025
Updated January 16, 2025