Sexism Detection

This project, developed as part of the Natural Language Processing (NLP) course at the University of Bologna (UniBo), addresses the problem of sexism detection in text. The goal is to classify whether a given text (tweets) contains or describes sexist expressions or behaviors. The project explores a range of modern NLP techniques, including LSTM-based models, Transformer-based models, and Large Language Models (LLMs).

Introduction

This project tackles the challenge of sexism detection using a variety of modern NLP techniques. It is divided into two main assignments:

Assignment 1: Focuses on LSTM-based models and Transformer-based models for sexism detection.

Dataset: A small version of EXIST dataset Github repository.
Assignment 2: Explores Large Language Models (LLMs) for zero-shot and few-shot prompting for sexism detection.

Dataset: A small test set version of EDOS Github repository.

Approaches

Assignment 1: LSTM and Transformer-Based Models

LSTM-Based Models

Three LSTM-based models were implemented:

Baseline Model: A Bidirectional LSTM with a Dense layer on top.
Model 1: Extends the Baseline by adding an additional LSTM layer.
Model 2: Uses two LSTM layers with the same hidden dimension.

Transformer-Based Models

The project fine-tuned the Twitter-roBERTa-base for Hate Speech Detection model, available on Hugging Face, for sexism detection. This model leverages the power of pre-trained transformer architectures to achieve state-of-the-art performance.

Assignment 2: LLM-Based Models

This part of the project focuses on Large Language Models (LLMs) for sexism detection using Zero-shot and Few-shot prompting.

The following LLMs were used:

Mistral-7B-Instruct-v0.3
Phi-3.5-mini-instruct

Contributors

Habib Kazemi
Hesam Sheikh Hassani
Ehsan Ramezani

License

This project is licensed under the MIT License. See the LICENSE file for details.

Name	Name	Last commit message	Last commit date
Latest commit kazemihabib Remove .Ds_Store Jan 15, 2025 f1b3448 · Jan 15, 2025 History 41 Commits
data	data	Add Assignment2 notebook and report	Jan 15, 2025
stored_data	stored_data	Add stored data frames	Dec 3, 2024
.gitignore	.gitignore	finished task 3	Nov 14, 2024
Assignment1.ipynb	Assignment1.ipynb	Rename Assignment1 notebook file and add report	Jan 15, 2025
Assignment1_report.pdf	Assignment1_report.pdf	Rename Assignment1 notebook file and add report	Jan 15, 2025
Assignment2_report.pdf	Assignment2_report.pdf	Add Assignment2 notebook and report	Jan 15, 2025
Assignment_2.ipynb	Assignment_2.ipynb	Add Assignment2 notebook and report	Jan 15, 2025
README.md	README.md	Add Readme	Jan 15, 2025
final_predictions_1.pkl	final_predictions_1.pkl	Update error analysis for LSTM	Jan 7, 2025
lstm_results_1.pkl	lstm_results_1.pkl	Update error analysis for LSTM	Jan 7, 2025
preprocessing_strategies.png	preprocessing_strategies.png	finished preprocessing and embedding	Nov 15, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Sexism Detection

Table of Contents

Introduction

Approaches

Assignment 1: LSTM and Transformer-Based Models

LSTM-Based Models

Transformer-Based Models

Assignment 2: LLM-Based Models

Contributors

License

About

Releases

Packages

Contributors 3

Languages

kazemihabib/Sexism_Detection

Folders and files

Latest commit

History

Repository files navigation

Sexism Detection

Table of Contents

Introduction

Approaches

Assignment 1: LSTM and Transformer-Based Models

LSTM-Based Models

Transformer-Based Models

Assignment 2: LLM-Based Models

Contributors

License

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages