Gender Bias in Machine Translation

Overview

This project investigates gender bias in machine translation systems, focusing on their ability to produce gendered forms (masculine and feminine) when translating sentences from English into target languages with grammatical gender. The research evaluates the performance of several state-of-the-art models, including GPT (OpenAI), Claude (Anthropic), and Gemini (Google DeepMind).

Objective

The primary objective is to determine whether machine translation models accurately reflect gendered forms when translating a demonym (nationality) into languages with grammatical gender. The study seeks to highlight patterns of gender bias, assess linguistic accuracy, and explore potential improvements.

Dataset

Source Sentence Format: "I am [demonym]"
Source Language: English
Target Languages: Spanish (spa), Italian (ita), German (deu), and French (fra).
Demonyms: The dataset includes demonyms from 193 UN-recognized countries.
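
As a rough illustration of the dataset's shape, the sketch below pairs each demonym with each target language. The three demonyms are a placeholder subset, the names `TranslationCase` and `build_cases` are hypothetical, and the template assumes the demonym directly follows "I am ".

```python
from dataclasses import dataclass
from itertools import product

# Target languages as listed in the dataset description.
TARGET_LANGUAGES = ["Spanish", "Italian", "German", "French"]


@dataclass(frozen=True)
class TranslationCase:
    demonym: str
    source: str           # English source sentence
    target_language: str


def build_cases(demonyms, languages=TARGET_LANGUAGES):
    """Pair every demonym with every target language."""
    return [
        TranslationCase(demonym, f"I am {demonym}", language)
        for demonym, language in product(demonyms, languages)
    ]


# Placeholder subset for illustration, not the project's full demonym list.
sample = ["Argentinian", "Japanese", "Kenyan"]
cases = build_cases(sample)
print(len(cases))  # one case per (demonym, language) pair
```

With one demonym per country, the full grid would hold 193 x 4 = 772 cases for each model.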

Methodology

Data Preparation:
- Compile a list of demonyms from all 193 UN-recognized countries.
- Create test sentences in the format: "I am [demonym]".
Translation:
- Translate the sentences from English to the target languages using each model (GPT, Claude, Gemini).
- Ensure consistent input prompts and translation settings for comparability.
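
One way to keep prompts comparable across models is to build every prompt from a single template and pass it to each model through the same interface. The sketch below assumes this approach; the prompt wording, `build_prompt`, `run_translations`, and the stub model are all hypothetical and stand in for the project's actual prompt and API clients.

```python
from typing import Callable, Dict, List

# Hypothetical prompt wording; the project's actual prompt is not documented here.
PROMPT_TEMPLATE = (
    'Translate the following sentence from English into {language}: "{sentence}"'
)


def build_prompt(sentence: str, language: str) -> str:
    """Fill the shared template so every model receives identical input."""
    return PROMPT_TEMPLATE.format(language=language, sentence=sentence)


def run_translations(
    sentences: List[str],
    languages: List[str],
    models: Dict[str, Callable[[str], str]],
) -> List[dict]:
    """Send the same prompts to every model and collect one row per output."""
    rows = []
    for model_name, translate in models.items():
        for language in languages:
            for sentence in sentences:
                prompt = build_prompt(sentence, language)
                rows.append({
                    "model": model_name,
                    "language": language,
                    "source": sentence,
                    "prompt": prompt,
                    "output": translate(prompt),
                })
    return rows


def stub_model(prompt: str) -> str:
    """Stand-in for a real API client; returns the prompt unchanged."""
    return prompt


rows = run_translations(["I am Kenyan"], ["Spanish", "French"], {"stub": stub_model})
for row in rows:
    print(row["model"], row["language"], row["prompt"])
```

In a real run, each entry in `models` would wrap one provider's client, with decoding settings fixed inside the wrapper so they stay the same for every sentence.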

Status

GPT: Experiment completed; metrics and detailed analysis pending.
Claude: Experiment completed; metrics and detailed analysis pending.
Gemini: Experiment completed; metrics and detailed analysis pending.
Other Languages: Updates will follow after additional testing.

Results and Metrics

(pending)
