This project investigates gender bias in machine translation systems, focusing on their ability to produce gendered forms (masculine and feminine) when translating sentences from English into target languages with grammatical gender. The research evaluates the performance of several state-of-the-art models, including GPT (OpenAI), Claude (Anthropic), and Gemini (Google DeepMind).
The primary objective is to determine whether machine translation models accurately reflect gendered forms when translating a demonym (nationality) into languages with grammatical gender. The study seeks to highlight patterns of gender bias, assess linguistic accuracy, and explore potential improvements.
- Source Sentence Format: "I am "
- Source Language: English
- Target Languages: Spanish (spa), Italian (it), German (deu), and French (fra).
- Demonyms: The dataset includes demonyms from 193 UN-recognized countries.
-
Data Preparation:
- Compile a list of demonyms from all 193 UN-recognized countries.
- Create test sentences in the format: "I am ".
-
Translation:
- Translate the sentences from English to the target languages using each model (GPT, Claude, Gemini).
- Ensure consistent input prompts and translation settings for comparability.
- GPT: Completed.
- Experiment completed, but metrics and detailed analysis are still pending.
- Claude: Completed.
- Experiment completed, but metrics and detailed analysis are still pending.
- Gemini: Completed.
- Experiment completed, but metrics and detailed analysis are still pending.
- Other Languages: Updates will follow after additional testing.
- (pending)