Fine-tuning mT5 to predict the language a text is written in. We achieve a 99.9% success rate!

LouisCaubet/mT5-language-prediction

Language prediction using mT5

Project description

I fine-tune Google's pretrained mT5 model on the XNLI dataset to detect the language a text is written in.

The model is capable of predicting the language of a text written in one of the 14 languages of the XNLI dataset.
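Because mT5 is a text-to-text model, language detection can be framed as mapping an input sentence to a short target string naming its language. The following is a minimal sketch of that framing and of an exact-match accuracy check, not the notebook's actual code: the `language: ` prefix, the use of language codes as targets, the helper names, and the sample sentences are all assumptions made for illustration.

```python
from typing import Dict, Iterable

# Hypothetical task prefix; the notebook may use a different one, or none.
TASK_PREFIX = "language: "


def to_text_pair(sentence: str, lang_code: str) -> Dict[str, str]:
    """Turn one labelled sentence into a text-to-text training example."""
    return {"inputs": TASK_PREFIX + sentence.strip(), "targets": lang_code}


def exact_match_accuracy(predictions: Iterable[str], targets: Iterable[str]) -> float:
    """Fraction of predictions that equal their target after trimming whitespace."""
    pairs = list(zip(predictions, targets))
    if not pairs:
        return 0.0
    correct = sum(p.strip() == t.strip() for p, t in pairs)
    return correct / len(pairs)


# Made-up sentences for illustration; real examples would come from XNLI.
samples = [
    ("The cat sleeps.", "en"),
    ("Le chat dort.", "fr"),
    ("Die Katze schläft.", "de"),
]
examples = [to_text_pair(sentence, code) for sentence, code in samples]
```

With this framing, a success rate is simply the share of generated target strings that match the expected language label exactly.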

Code

The code is contained in a Jupyter Notebook meant to be run on Google Colab using a TPU environment and a GCS bucket for data storage.

Click here to open the notebook in Colab: https://colab.research.google.com/github/LouisCaubet/mT5-language-prediction/blob/main/mT5-language-prediction.ipynb

Sources
