English-to-Hindi-Translation

This project translates English sentences into Hindi using a Transformer model.
It is built with TensorFlow, and this article helped me understand the implementation.

Dataset

The dataset used can be found here.
It contains around 100K English-Hindi sentence pairs.

Processing Text

First, I apply basic text cleaning: lowercasing the sentences and removing URLs, digits, and similar noise.
[Start] and [End] tags are then added to each Hindi sentence.
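The cleaning steps above can be sketched in plain Python. This is a minimal illustration, not the notebook's actual code: the function names `clean_sentence` and `add_tags`, the URL pattern, and the lowercase `[start]`/`[end]` spelling of the tags are all assumptions.

```python
import re

# Assumed patterns: the README only says that URLs and digits are removed.
URL_RE = re.compile(r"https?://\S+|www\.\S+")
DIGIT_RE = re.compile(r"\d+")  # in Python 3, \d also matches Devanagari digits


def clean_sentence(text):
    """Lowercase, drop URLs and digits, and collapse repeated whitespace."""
    text = text.lower()
    text = URL_RE.sub(" ", text)
    text = DIGIT_RE.sub(" ", text)
    return " ".join(text.split())


def add_tags(hindi_sentence):
    """Wrap a target sentence in start/end markers for the decoder."""
    return f"[start] {hindi_sentence} [end]"
```

For example, `add_tags(clean_sentence(hindi_text))` would be applied to every Hindi sentence, while English sentences would only go through `clean_sentence`.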
TextVectorization from Keras converts each sentence into a vector of integer token IDs.
The vocabulary size is 20,000 and each sentence vector has length 20.
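A possible way to set up the vectorizer with these values is shown below. The helper name `make_vectorizer` is hypothetical, and passing `standardize=None` is an assumption made here so that the brackets in the start/end tags survive; the notebook may handle standardization differently.

```python
import tensorflow as tf

VOCAB_SIZE = 20000  # vocabulary size stated in this README
SEQ_LEN = 20        # sentence length stated in this README


def make_vectorizer(sentences, standardize=None):
    """Build a TextVectorization layer and fit its vocabulary on `sentences`."""
    vectorizer = tf.keras.layers.TextVectorization(
        max_tokens=VOCAB_SIZE,
        output_mode="int",
        output_sequence_length=SEQ_LEN,  # pads or truncates to 20 tokens
        standardize=standardize,
    )
    vectorizer.adapt(tf.constant(sentences))
    return vectorizer
```

Calling the returned layer on a batch of strings yields an integer tensor of shape `(batch, 20)`, padded with zeros.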

Model

80K samples are used for training, each at most 20 words long.
The Transformer uses only 1 encoder and 1 decoder.
The embedding dimension is 128, MultiHeadAttention uses 10 heads, and the feed-forward network has a latent dimension of 2048 with a dropout of 0.2.
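The architecture described above could look roughly like the following Keras sketch. Only the hyperparameter values come from this README. The layer layout, the learned positional embedding, `key_dim=EMBED_DIM`, the placement of dropout inside the feed-forward block, and names such as `PositionalEmbedding` and `build_model` are assumptions. Padding masks are left out to keep the sketch short.

```python
import tensorflow as tf
from tensorflow.keras import layers

VOCAB_SIZE, SEQ_LEN = 20000, 20
EMBED_DIM, NUM_HEADS, LATENT_DIM, DROPOUT = 128, 10, 2048, 0.2


class PositionalEmbedding(layers.Layer):
    """Sum of a learned token embedding and a learned position embedding."""

    def __init__(self, seq_len, vocab_size, embed_dim, **kwargs):
        super().__init__(**kwargs)
        self.token_emb = layers.Embedding(vocab_size, embed_dim)
        self.pos_emb = layers.Embedding(seq_len, embed_dim)

    def call(self, tokens):
        positions = tf.range(tf.shape(tokens)[-1])
        return self.token_emb(tokens) + self.pos_emb(positions)


def residual_norm(x, sublayer_out):
    """Residual connection followed by layer normalization."""
    return layers.LayerNormalization()(layers.Add()([x, sublayer_out]))


def feed_forward(x):
    """Position-wise feed-forward block (2048 units, dropout 0.2)."""
    h = layers.Dense(LATENT_DIM, activation="relu")(x)
    h = layers.Dropout(DROPOUT)(h)
    h = layers.Dense(EMBED_DIM)(h)
    return residual_norm(x, h)


def build_model():
    enc_in = layers.Input(shape=(SEQ_LEN,), dtype="int64", name="english")
    dec_in = layers.Input(shape=(SEQ_LEN,), dtype="int64", name="hindi")

    # Single encoder block: self-attention, then feed-forward.
    x = PositionalEmbedding(SEQ_LEN, VOCAB_SIZE, EMBED_DIM)(enc_in)
    attn = layers.MultiHeadAttention(NUM_HEADS, EMBED_DIM)(x, x)
    enc_out = feed_forward(residual_norm(x, attn))

    # Single decoder block: causal self-attention, cross-attention, feed-forward.
    y = PositionalEmbedding(SEQ_LEN, VOCAB_SIZE, EMBED_DIM)(dec_in)
    self_attn = layers.MultiHeadAttention(NUM_HEADS, EMBED_DIM)(
        y, y, use_causal_mask=True
    )
    y = residual_norm(y, self_attn)
    cross_attn = layers.MultiHeadAttention(NUM_HEADS, EMBED_DIM)(y, enc_out)
    y = feed_forward(residual_norm(y, cross_attn))

    # Per-position probability distribution over the Hindi vocabulary.
    out = layers.Dense(VOCAB_SIZE, activation="softmax")(y)
    return tf.keras.Model([enc_in, dec_in], out)
```

The model takes the vectorized English sentence and the Hindi sentence shifted right by one token, and predicts the next Hindi token at each of the 20 positions.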

Training and Evaluating Model

The model is trained for up to 50 epochs with the Adam optimizer, sparse_categorical_crossentropy loss, and accuracy as the metric.
Two callbacks, ReduceLROnPlateau and EarlyStopping, are also used.
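The training setup above might be wired together as in this sketch. The optimizer, loss, metric, epoch count, and callback classes are taken from this README. The monitored quantity, `factor`, and both `patience` values are not stated, so the numbers below are placeholder assumptions, as are the function names.

```python
import tensorflow as tf


def make_callbacks():
    """Return the two callbacks named above; all thresholds are assumed."""
    return [
        tf.keras.callbacks.ReduceLROnPlateau(
            monitor="val_loss", factor=0.5, patience=2
        ),
        tf.keras.callbacks.EarlyStopping(
            monitor="val_loss", patience=5, restore_best_weights=True
        ),
    ]


def compile_and_fit(model, train_ds, val_ds, epochs=50):
    """Compile with Adam and sparse categorical cross-entropy, then train."""
    model.compile(
        optimizer="adam",
        loss="sparse_categorical_crossentropy",
        metrics=["accuracy"],
    )
    return model.fit(
        train_ds,
        validation_data=val_ds,
        epochs=epochs,
        callbacks=make_callbacks(),
    )
```

Because early stopping can end training before the 50th epoch, the epoch count acts as an upper bound.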

Evaluated on 500 samples, the model reaches a BLEU score of 24.5.
The score is modest, but I learned a lot about Transformers from this project.
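For reference, a corpus-level BLEU score can be computed as sketched below. This README does not say which BLEU implementation produced the 24.5 figure, so this is a self-contained illustration of the standard formula (clipped n-gram precision up to 4-grams, brevity penalty, one reference per sentence, no smoothing). Library implementations may tokenize or smooth differently and give different numbers.

```python
import math
from collections import Counter


def ngrams(tokens, n):
    """Count all n-grams of length n in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def corpus_bleu(references, hypotheses, max_n=4):
    """Corpus BLEU on a 0-100 scale for whitespace-tokenized sentences."""
    matches = [0] * max_n
    totals = [0] * max_n
    ref_len = hyp_len = 0
    for ref, hyp in zip(references, hypotheses):
        ref_tok, hyp_tok = ref.split(), hyp.split()
        ref_len += len(ref_tok)
        hyp_len += len(hyp_tok)
        for n in range(1, max_n + 1):
            ref_counts = ngrams(ref_tok, n)
            hyp_counts = ngrams(hyp_tok, n)
            # Clip each hypothesis n-gram count by its count in the reference.
            matches[n - 1] += sum(
                min(count, ref_counts[gram]) for gram, count in hyp_counts.items()
            )
            totals[n - 1] += sum(hyp_counts.values())
    if hyp_len == 0 or min(matches) == 0:
        return 0.0  # without smoothing, any zero precision gives BLEU 0
    log_precision = sum(math.log(m / t) for m, t in zip(matches, totals)) / max_n
    brevity_penalty = 1.0 if hyp_len > ref_len else math.exp(1 - ref_len / hyp_len)
    return 100.0 * brevity_penalty * math.exp(log_precision)
```

Here `references` would hold the 500 ground-truth Hindi sentences and `hypotheses` the model's translations, both without the start/end tags.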

Dev-Khant/English-to-Hindi-Translation
