Predict_MissingWords

Masked-Language Modeling With BERT

What is BERT?

BERT (Bidirectional Encoder Representations from Transformers) is a transformer-based method of learning language representations. It is a bidirectional transformer model pre-trained on a large corpus with a combination of two objectives: masked language modeling and next sentence prediction.

Through PyTorch-Transformers we can use BERT's pre-trained language model to predict missing words. We can also fine-tune BERT's pre-trained language model to fit our task and then use that model to gain some improvements.
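As a rough illustration, the snippet below is a minimal sketch of predicting a masked word with a pre-trained BERT model through pytorch-transformers; the bert-base-uncased checkpoint and the example sentence are placeholder choices for this sketch, not necessarily the exact setup used in this repository.

```python
import torch
from pytorch_transformers import BertTokenizer, BertForMaskedLM

# Load a pre-trained BERT checkpoint (placeholder choice: bert-base-uncased).
tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
model = BertForMaskedLM.from_pretrained("bert-base-uncased")
model.eval()

# Tokenize a sentence and replace one word with [MASK] for BERT to fill in.
text = "[CLS] she went to the store yesterday . [SEP]"
tokens = tokenizer.tokenize(text)
mask_index = tokens.index("went")   # position of the word we want BERT to predict
tokens[mask_index] = "[MASK]"
input_ids = torch.tensor([tokenizer.convert_tokens_to_ids(tokens)])

with torch.no_grad():
    logits = model(input_ids)[0]    # shape: (1, sequence_length, vocab_size)

# Top-5 candidate tokens for the masked position.
top_ids = torch.topk(logits[0, mask_index], k=5).indices.tolist()
print(tokenizer.convert_ids_to_tokens(top_ids))
```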

What is the purpose of this project?

In this project, I identified the verbs in each sentence and then used BERT to predict other verbs that could replace them in the same position. I defined 6 prediction levels and chose the best candidates to display on the site.

e.g.

[sample: example of masked-language modeling with BERT]
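The verb-recognition step is not detailed in this README; as one possible sketch (assuming NLTK's part-of-speech tagger, which may differ from what the project actually uses), each verb can be located, replaced with [MASK], and then handed to BERT as in the snippet above.

```python
import nltk

# One-time downloads for the tokenizer and POS-tagger models.
nltk.download("punkt")
nltk.download("averaged_perceptron_tagger")

sentence = "she walked to the store yesterday ."
words = nltk.word_tokenize(sentence)
tags = nltk.pos_tag(words)

# Penn Treebank verb tags all start with "VB" (VB, VBD, VBG, VBN, VBP, VBZ).
masked_words = ["[MASK]" if tag.startswith("VB") else word for word, tag in tags]
masked_sentence = "[CLS] " + " ".join(masked_words) + " [SEP]"
print(masked_sentence)  # this masked sentence can then be scored by BERT as above
```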

Two approaches were implemented with BERT: using the pre-trained model directly and using a fine-tuned language model. Both gave almost the same results, because our dataset was written in English.

If our data differs from the data used for pre-training, the results will not be as satisfactory. For example, if we have a mix of Persian and English data and we use a model pre-trained on English Wikipedia, the predictions will be poor. In that scenario, we need to fine-tune our language model. But if our dataset is in English, we do not need to fine-tune the model.
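For completeness, the snippet below is a minimal sketch of what fine-tuning the masked-language-modeling head on in-domain text can look like with pytorch-transformers; the optimizer, learning rate, example sentences, and masked position are placeholder choices for illustration only, and newer versions of the library use different argument names for the labels.

```python
import torch
from pytorch_transformers import BertTokenizer, BertForMaskedLM

tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
model = BertForMaskedLM.from_pretrained("bert-base-uncased")
model.train()

optimizer = torch.optim.Adam(model.parameters(), lr=5e-5)

# Placeholder in-domain sentences; in practice, load your own corpus.
sentences = ["he walks to work every day .", "she reads a book every night ."]

for epoch in range(2):
    for sent in sentences:
        tokens = ["[CLS]"] + tokenizer.tokenize(sent) + ["[SEP]"]
        token_ids = tokenizer.convert_tokens_to_ids(tokens)

        # Label only the masked position; -1 positions are ignored by the loss.
        labels = [-1] * len(token_ids)
        masked_pos = 2                      # arbitrary position, for illustration
        labels[masked_pos] = token_ids[masked_pos]
        token_ids[masked_pos] = tokenizer.convert_tokens_to_ids(["[MASK]"])[0]

        input_ids = torch.tensor([token_ids])
        lm_labels = torch.tensor([labels])

        loss = model(input_ids, masked_lm_labels=lm_labels)[0]
        loss.backward()
        optimizer.step()
        optimizer.zero_grad()

model.save_pretrained("finetuned-bert")     # reload later with from_pretrained
```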