This repository contains neural language model implementations trained and evaluated on the Penn Treebank dataset.
- Multi-layer LSTM with Dropout: The link to the notebook is here. With the default parameters it reaches a test perplexity of about 80.6 (see the sketch after this list).
- Gated Convolutional Networks with Residual Connections: The link to the notebook is here. With the default parameters it reaches a test perplexity of about 70.9.
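For orientation, here is a minimal sketch of the kind of multi-layer LSTM language model the first notebook implements. The class name and hyperparameters (embedding size, hidden size, layer count, dropout rate) are illustrative assumptions, not the values used in rnn.py:

```python
import torch
import torch.nn as nn

class LSTMLanguageModel(nn.Module):
    """Multi-layer LSTM language model with dropout (illustrative sketch)."""
    def __init__(self, vocab_size, embed_dim=400, hidden_dim=400,
                 num_layers=2, dropout=0.5):  # hypothetical defaults
        super(LSTMLanguageModel, self).__init__()
        self.drop = nn.Dropout(dropout)
        self.embed = nn.Embedding(vocab_size, embed_dim)
        # dropout= here applies dropout between the stacked LSTM layers.
        self.lstm = nn.LSTM(embed_dim, hidden_dim, num_layers,
                            dropout=dropout, batch_first=True)
        self.decoder = nn.Linear(hidden_dim, vocab_size)

    def forward(self, tokens, hidden=None):
        # tokens: (batch, seq_len) word ids -> logits over the vocabulary
        emb = self.drop(self.embed(tokens))
        out, hidden = self.lstm(emb, hidden)
        return self.decoder(self.drop(out)), hidden
```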
The GCNN trains much faster than the LSTM because the stacked convolutions are computed in parallel across all positions, whereas the LSTM must step through the sequence token by token. However, this implementation currently works only with fixed sequence lengths; I am still unclear on how to handle variable lengths.
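To make the parallelism concrete, below is a minimal sketch of one gated convolutional block with a residual connection, in the style of the GLU from Dauphin et al.'s paper. The class name, channel count, and kernel size are illustrative assumptions and may differ from gcnn.py:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class GatedConvBlock(nn.Module):
    """One gated (GLU) convolutional block with a residual connection."""
    def __init__(self, channels, kernel_size=3):  # hypothetical sizes
        super(GatedConvBlock, self).__init__()
        self.conv = nn.Conv1d(channels, channels, kernel_size)  # values
        self.gate = nn.Conv1d(channels, channels, kernel_size)  # gates
        self.pad = kernel_size - 1  # left-pad so no position sees the future

    def forward(self, x):
        # x: (batch, channels, seq_len); all positions are convolved in
        # one shot, which is why training parallelizes better than an LSTM.
        h = F.pad(x, (self.pad, 0))
        out = self.conv(h) * torch.sigmoid(self.gate(h))  # GLU gating
        return out + x  # residual connection preserves the input path
```

Stacking several such blocks grows the receptive field while keeping every layer fully parallel over the sequence.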
You will need PyTorch 0.4 and Python 3.5 to run this.
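If PyTorch is not already set up, one way to install a matching version (assuming a pip-based environment; the exact wheel depends on your platform and CUDA setup, so check pytorch.org if this fails) is `pip3 install torch==0.4.0`.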
- For the LSTM model, run `python3 rnn.py`
- For the GCNN model, run `python3 gcnn.py`