This repository contains the code used for the following paper:
Improving Neural Language Models by Segmenting, Attending, and Predicting the Future
This code is based on the AWD-LSTM (https://github.com/salesforce/awd-lstm-lm) and Transformer-XL (https://github.com/kimiyoung/transformer-xl) implementations.
Please let me (Hongyin) know via email if you have any questions about the paper or code. If you use this code or our results in your research, please cite as appropriate:
@InProceedings{Luo2019ACL,
author = {Luo, Hongyin and Jiang, Lan and Belinkov, Yonatan and Glass, James},
title = {Improving Neural Language Models by Segmenting, Attending, and Predicting the Future},
booktitle = {Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics (ACL)},
month = {July},
year = {2019},
address = {Florence},
publisher = {Association for Computational Linguistics},
}
- Python 3 and PyTorch 0.4 are required for the LSTM language models on PTB and WikiText-2.
- Python 2 and TensorFlow 1.12.0 are required for the Transformer-XL language model on WikiText-103.
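Since the two model families need different Python and framework versions, it is easiest to keep them in separate environments. The following is a minimal sketch using conda; the environment names and the exact package choices (a 0.4.x PyTorch build, tensorflow-gpu vs. tensorflow) are illustrative assumptions, not part of this repo:

# environment for the LSTM models (PTB / WikiText-2)
conda create -n pilm-lstm python=3.6
conda activate pilm-lstm
pip install torch==0.4.1

# environment for the Transformer-XL model (WikiText-103)
conda create -n pilm-txl python=2.7
conda activate pilm-txl
pip install tensorflow-gpu==1.12.0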
For data setup, run ./getdata.sh. This script collects the Mikolov pre-processed Penn Treebank and the WikiText-2 datasets and places them in the data directory.
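If getdata.sh follows the usual AWD-LSTM data layout (an assumption; check the script itself), the resulting directory structure should look roughly like this:

data/penn/train.txt
data/penn/valid.txt
data/penn/test.txt
data/wikitext-2/train.txt
data/wikitext-2/valid.txt
data/wikitext-2/test.txt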
You can train an LSTM language model on PTB using the following command. The checkpoint will be stored in ./models/
./train_span.sh MODEL_FILE_NAME
Running this will train a language model that achieves validation / test perplexities of approximately 59.6 / 57.5.
The finetuning process can be done with the following command:
./finetune_ptb.sh MODEL_FILE_NAME
Finetuning produces a language model that achieves perplexities of 57.8 / 55.7.
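For example, an end-to-end PTB run could look like the following, where ptb_pilm.pt is a hypothetical checkpoint name (any file name should work), and the finetune script is assumed to take the name of the previously trained checkpoint:

./getdata.sh
./train_span.sh ptb_pilm.pt        # checkpoint written under ./models/
./finetune_ptb.sh ptb_pilm.pt      # finetune the same checkpoint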
You can train an LSTM language model on WT2 using the following command. The checkpoint will be stored in ./models/
./train_span_wt2.sh MODEL_FILE_NAME
Running this will train a language model that achieves validation / test perplexities of approximately 68.4 / 65.2.
The finetuning process can be done with the following command:
./finetune_wt2.sh MODEL_FILE_NAME
Finetuning produces a language model that achieves perplexities of 66.9 / 64.1.
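The WikiText-2 pipeline mirrors the PTB one; using a separate checkpoint name (wt2_pilm.pt here is again hypothetical) lets both models coexist under ./models/:

./train_span_wt2.sh wt2_pilm.pt
./finetune_wt2.sh wt2_pilm.pt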
CODE COMING SOON
Download the pretrained Transformer-XL + Phrase Induction model here to reproduce the 17.4 perplexity on the test set of WT103.