This repository contains the code used for the word-level language modeling and unsupervised parsing experiments in the paper Neural Language Modeling by Jointly Learning Syntax and Lexicon, originally forked from the PyTorch word-level language modeling example. If you use this code or our results in your research, we'd appreciate it if you cite our paper as follows:
```
@inproceedings{shen2018neural,
  title={Neural Language Modeling by Jointly Learning Syntax and Lexicon},
  author={Yikang Shen and Zhouhan Lin and Chin-wei Huang and Aaron Courville},
  booktitle={International Conference on Learning Representations},
  year={2018},
  url={https://openreview.net/forum?id=rkgOLb-0W},
}
```
Python 2.7, NLTK and PyTorch 0.3 are required for the current codebase.
- Install PyTorch 0.3 and NLTK
- Download PTB data. Note that the two tasks, i.e., language modeling and unsupervised parsing, share the same model structure but require different formats of the PTB data. For language modeling we need the standard 10,000-word Penn Treebank corpus, and for parsing we need the Penn Treebank parsed data. A sketch of the two formats follows below.
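  As a quick way to inspect the two formats, here is a minimal sketch using the small WSJ sample that ships with NLTK. It is illustrative only: the experiments use the full LDC corpus, and the output file name `ptb_lm_sample.txt` and the punctuation filter are assumptions, not this repo's preprocessing.

  ```python
  # Minimal sketch, not the repo's preprocessing: uses NLTK's ~10% WSJ sample
  # to illustrate the two PTB formats required above.
  import nltk
  from nltk.corpus import treebank

  nltk.download('treebank')

  # Language modeling format: plain tokenized text, one sentence per line
  # (the standard 10k-vocabulary setup additionally maps rare words to <unk>).
  with open('ptb_lm_sample.txt', 'w') as f:  # illustrative file name
      for sent in treebank.sents():
          f.write(' '.join(sent) + '\n')

  # Parsing format: gold constituency trees, used for evaluation only.
  # WSJ10 keeps sentences with at most 10 words after removing punctuation.
  punct = {',', '.', ':', '``', "''", '-NONE-'}
  wsj10 = [t for t in treebank.parsed_sents()
           if len([w for w, tag in t.pos() if tag not in punct]) <= 10]
  print('WSJ10-style trees in the NLTK sample: %d' % len(wsj10))
  ```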
- Scripts and commands

  - Language Modeling
    ```
    python main_LM.py --cuda --tied --hard --data /path/to/your/data
    ```
    The default setting in `main_LM.py` achieves a test perplexity of approximately 60.97 on the PTB test set; the note below shows how such a perplexity figure relates to the average loss.
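    For reference, perplexity is the exponential of the average per-token cross-entropy (natural log, as in PyTorch's `CrossEntropyLoss`). A minimal sketch with a hypothetical loss value:

    ```python
    import math

    avg_loss = 4.11  # hypothetical average per-token negative log-likelihood
    print(math.exp(avg_loss))  # ~60.95, i.e. a perplexity close to the one above
    ```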
  - Unsupervised Parsing
    ```
    python main_UP.py --cuda --tied --hard
    ```
    The default setting in `main_UP.py` achieves an unlabeled F1 of approximately 0.70 on the standard test set of the PTB WSJ10 subset. To visualize the parsed sentence trees in nested bracket form and to evaluate the trained model, run `test_phrase_grammar.py`. The sketch below illustrates how unlabeled F1 is computed.
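    Unlabeled F1 is the standard bracketing F1 over constituent spans. A self-contained sketch on a toy pair of trees, using a nested-list representation (illustrative only, not this repo's exact evaluation code; conventions differ on whether the whole-sentence span is counted):

    ```python
    from __future__ import division  # Python 2.7 compatibility

    def get_spans(tree, left=0):
        """Collect (left, right) spans of all constituents in a nested-list tree."""
        spans, pos = set(), left
        for child in tree:
            if isinstance(child, list):        # internal node: recurse
                child_spans, pos = get_spans(child, pos)
                spans |= child_spans
            else:                              # leaf word
                pos += 1
        spans.add((left, pos))
        return spans, pos

    def unlabeled_f1(pred, gold):
        p, g = get_spans(pred)[0], get_spans(gold)[0]
        overlap = len(p & g)
        prec, rec = overlap / len(p), overlap / len(g)
        return 2 * prec * rec / (prec + rec)

    # Toy example: predicted ((the dog) barked) vs. gold (the (dog barked))
    print(unlabeled_f1([['the', 'dog'], 'barked'], ['the', ['dog', 'barked']]))  # 0.5
    ```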