Highlights

Models

Language Model
- The Large Scale Word Language Model as introduced by Jozefowicz, Rafal, et al. “Exploring the limits of language modeling”. arXiv preprint arXiv:1602.02410 (2016) achieved test PPL 43.62 on GBW dataset (#179 #270 #277 #278 #286 #294)
- The NT-ASGD based Language Model as introduced by Merity, S., et al. “Regularizing and optimizing LSTM language models”. ICLR 2018 achieved test PPL 65.62 on WikiText-2 dataset (#170)
Document Classification
- The Classification Model as introduced by Joulin, Armand, et al. “Bag of tricks for efficient text classification” achieved validation accuracy validation accuracy 98 on Yelp review dataset (#258 #297)
Question Answering
- The QANet as introduced by Jozefowicz, Rafal, et al. “
  QANet: Combining Local Convolution with Global Self-Attention for Reading Comprehension”. ICLR 2018 achieved F1 score 79.5 on SQuAD 1.1 dataset (#339) (coming soon to master branch)

Machine Translation
- The Google NMT as introduced by Wu, Yonghui, et al. “Google's neural machine translation system:
  Bridging the gap between human and machine translation”. arXiv preprint arXiv:1609.08144 (2016) is introduced as part of the gluonnlp tutorial (#261)
- The Transformer based Machine Translation by Vaswani, Ashish, et al. “Attention is all you need.” Advances in Neural Information Processing Systems. 2017 is introduced as part of the gluonnlp tutorial (#279)
Sentence Embedding
- A Structured Self-attentive Sentence Embedding (#366) by Z. Lin, M. Feng, C. Santos, M. Yu, B. Xiang, B. Zhou, Y. Bengio, "A Structured Self-attentive Sentence Embedding" ICLR 2017 is introduced in gluonnlp tutorial (#366)

Word Embedding
- Wikipedia (#218)
- Fil9 dataset(#363)
- FastText crawl-300d-2M-subword(#336), wiki-news-300d-1M-subword(#368), cc.en.300(#373)

Added dataloader that allows multi-shard sampling (#237 #280 #285)
Simplified DataStream, added DatasetStream, refactored and extended PrefetchingStream (#235)
Unified BPTT batchify for dataset and stream (#246)
Added symbolic beam search (#233)
Added SequenceSampler (#272)
Refactored Transform APIs (#282)
Reorganized index of the repo and model zoo page (#357)