This repository was archived by the owner on Jan 15, 2024. It is now read-only.

Commit a946474

eric-haibin-lin authored and szha committed
update hash (#332)
1 parent b76bd0f commit a946474

3 files changed, with 7 additions and 7 deletions

gluonnlp/model/language_model.py

Lines changed: 1 addition & 1 deletion
@@ -422,7 +422,7 @@ def standard_lstm_lm_1500(dataset_name=None, vocab=None, pretrained=False, ctx=c
     ('700b532dc96a29e39f45cb7dd632ce44e377a752', 'standard_lstm_lm_200_wikitext-2'),
     ('45d6df33f35715fb760ec8d18ed567016a897df7', 'awd_lstm_lm_1150_wikitext-2'),
     ('7894a046f8286db0d5d2ed672b60f4f52b4bc3aa', 'awd_lstm_lm_600_wikitext-2'),
-    ('92ed6f71c4b12f465b11d72e7271743d0c4c1a7a', 'big_rnn_lm_2048_512_gbw'),
+    ('6bb3e991eb4439fabfe26c129da2fe15a324e918', 'big_rnn_lm_2048_512_gbw'),
 ]})

 class BigRNN(Block):
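
The tuples in this hunk pair a SHA-1 checksum with a pretrained model name, so replacing the `big_rnn_lm_2048_512_gbw` entry suggests that the published parameter file itself changed. As a rough illustration of how such a table could be used to validate a downloaded file, the sketch below relies only on `hashlib`. The helper names `sha1_of_file` and `check_model_file` and the `MODEL_SHA1` dict are hypothetical, chosen for this example, and are not GluonNLP's actual model-store code.

```python
import hashlib

# Hypothetical lookup table mirroring the (checksum, name) pairs in the diff.
MODEL_SHA1 = {
    'big_rnn_lm_2048_512_gbw': '6bb3e991eb4439fabfe26c129da2fe15a324e918',
}


def sha1_of_file(path, chunk_size=1 << 20):
    """Return the hex SHA-1 digest of the file at `path`, read in chunks."""
    digest = hashlib.sha1()
    with open(path, 'rb') as f:
        # Read fixed-size chunks so large parameter files need not fit in memory.
        for chunk in iter(lambda: f.read(chunk_size), b''):
            digest.update(chunk)
    return digest.hexdigest()


def check_model_file(path, model_name, table=MODEL_SHA1):
    """Return True if the file's SHA-1 matches the checksum recorded for model_name."""
    return sha1_of_file(path) == table[model_name]
```

Under this scheme, a cached copy of the old parameter file would fail the check against the new checksum, which is one plausible way a stale download could be detected and refreshed.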

scripts/language_model/language_model.rst

Lines changed: 5 additions & 5 deletions
@@ -179,18 +179,18 @@ The dataset used for training the models is Google's 1 billion words dataset.
 +-----------------+------------------------------------------------------------------------------------------------------------------------------+
 | Gradient clip | 10.0 |
 +-----------------+------------------------------------------------------------------------------------------------------------------------------+
-| Test perplexity | 43.72 |
+| Test perplexity | 43.62 |
 +-----------------+------------------------------------------------------------------------------------------------------------------------------+
-| Num epochs | 42 |
+| Num epochs | 50 |
 +-----------------+------------------------------------------------------------------------------------------------------------------------------+
 | Training logs | `log <https://github.com/dmlc/web-data/blob/master/gluonnlp/logs/language_model/big_rnn_lm_2048_512_gbw.log>`__ |
 +-----------------+------------------------------------------------------------------------------------------------------------------------------+
 | Evaluation logs | `log <https://github.com/dmlc/web-data/blob/master/gluonnlp/logs/language_model/big_rnn_lm_2048_512_gbw-eval.log>`__ |
 +-----------------+------------------------------------------------------------------------------------------------------------------------------+

-[1] LSTM-2048-512 (Test PPL 43.72, based on `commit <https://github.com/eric-haibin-lin/gluon-nlp/tree/b3cfcc427fbceea2ce3c5fae63c61b93910326dc>`__)
+[1] LSTM-2048-512 (Test PPL 43.62)

 .. code-block:: console

-   $ python large_word_language_model.py --gpus 0,1,2,3 --epochs=42 --clip=10
-   $ python large_word_language_model.py --gpus 4 --eval-only --batch-size=1 --log-interval=999999
+   $ python large_word_language_model.py --gpus 0,1,2,3 --clip=10
+   $ python large_word_language_model.py --gpus 4 --eval-only --batch-size=1

scripts/language_model/large_word_language_model.py

Lines changed: 1 addition & 1 deletion
@@ -83,7 +83,7 @@
                     help='list of gpus to run, e.g. 0 or 0,2,5. empty means using cpu.')
 parser.add_argument('--log-interval', type=int, default=1000,
                     help='report interval')
-parser.add_argument('--seed', type=int, default=1,
+parser.add_argument('--seed', type=int, default=0,
                     help='random seed')
 parser.add_argument('--lr', type=float, default=0.2,
                     help='initial learning rate')
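
This hunk only changes the default of `--seed`; an explicit `--seed` on the command line still takes precedence. A minimal sketch of that behaviour follows. It reproduces only the `--seed` option and uses the standard-library `random` module as a stand-in for whatever framework RNG the real script seeds, which is an assumption because the seeding call is not part of this diff. The function names `build_parser` and `first_draws` are hypothetical.

```python
import argparse
import random


def build_parser():
    # Only the --seed option from the diff is reproduced here.
    parser = argparse.ArgumentParser(description='seed default sketch')
    parser.add_argument('--seed', type=int, default=0,
                        help='random seed')
    return parser


def first_draws(argv, n=3):
    """Parse argv, seed the stdlib RNG (a stand-in), and return the first n draws."""
    args = build_parser().parse_args(argv)
    random.seed(args.seed)
    return args.seed, [random.random() for _ in range(n)]
```

Calling `first_draws([])` and `first_draws(['--seed', '0'])` yields the same sequence, while `first_draws(['--seed', '1'])` corresponds to the previous default and yields a different one. Runs that relied on the old default may therefore need `--seed 1` passed explicitly to repeat earlier behaviour.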
