Skip to content
This repository was archived by the owner on Jan 15, 2024. It is now read-only.

Commit 3fbe961

Browse files
MoisesHerleezu
andauthored
Deploy BERT model - Script (#1237) (#1357)
* Add example script to deploy BERT * Add options to better measure performance * Allow specification of path for exported model * Add option to use custom graph pass * Add optimization for MHA in custom graph pass * Correct bug with input shapes in optimize_for * correct typo * fix lint * fix lint * Add documentation * Add documentation for using deploy script * Correct typo/add spaces in documentation * Add setup.py to compile pass, update documentation * Fix bug in path to include dir & fix pylint * Add unitest for deploy bert script * change CUDA version in wheel * test latest wheel * change path to custom pass library * fixing trigger custom pass compilation * fix lint * fix lint * Update mxnet pip version * Only GPU versions changed * fix lint * change wheel to include mkl headers * lint docstring * remove debug print * change include paths * lint * debugging lib_api.h * debugging lib_api.h * debugging * Disable test for now * skip test if mxnet_version < 1.7.0 * use pytest.mark.skipif to skip test * test only BERT-base (fp16/fp32, SST/QA, embeddings) to avoid timeout Co-authored-by: Leonard Lausen <[email protected]> Co-authored-by: Leonard Lausen <[email protected]>
1 parent 08bc5ba commit 3fbe961

File tree

8 files changed

+1828
-596
lines changed

8 files changed

+1828
-596
lines changed

env/gpu/py3-master.yml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -33,7 +33,7 @@ dependencies:
3333
- flaky==3.6.1
3434
- flake8==3.7.9
3535
- mock<3
36-
- https://repo.mxnet.io/dist/python/cu100/mxnet_cu100-1.6.0-py2.py3-none-manylinux1_x86_64.whl
36+
- https://repo.mxnet.io/dist/python/cu100/mxnet_cu100-1.7.0b20200809-py2.py3-none-manylinux2014_x86_64.whl
3737
- scipy==1.3.2
3838
- regex==2020.4.4
3939
- nltk==3.4.5

0 commit comments

Comments
 (0)