Aayush-Ankit/ml-inference-benchmarks

GPU and CPU measurements for ML inference workloads: power, latency, and throughput

Running the workloads

Steps to set up dependencies

Re-run the torch install even if torch is already present, to make sure your version is the latest one

  1. git clone git@github.com:Aayush-Ankit/isca_workloads.git
  2. luarocks install torch
    Note: luarocks install may fail on systems sitting behind a proxy. In that case, configure git to use https:// instead of git:// by adding the following to your global gitconfig (e.g. vim ~/.gitconfig):
    [url "https://"]
        insteadOf = git://
    For more information, see https://github.com/luarocks/luarocks/wiki/LuaRocks-through-a-proxy
  3. luarocks install nn
  4. luarocks install dpnn
  5. luarocks install torchx
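
As an optional sanity check (not part of the original steps), you can verify the base packages load from the torch command line; the print string here is arbitrary:

  th -e "require 'nn'; require 'dpnn'; require 'torchx'; print('base deps OK')"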

To use with CUDA

  1. luarocks install cutorch
  2. luarocks install cunn
  3. luarocks install cunnx
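
Similarly, you can confirm the CUDA packages load and that a GPU is visible (this assumes a CUDA-capable GPU and a working driver):

  th -e "require 'cutorch'; require 'cunn'; print('GPUs visible: ' .. cutorch.getDeviceCount())"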

Install the RNN dependency (enables using sequencers)

  1. cd rnn
  2. luarocks make rocks/rnn-scm-1.rockspec
    Note: the above command may fail if stale CMake files are left over in the rnn directory. If that happens, delete the rnn directory, re-clone it, and run the install again:
    2.a rm -rf rnn
    2.b git clone git@github.com:Element-Research/rnn.git
    2.c Repeat steps 1 and 2
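
Once the rock builds, a quick load check confirms the sequencer modules are available (nn.Sequencer is provided by the rnn package):

  th -e "require 'rnn'; print(nn.Sequencer ~= nil)"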

Yay! Setup's Done!!!

Running a benchmark on CPU/GPU

Some info about the benchmarks

  1. wlm_bigLSTM - bigLSTM network for word-level language modelling (Google 1B dataset)
  2. wlm_anotherLSTM - another deep LSTM network for word-level language modelling (Google 1B dataset)
  3. nmt_l5 - Google Machine Translation for English-French, 5-layer variant (WMT15 dataset)
  4. nmt_l3 - Google Machine Translation for English-French, 3-layer variant (WMT15 dataset)

th <benchmark>.lua -gpu <0/1> -threads <non-zero> -batch <non-zero>

cmdline options:

  1. gpu > use 0 for a CPU run, 1 for a GPU run (default is CPU)
  2. threads > useful for CPU runs; can be increased to evaluate CPU performance (default is 1)
  3. batch > can be varied to see how the CPU/GPU numbers (inference time, power) change. For GPU, the batch size can be increased until torch throws a THCudaCheck out-of-memory error. (See the example runs below.)
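
For example (assuming the script filenames match the benchmark names above; check the repository listing to be sure), a GPU run of the bigLSTM benchmark at batch size 64, and an 8-thread CPU run at batch size 16:

  th wlm_bigLSTM.lua -gpu 1 -threads 1 -batch 64
  th wlm_bigLSTM.lua -gpu 0 -threads 8 -batch 16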

Metrics of interest that are printed

  1. Number of parameters in the network
  2. Inference time on CPU/GPU. NOTE: for CPU inference time, run the benchmark twice (and use the 2nd value) to make sure the data-movement cost from HDD isn't included.
  3. The <>pow.txt file shows GPU power consumption (see the power-sampling sketch below).
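
If you want to sample GPU power draw yourself alongside a run, nvidia-smi can log it to a file. This is a minimal sketch, assuming an NVIDIA GPU with nvidia-smi on the PATH and the same assumed script filename as above; the benchmarks' own <>pow.txt output may be produced differently:

  # log power draw once per second while the benchmark runs
  nvidia-smi --query-gpu=power.draw --format=csv -l 1 > gpu_pow.txt &
  SMI_PID=$!
  th wlm_bigLSTM.lua -gpu 1 -batch 64
  kill $SMI_PID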
