Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Run coredumped #10

Closed
AlvinChen13 opened this issue Nov 8, 2016 · 3 comments
Closed

Run coredumped #10

AlvinChen13 opened this issue Nov 8, 2016 · 3 comments

Comments

@AlvinChen13
Copy link

My system is installed 4 P100 GPUs and CUDA 8.0. For NCCL, it runs well. And I compile the benchmark by 'make CUDA_PATH=/usr/local/cuda CUDNN_PATH=/usr/local/cuda MPI_PATH=/home/userid/ompi NCCL_PATH=/home/userid/weike/nccl/ ARCH=sm_61'

Anyone can help me the coredump?

userid@ubuntu-WK-4xP100:~/weike/DeepBench/code$ source /.bashrc
userid@ubuntu-WK-4xP100:
/weike/DeepBench/code$ bin/gemm_bench
Times

m       n      k      a_t     b_t      time (usec) 

main: #1.
main: #2.
main: #3.
terminate called after throwing an instance of 'thrust::system::system_error'
what(): function_attributes(): after cudaFuncGetAttributes: invalid device function
Aborted (core dumped)
userid@ubuntu-WK-4xP100:~/weike/DeepBench/code$

@AlvinChen13
Copy link
Author

BTW, cudnn is v5.1

@sharannarang
Copy link
Contributor

@AlvinChen13 , The recommend architecture for P100 is sm_60. Some of the other Pascal GPUs use sm_61. This page provides more details:

https://developer.nvidia.com/cuda-gpus

The GEMM benchmark works on a P100 with sm_60. Can you try running it again with the correct architecture?

@AlvinChen13
Copy link
Author

Thanks, fixed the issue.

sharannarang pushed a commit that referenced this issue May 2, 2018
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants