-
Notifications
You must be signed in to change notification settings - Fork 238
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Run coredumped #10
Comments
BTW, cudnn is v5.1 |
@AlvinChen13 , The recommend architecture for P100 is https://developer.nvidia.com/cuda-gpus The GEMM benchmark works on a P100 with |
Thanks, fixed the issue. |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
My system is installed 4 P100 GPUs and CUDA 8.0. For NCCL, it runs well. And I compile the benchmark by 'make CUDA_PATH=/usr/local/cuda CUDNN_PATH=/usr/local/cuda MPI_PATH=/home/userid/ompi NCCL_PATH=/home/userid/weike/nccl/ ARCH=sm_61'
Anyone can help me the coredump?
userid@ubuntu-WK-4xP100:~/weike/DeepBench/code$ source
/.bashrc/weike/DeepBench/code$ bin/gemm_benchuserid@ubuntu-WK-4xP100:
Times
main: #1.
main: #2.
main: #3.
terminate called after throwing an instance of 'thrust::system::system_error'
what(): function_attributes(): after cudaFuncGetAttributes: invalid device function
Aborted (core dumped)
userid@ubuntu-WK-4xP100:~/weike/DeepBench/code$
The text was updated successfully, but these errors were encountered: