Modify the Makefile to set proper paths for CUDA toolkit and libraries.
Type 'make' to compile and './transpose' to execute.
Cite the work as follows:
Ayaz ul Hassan Khan, M. A. Al-Mouhamed, A. Almousa, A. Fatayar, A. Baqais, and M. Assayony, “Padding Free Bank Conflict Resolution for CUDA-Based Matrix Transpose Algorithm”, International Journal of Networked and Distributed Computing, Vol. 2, issue 3, pp 124-134, July 2014, DOI: doi:10.2991/ijndc.2014.2.3.2.
A. H. Khan, M. A. Al-Mouhamed, A. Almousa, A. Fatayar, A. Baqais, and M. Assayony, “Padding Free Bank Conflict Resolution for CUDA-Based Matrix Transpose Algorithm”, 15th IEEE/ACIS International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing (SNPD 2014).