gemv

Here are 4 public repositories matching this topic...

Several optimization methods of half-precision general matrix vector multiplication (HGEMV) using CUDA core.

gpu cuda cublas nvidia gemm gemv matrix-multiply tensor-core hgemm cuda-core hgemv

An implementation of SGEMV with performance comparable to cuBLAS.

cuda blas gemv

Matilda is a library to repeatedly multiply a constant matrix with a variable vector

realtime multithreading simd low-latency avx2 adaptive-optics matrix-vector-multiplication avx-512 gemv

Highly optimized DGEMV on CPU with both serial and parallel performance better than MKL and OpenBLAS.

openmp simd blas avx512 mkl gemv

Add a description, image, and links to the gemv topic page so that developers can more easily learn about it.

To associate your repository with the gemv topic, visit your repo's landing page and select "manage topics."