A standalone GEMM kernel for fp16 activations and quantized weights, extracted from FasterTransformer

resorcap/cutlass_fpA_intB_gemm

 
 

Languages

  • C++ 98.7%
  • Other 1.3%