I am an undergraduate student in IIIS (Yao Class), Tsinghua University. I am currently interested in efficient algorithms and machine learning systems.
- Tsinghua University, NVIDIA
- Beijing, China
Pinned
- thu-ml/SageAttention: Quantized attention that achieves speedups of 2.1-3.1x and 2.7-5.1x over FlashAttention2 and xformers, respectively, without losing end-to-end metrics across various models (see the usage sketch after this list).
- SPH_Project: An SPH implementation of fluid simulation, featuring large-scale simulation, rigid-fluid coupling, and high-viscosity fluids.
- thu-nics/MoA: The official implementation of the paper "MoA: Mixture of Sparse Attention for Automatic Large Language Model Compression".
- mit-han-lab/qserve: QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM Serving.
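For context, SageAttention is intended as a drop-in replacement for standard scaled dot-product attention. The snippet below is a minimal usage sketch only; it assumes the package exposes a `sageattn(q, k, v, ...)` function over FP16/BF16 CUDA tensors, and the exact signature, tensor layout options, and supported dtypes should be checked against the repository's README.

```python
# Minimal sketch of calling SageAttention as a drop-in attention kernel.
# Assumptions (not confirmed by this page): the package is installed as
# `sageattention`, exposes `sageattn(q, k, v, is_causal=...)`, and accepts
# FP16 CUDA tensors shaped (batch, heads, seq_len, head_dim).
import torch
from sageattention import sageattn

batch, heads, seq_len, head_dim = 2, 16, 4096, 128
q = torch.randn(batch, heads, seq_len, head_dim, dtype=torch.float16, device="cuda")
k = torch.randn_like(q)
v = torch.randn_like(q)

# Output is expected to match the shape of standard scaled dot-product attention.
out = sageattn(q, k, v, is_causal=False)
print(out.shape)  # (2, 16, 4096, 128)
```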