Skip to content

Expose serve-time skip-softmax sparsity via MODELOPT_MEASURE_SPARSITY

b4508e3
Select commit
Loading
Failed to load commit list.
Draft

Skip-Softmax calibration in vLLM #1622

Expose serve-time skip-softmax sparsity via MODELOPT_MEASURE_SPARSITY
b4508e3
Select commit
Loading
Failed to load commit list.
DCO / DCO succeeded Jun 6, 2026 in 1s

DCO

All commits are signed off!