Sparsity-aware deep learning inference runtime for CPUs
nlp performance computer-vision inference machinelearning pruning object-detection pretrained-models quantization cpus onnx sparsification llm-inference deepsparse
-
Updated
May 5, 2025 - Python