Skip to content

Latest commit

 

History

History
12 lines (9 loc) · 408 Bytes

TTI-cpu-inference-stack.md

File metadata and controls

12 lines (9 loc) · 408 Bytes

TTI cpu inference stack





  • port models to ONNX and compile
  • quantize weights, low precision
  • zero off-loading