ONNX Runtime v1.18.1
What's new?
Announcements:
- ONNX Runtime Python packages now have numpy dependency >=1.21.6, <2.0. Support for numpy 2.0 will be added in a future release.
- CUDA 12.x ONNX Runtime GPU packages are now built against cuDNN 9.x (1.18.0 packages previously depended on cuDNN 8.x). CUDA 11.x ONNX Runtime GPU packages continue to depend on CuDNN 8.x.
- Windows packages require installation of Microsoft Visual C++ Redistributable Runtime 14.38 or newer.
TensorRT EP:
- TensorRT Weightless API integration.
- Support for TensorRT hardware compatible engines.
- Support for INT64 types in TensorRT constant layer calibration.
- Now using latest commit of onnx-tensorrt parser, which includes several issue fixes.
- Additional TensorRT support and performance improvements.
Packages:
- Publish CUDA 12 Java packages to Azure DevOps feed.
- Various packaging pipeline fixes.
This patch release also features various other bug fixes, including a CUDA 12.5 build error fix.
Big thank you to @yf711 for driving this release as the release manager and to all our contributors!
@yf711 @jchen351 @mszhanyi @snnn @wangyems @jywu-msft @skottmckay @chilo-ms @moraxu @kevinch-nv @pengwa @wejoncy @pranavsharma @Craigacp @jslhcl @adrianlizarraga @inisis @jeffbloo @mo-ja @kunal-vaishnavi @sumitsays @neNasko1 @yufenglee @dhruvbird @wangshuai09 @xiaoyu-work @axinging @yuslepukhin @YUNQIUGUO @shubhambhokare1 @fs-eire @afantino951 @tboby @HectorSVC @baijumeswani