diff --git a/cuda_core/docs/source/release/0.2.0-notes.rst b/cuda_core/docs/source/release/0.2.0-notes.rst index 02f586d58..87353f476 100644 --- a/cuda_core/docs/source/release/0.2.0-notes.rst +++ b/cuda_core/docs/source/release/0.2.0-notes.rst @@ -27,8 +27,30 @@ New features - Expose :class:`ObjectCode` as a public API, which allows loading cubins from memory or disk. For loading other kinds of code types, please continue using :class:`Program`. - A C++ helper function ``get_cuda_native_handle()`` is provided in the new ``include/utility.cuh`` header to retrive the underlying CUDA C objects (ex: ``CUstream``) from a Python object returned by the ``.handle`` attribute (ex: :attr:`Stream.handle`). - For objects such as :class:`Program` and :class:`Linker` that could dispatch to different backends, a new ``.backend`` attribute is provided to query this information. -- Support CUDA event timing. -- An :class:`~_event.Event` may now be created without recording it to a :class:`~_stream.Stream` using the :meth:`Device.create_event` method. +- Support CUDA :class:`Event` timing. (#481, #498, #508) +- An :class:`Event` may now be created without recording it to a :class:`~_stream.Stream` using the :meth:`Device.create_event` method. +- :class:`Program` now supports the additional ``PTX`` code type. (#317) +- :meth:`Linker.link` exceptions now include the original error log. (#423) +- In a systematic sweep through the cuda.core implementations, many exceptions messages were made more consistent and informative. (#458) + +New examples +------------ +- ``jit_lto_fractal.py`` — Demonstrates just-in-time link-time optimization for fractal generation. (:class:`Device`, :class:`LaunchConfig`, :class:`Linker`, :class:`LinkerOptions`, :class:`Program`, :class:`ProgramOptions`) (#475) +- ``simple_multi_gpu_example.py`` — Example of using multiple GPUs. (:class:`Device`, :class:`Program`, :class:`LaunchConfig`) (#304) +- ``show_device_properties.py`` — Displays detailed device properties. (:class:`Device`) (#474) + +Minor fixes and enhancements +---------------------------- +- A dangling pointer problem in ``_linker.py`` was fixed. (#516) +- Add ``@functools.lru_cache`` decorator for :func:`get_binding_version`. (#512) +- Selected ``.decode()`` were changed to ``.decode("utf-8", errors="backslashreplace")`` to ensure that decoding error messages does not abort the process. (#510) +- The performance of :meth:`Device.compute_capability` was improved. (#459) +- The :class:`Program` constructor now issues a warning when falling back to :func:`cuLink`. (#315) +- To avoid deprecation warnings, the cuda.bindings imports in the cuda.core implementations were cleaned up. (#404) + +Test fixes +---------- +- Clean up device initialization in some tests. (#507) Limitations -----------