Skip to content

Releases: intel/llvm

DPC++ daily 2022-03-15

15 Mar 15:18
8924fcc
Compare
Choose a tag to compare
Pre-release
[SYCL] Do not treat sycl::half kernel argument specially (#5812)

There was some special handling added when all struct kernel arguments
were decomposed since OpenCL doesn't allow half kernel argument. Once
we've implemented conditional decomposition no special handling needed
for half type.

DPC++ daily 2022-03-14

14 Mar 15:19
661b2c9
Compare
Choose a tag to compare
Pre-release
sycl-nightly/20220314

[NFC][SYCL] Replace hard-coded checks in kernel-arg-accessor-pointer.…

DPC++ daily 2022-03-12

12 Mar 16:47
Compare
Choose a tag to compare
Pre-release
[SYCL][Driver][NFC] Fix unused variable warning (#5792)

llvm/clang/lib/Driver/Driver.cpp:9141:13:
   error: unused variable 'Ext' [-Werror,-Wunused-variable]
      StringRef Ext(llvm::sys::path::extension(FileName).drop_front());

DPC++ daily 2022-03-11

11 Mar 16:54
b2ee289
Compare
Choose a tag to compare
Pre-release
sycl-nightly/20220311

[SYCL][ESIMD] Move rounding functions out of experimental namespace (…

DPC++ daily 2022-03-10

10 Mar 16:53
250c498
Compare
Choose a tag to compare
Pre-release
[SYCL][libclc][CUDA] Add native math extension (#5747)

This patch extends the native math definitions in order to include builtins out of the current SYCL specification.

In particular, this patch adds a ``tanh`` builtin for floats/halfs and a exp2 builtin for ``halfs`` which are mapped to instructions introduced for ``sm_75`` and above.

Tests in https://github.com/intel/llvm-test-suite/pull/895

DPC++ daily 2022-03-09

09 Mar 16:55
2331160
Compare
Choose a tag to compare
Pre-release
[SYCL][ESIMD][EMU] single_task support (#5671)

ESIMD EMU: single_task() support.
ESIMD plugin: misc clarifying refactoring.

DPC++ daily 2022-03-08

08 Mar 16:48
6f81972
Compare
Choose a tag to compare
Pre-release
[CODEOWNERS] Update with new names (#5743)

* Added Tianfei for Release Notes
* Added ESIMD reviewers for SYCLLowerIR/CMakeLists.txt

oneAPI DPC++ Compiler 2021-12

07 Mar 00:46
27f59d8
Compare
Choose a tag to compare

New features

SYCL Compiler

  • Added support for -fgpu-inline-threshold which allows controlling inline
    threshold of the SYCL device code [5f7b607]
  • Added experimental support for CUDA backend on Windows [8aa3513]
  • Added support for experimental option -fsycl-max-parallel-link-jobs=<N>
    which can be used specify how many processes the compiler can use for
    linking the device code [c2221f0]

SYCL Library

Documentation

Improvements

SYCL Compiler

  • Added diagnostics on attempt to pass an incorrect value to
    -fsycl-device-code-split [631fd69]
  • Improved output of -fsycl-help[2404d02]
  • Allowed ::printf builtin for CUDA backend only [0c55d3a]
  • Implemented nextafter for sycl::half on CUDA backend [53c3268]
  • Added atomics with scopes and memory orders for CUDA backend
    [2ebde5f] [00f43b3]
  • Added support for missing mathematical builtins in CUDA backend
    [789ec8b] [f074774] [390e105]
  • Added diagnostic for non-forward declarable kernel name types [653bae9]
  • Added group_ballot intrinsic for CUDA backend [0680e5c]
  • Added support for device side assert for CUDA backend [5a87b8c]
  • Turned on -fsycl-dead-args-optimization by default [5983dfd]
  • Improved compilation time by removing free function queries calls detection
    [e4791d1]
  • Reduced memory consumption of device code linking [6266820]
  • Improved UX of sycl::ext::oneapi::experimental::printf by allowing format
    string to reside in a non-constant address space [2d62e51]
  • Improved barrier and sync instructions to use full mask when targeting NVPTX
    [5ce99b8]
  • Added match for default SPIR device architecture with host architecture i.e.
    x86_64 matches spir64 and i686 matches spir [f4d01cd]
  • Set default device code split mode to off for FPGA [bea72e6]
  • Improved diagnostic for invalid SYCL kernel names
    [455dce8] [df1ff7a]
  • Made Xsycl-target-frontend= to accept device tripple aliases [7fa0569]
  • Improved diagnostic messages for -fsycl-libspirv-path [c54c605]
  • Made implied default device to force emulation for FPGA builds [074944e]
  • Added support for sycl::ext::oneapi::sub_group::get_local_id for HIP
    backend [7a9335d]
  • Added a diagnostic of indirect implicit capture of this for kernel lambda
    [dce4c6a]

SYCL Library

  • Updated joint matrix queries to report if unsigned int variants of mad
    matrix instruction are supported [dd7ebce]
  • Reduced overhead of device code assert implementation [b94f23a] [58ac74e]
  • Added a diagnostic on attempt to call sycl::get_kernel_id with an invalid
    kernel [9dd1ea3]
  • Reduced overhead on kernel submission for CUDA backend [b79ae69]
  • Reduced overhead on kernel submission in backend independent part of runtime
    [e292aa5]
  • Aligned Level-Zero Interoperability API with SYCL 2020 specification
    [dd7f82c] [e662166]
  • Made sycl::half default constructor constexpr [d32a444]
  • Changed CUDA and HIP backends to report each device in a separate platform
    [8dddb11]
  • Added initial support for SYCL2020 exceptions [15e0ab1]
  • Added SYCL 2020 sycl::target::device enumeration value [f710886]
  • Added a diagnostic on attempt to print std::byte using sycl::stream
    [dd5e094]
  • Added possibility to specify ownership of ze_module_handle_t when creating
    a sycl::kernel_bundle from it [e3c9c92]
  • Improve performance of sycl::nd_item::get_group_range() [0cd7b7e]
  • Deprecated sycl::target::global_buffer
  • Made device_num which can be passed to SYCL_DEVICE_FILTER unique
    [7aa5be0]
  • Added a diagnostic on using mutually exclusive sycl::handler methods
    [6f620a4]
  • Added support for std::byte to sycl::vec class [8fa04fe]
  • Added sycl::make_kernel interoperability support for Level-Zero backend
    [98896fd]
  • Optimized work with events in the Level Zero backend [973aee9]
  • Added support for sycl::ext::oneapi::experimental::matrix::wi_slice and
    sycl::ext::oneapi::experimental::matrix::joint_matrix_fill
    [97127eb] [cbad428]
  • Enabled code location information when NDEBUG is not defined in XPTI
    notifications [e9f2d64] [9ca7cea]
  • Added a diagnostic on attempt to pass a command group function object to
    sycl::queue::single_task [2614d4d]
  • Enlarged the maximum batch size to 64 for Level Zero backend to improve
    performance [596f693]
  • Reduced kernel submission overhead for CUDA backend [35729a7]
  • Improved translation of Level Zero error codes [6699a5d], [5d9a04b]
  • Added support for an arbitrary number of elements to
    sycl::ext::intel::experimental::esimd::simd::copy_from/to methods
    [2bdc4c4]
  • Added HIP support to sycl::ext::oneapi::filter_selector
    [7224cb2], [b7cee06]
  • Added support for batching copy commands for Level Zero backend [4c3e699]
  • Reduced sycl::queue::submit overhead by enabling post-enqueue execution
    graph cleanup [6fd6098]
  • Added support for classes implicitly converted from sycl::item in
    sycl::handler::parallel_for parameter to align with the SYCL 2020
    specification [34b93bf]
  • Removed direct initialization constructor from
    sycl::ext::intel::experimental::bfloat16 class [81154ec]
  • Added sycl::vec and sycl::marray support to sycl::known_identity type
    trait [8fefb25]
  • Added minimal support for the generic space address space to match
    sycl::atomic_ref class definition in specification [e99f298]
  • Improved cache of command-lists in the context to be per-device for Level
    Zero backend [ca457d9]
  • Extended group algorithms to support broadened types [3205368]
  • Added support for alignement flags in
    sycl::ext::intel::experimental::esimd::simd::copy_from/copy_to operations
    [27f5c12]
  • Made sycl::ext::oneapi::atomic_ref available in sycl namespace
    [2cdcbed]
  • Renamed cuda and hip enum values and namespaces to ext_oneapu_cuda and
    ext_oneapi_hip to align with SYCL 2020 specification [97f916e]
  • Improved performance of kernel submission process [535ad1e]
  • Eliminated build of unwanted kernels when creating one with make_kernel
    [53ea8b9]
  • Removed duplicate devices on submission to kernel_bundle API functions
    [c222497]
  • Deprecated sycl::aspects::int64_base_atomics and
    sycl::aspects::int64_extended_atomics [554b79c]
  • Made backend specific headers be included implicitly [bc8a00a]
  • Removed program class and related API [e7cc7b0]
  • Excluded current working directory from DLL search path when looking for
    runtime dependencies [0a65cb4]
  • Enabled persistent device code cache for kernel bundles [810d67a]
  • R...
Read more

DPC++ daily 2022-03-07

07 Mar 16:48
a914680
Compare
Choose a tag to compare
Pre-release
[BuildBot] Uplift GPU RT version for Linux to 22.09.22577 (#5742)

Signed-off-by: bb-sycl <[email protected]>

DPC++ daily 2022-03-05

05 Mar 16:53
2741010
Compare
Choose a tag to compare
Pre-release
sycl-nightly/20220305

[ESIMD] Disable non-critical messages from VC backend (disable-finali…