Skip to content

Commit c3110db

Browse files
authored
CHANGELOG: update for 4.1 release
1 parent a07759d commit c3110db

File tree

1 file changed

+4
-12
lines changed

1 file changed

+4
-12
lines changed

CHANGELOG.md

Lines changed: 4 additions & 12 deletions
Original file line numberDiff line numberDiff line change
@@ -2,11 +2,12 @@
22

33
Full documentation for rocFFT is available at [rocfft.readthedocs.io](https://rocfft.readthedocs.io/en/latest/).
44

5-
## [(Unreleased) rocFFT 1.0.10 for ROCm 4.1.0]
5+
## [rocFFT 1.0.10 for ROCm 4.1.0]
66

77
### Added
88
- Explicitly specify MAX_THREADS_PER_BLOCK through _\_launch\_bounds\_ for all
9-
manual kernels.
9+
kernels.
10+
- Switch to new syntax for specifying AMD GPU architecture names and features.
1011

1112
### Optimizations
1213
- Optimized C2C/R2C 3D 64, 81, 100, 128, 200, 256 cube sizes.
@@ -18,15 +19,6 @@ Full documentation for rocFFT is available at [rocfft.readthedocs.io](https://ro
1819
### Changed
1920
- rocFFT now automatically allocates a work buffer if the plan
2021
requires one but none is provided.
21-
22-
## [(Unreleased) rocFFT 1.0.9 for ROCm 4.0.0]
23-
24-
### Added
25-
- Explicitly specify MAX_THREADS_PER_BLOCK through _\_launch\_bounds\_ for all
26-
generated kernels.
27-
- Switch to new syntax for specifying AMD GPU architecture names and features.
28-
29-
### Changed
3022
- An explicit `rocfft_status_invalid_work_buffer` error is now
3123
returned when a work buffer of insufficient size is provided.
3224
- Updated online documentation.
@@ -90,4 +82,4 @@ Full documentation for rocFFT is available at [rocfft.readthedocs.io](https://ro
9082
- Fixed correctness of certain large 1D, and 2D power-of-3, 5 sizes.
9183
- Fixed incorrect buffer assignment for some even-length R2C transforms.
9284
- Fixed `<cstddef>` inclusion on C compilers.
93-
- Fixed incorrect results on non-unit strides with SBCC/SBRC kernels.
85+
- Fixed incorrect results on non-unit strides with SBCC/SBRC kernels.

0 commit comments

Comments
 (0)