2
2
3
3
Full documentation for rocFFT is available at [ rocfft.readthedocs.io] ( https://rocfft.readthedocs.io/en/latest/ ) .
4
4
5
- ## [ (Unreleased) rocFFT 1.0.10 for ROCm 4.1.0]
5
+ ## [ rocFFT 1.0.10 for ROCm 4.1.0]
6
6
7
7
### Added
8
8
- Explicitly specify MAX_THREADS_PER_BLOCK through _ \_ launch\_ bounds\_ for all
9
- manual kernels.
9
+ kernels.
10
+ - Switch to new syntax for specifying AMD GPU architecture names and features.
10
11
11
12
### Optimizations
12
13
- Optimized C2C/R2C 3D 64, 81, 100, 128, 200, 256 cube sizes.
@@ -18,15 +19,6 @@ Full documentation for rocFFT is available at [rocfft.readthedocs.io](https://ro
18
19
### Changed
19
20
- rocFFT now automatically allocates a work buffer if the plan
20
21
requires one but none is provided.
21
-
22
- ## [ (Unreleased) rocFFT 1.0.9 for ROCm 4.0.0]
23
-
24
- ### Added
25
- - Explicitly specify MAX_THREADS_PER_BLOCK through _ \_ launch\_ bounds\_ for all
26
- generated kernels.
27
- - Switch to new syntax for specifying AMD GPU architecture names and features.
28
-
29
- ### Changed
30
22
- An explicit ` rocfft_status_invalid_work_buffer ` error is now
31
23
returned when a work buffer of insufficient size is provided.
32
24
- Updated online documentation.
@@ -90,4 +82,4 @@ Full documentation for rocFFT is available at [rocfft.readthedocs.io](https://ro
90
82
- Fixed correctness of certain large 1D, and 2D power-of-3, 5 sizes.
91
83
- Fixed incorrect buffer assignment for some even-length R2C transforms.
92
84
- Fixed ` <cstddef> ` inclusion on C compilers.
93
- - Fixed incorrect results on non-unit strides with SBCC/SBRC kernels.
85
+ - Fixed incorrect results on non-unit strides with SBCC/SBRC kernels.
0 commit comments