You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Describe the Bug
Unit tests throw an instance of std::bad_any_cast when compiling with MATX_DISABLE_CUB_CACHE=OFF
Note: Google Test filter = ReductionTestsNumericNoHalfAllExecs/0.Sum*
[==========] Running 1 test from 1 test suite.
[----------] Global test environment set-up.
[----------] 1 test from ReductionTestsNumericNoHalfAllExecs/0, where TypeParam = cuda::std::__4::tuple<unsigned int, matx::cudaExecutor>
[ RUN ] ReductionTestsNumericNoHalfAllExecs/0.Sum
terminate called after throwing an instance of 'std::bad_any_cast'
Note, even compiling with MATX_DISABLE_CUB_CACHE=OFF fails at the moment. The reproducer above has all .cu files in test/CMakeLists.txt commented out except for main.cu and ReductionTests.cu. If all .cu files are included, then we see many compiler errors such as
Consolidate compiler generated dependencies of target matx_test
[ 7%] Building CUDA object test/CMakeFiles/matx_test.dir/00_solver/SVD.cu.o MatX3/include/matx/core/tensor_desc.h: In function 'void matx::svdpi_impl(UType&, SType&, VTType&, AType&, X0Type&, int, const matx::cudaExecutor&, matx::index_t) [with UType = matx::detail::tensor_impl_t<cuda::std::__4::complex<double>, 3, matx::tensor_desc_t<c
uda::std::__4::array<long long int, 3>, cuda::std::__4::array<long long int, 3>, 3> >; SType = matx::detail::tensor_impl_t<double, 2, matx::tenso
r_desc_t<cuda::std::__4::array<long long int, 2>, cuda::std::__4::array<long long int, 2>, 2> >; VTType = matx::detail::tensor_impl_t<cuda::std::
__4::complex<double>, 3, matx::tensor_desc_t<cuda::std::__4::array<long long int, 3>, cuda::std::__4::array<long long int, 3>, 3> >; AType = matx
::detail::tensor_impl_t<cuda::std::__4::complex<double>, 3, matx::tensor_desc_t<cuda::std::__4::array<long long int, 3>, cuda::std::__4::array<lo
ng long int, 3>, 3> >; X0Type = matx::detail::RandomOp<double, cuda::std::__4::array<long long int, 2> >]':
MatX3/include/matx/core/tensor_desc.h:318:33: error: array subscript 3 is outside array bounds of 'matx::detail::tensor_impl_t<double, 1, matx::tensor_desc_t<cuda::std::__4::array<long long int, 1>, cuda::std::__4::array<long long int, 1>, 1> > [1]' [-Werror=array-bounds]
318 | return *(stride_.begin() + dim);
| ^
MatX3/include/matx/transforms/svd/svd_cuda.h:215:6: note: while referencing 's'
215 | auto s = slice<RANK-2>(S,sShapeB, sShapeE);
| ^
Describe the Bug
Unit tests throw an instance of std::bad_any_cast when compiling with MATX_DISABLE_CUB_CACHE=OFF
To Reproduce
Tested with
Expected Behavior
Unit tests should all pass.
Code Snippets
N/A
System Details (please complete the following information):
Additional Context
N/A
The text was updated successfully, but these errors were encountered: