Test failures on unstable: mpi_gf_np2 / 3 / 4 #977

Open
the-hampel opened this issue Jan 9, 2025 · 1 comment

Comments

@the-hampel
Member

Currently the solid_dmft CI (unstable branch) is failing because 3 tests appear to fail on triqs/unstable: https://github.com/TRIQS/solid_dmft/actions/runs/12678000403/job/35334605738. I can reproduce the issue locally on my Mac with a completely different compiler setup. The test output is:

    Start 166: mpi_gf_np2

166: Test command: /opt/homebrew/bin/mpiexec "-n" "2" "--oversubscribe" "/Users/ahampel/git/triqs/triqs/build/test/c++/gfs/mpi_gf"
166: Working Directory: /Users/ahampel/git/triqs/triqs/build/test/c++/gfs
166: Test timeout computed to be: 10000000
166: MPI environment detected
166: [==========] Running 5 tests from 1 test suite.
166: [----------] Global test environment set-up.
166: [----------] 5 tests from MpiGf
166: [ RUN      ] MpiGf.Reduce
166: MPI environment detected
166: [==========] Running 5 tests from 1 test suite.
166: [----------] Global test environment set-up.
166: [----------] 5 tests from MpiGf
166: [ RUN      ] MpiGf.Reduce
166: [       OK ] MpiGf.Reduce (0 ms)
166: [ RUN      ] MpiGf.AllReduce
166: [       OK ] MpiGf.AllReduce (0 ms)
166: [ RUN      ] MpiGf.ReduceView
166: [       OK ] MpiGf.ReduceView (0 ms)
166: [ RUN      ] MpiGf.ReduceBlock
166: [       OK ] MpiGf.Reduce (0 ms)
166: [ RUN      ] MpiGf.AllReduce
166: [       OK ] MpiGf.AllReduce (0 ms)
166: [ RUN      ] MpiGf.ReduceView
166: [       OK ] MpiGf.ReduceView (0 ms)
166: [ RUN      ] MpiGf.ReduceBlock
166: [       OK ] MpiGf.ReduceBlock (0 ms)
166: [ RUN      ] MpiGf.ReduceBlockView
166: [       OK ] MpiGf.ReduceBlock (0 ms)
166: [ RUN      ] MpiGf.ReduceBlockView
166: libc++abi: terminating due to uncaught exception of type nda::runtime_error: Error at /Users/ahampel/git/triqs/triqs/build/deps/nda_src/c++/nda/././linalg/../././_impl_basic_array_view_common.hpp : 455
166:
166: Error in assign_from_ndarray: Size mismatch:
166:  LHS.shape() = (16 1 1)
166:  RHS.shape() = (0 0 0)

Locally I am using:

-- The CXX compiler identification is Clang 19.1.4
-- Detecting CXX compiler ABI info
-- Detecting CXX compiler ABI info - done
-- Check for working CXX compiler: /opt/homebrew/opt/llvm/bin/clang++ - skipped
-- Detecting CXX compile features
-- Detecting CXX compile features - done
-- -------- Checking compiler C++ status-------------
-- Build type: Release
-- -------- triqs version and git hash detection -------------
-- Triqs version : 3.3.1
-- Git hash : 8bc7491ff20fe2b1188f9c582048bdc7b3e42d7c
...
--  =============== Configuring Dependency mpi ===============
-- mpi version : 1.3.0
-- mpi Git hash: a22f9a8c9c8755498743c1edb465ae4b0e0ef38e
-- Dependency GTest was already resolved.
-- Dependency itertools was already resolved.
-- -------- MPI detection -------------
-- Found MPI_CXX: /opt/homebrew/Cellar/open-mpi/5.0.6/lib/libmpi.dylib (found version "3.1")
-- Found MPI: TRUE (found version "3.1") found components: CXX
-- Found OpenMP_CXX: -fopenmp=libomp (found version "5.1")

The solid_dmft CI, in contrast, uses gcc 13 and OpenMPI 4 from Ubuntu 24.04.

Could someone have a look at this? Thanks!

@Thoemi09
Contributor

Hi Alex,

Sorry about that. There is already a PR that should fix the problem (#973), but we haven't merged it yet.
