Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

CI Builds are reported as "Failed" even though bazel's exitcode is 0 #21105

Open
BetsyMcPhail opened this issue Mar 6, 2024 · 2 comments · Fixed by #22430 · May be fixed by RobotLocomotion/drake-ci#300
Open

CI Builds are reported as "Failed" even though bazel's exitcode is 0 #21105

BetsyMcPhail opened this issue Mar 6, 2024 · 2 comments · Fixed by #22430 · May be fixed by RobotLocomotion/drake-ci#300
Assignees
Labels
component: continuous integration Jenkins, CDash, mirroring of externals, website infrastructure

Comments

@BetsyMcPhail
Copy link
Contributor

For example, Mac CMake builds were reported as "Failed" with 6 build errors (e.g. https://drake-jenkins.csail.mit.edu/view/Nightly%20Production/job/mac-arm-sonoma-clang-cmake-nightly-everything-release/85/). These errors were actually warnings such as SyntaxWarning: invalid escape sequence... that CMake interpreted as failures.

The CI fix was to update CTEST_CUSTOM_ERROR_EXCEPTION in #21102.

A few things to note:

The equivalent Bazel jobs also show this warning in the output (e.g. https://drake-jenkins.csail.mit.edu/view/Nightly%20Production/job/mac-arm-sonoma-clang-bazel-nightly-everything-release/85/consoleFull) but are shown as passing in CI

Both CMake and Bazel jobs use CTestCustom.cmake.in. CMake from CMakeLists.txt and Bazel from step-build.cmake.

This issue was originally investigated in #2817 and as a result of that issue, the CAPTURE_CMAKE_ERROR option was added to CMake 3.7. See RobotLocomotion/drake-ci#63

Later, in #9268, we attempted to use launchers instead of scraping logs. That change was reverted in #9320.

@BetsyMcPhail BetsyMcPhail added the component: build system Bazel, CMake, dependencies, memory checkers, linters label Mar 6, 2024
@BetsyMcPhail BetsyMcPhail self-assigned this Mar 6, 2024
@jwnimmer-tri jwnimmer-tri added component: continuous integration Jenkins, CDash, mirroring of externals, website infrastructure and removed component: build system Bazel, CMake, dependencies, memory checkers, linters labels Mar 17, 2024
@jwnimmer-tri jwnimmer-tri closed this as not planned Won't fix, can't repro, duplicate, stale Nov 1, 2024
@BetsyMcPhail BetsyMcPhail reopened this Jan 9, 2025
@BetsyMcPhail
Copy link
Contributor Author

@jwnimmer-tri
Copy link
Collaborator

jwnimmer-tri commented Jan 13, 2025

To reproduce the most recent problem:

The PR #22436 brings back the troublesome situation:

(1) The snopt repository rule fix is reverted, so that running a build will produce spurious DEBUG messages.

(2) The CTest regex fix is reverted, so that the DEBUG is (erroneously) tallied as an error.

It's okay that the DEBUG is tallied as an error. Ideally it would be a warning but we can fine-tune that later. What's NOT okay is for the CI result to show red, even though all tests passed.

We'd hoped that RobotLocomotion/drake-ci#300 would fix the "CI is red" problem.

To check, we can open https://drake-jenkins.csail.mit.edu/view/CMake/job/linux-jammy-gcc-cmake-experimental-everything-release/ and (after logging in if required) click "Build with Parameters" and then type in pr/22436/head to the sha1 box and pr/300/head to the ci_sha1 box and then click "Build" to start the job. Even with the fix in CI PR 300, the job is still red.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
component: continuous integration Jenkins, CDash, mirroring of externals, website infrastructure
3 participants