[Data] Make `test_hanging_detector_detects_issues` more robust #57567

bveeramani · 2025-10-08T20:05:09Z

Why are these changes needed?

test_hanging_detector_detects_issues checks that Ray Data emits a log if one task takes a lot longer than the others. The issue is that the test doesn't capture the log output correctly, and so the test fails even though Ray data correctly emits the log.

To make this test more robust, this PR uses pytest's caplog fixture to capture the logs rather than a bespoke custom handler.

[2025-10-08T09:00:41Z] >           assert hanging_detected, log_output
  | [2025-10-08T09:00:41Z] E           AssertionError:
  | [2025-10-08T09:00:41Z] E           assert False
  | [2025-10-08T09:00:41Z]
  | [2025-10-08T09:00:41Z] python/ray/data/tests/test_issue_detection_manager.py:153: AssertionError
  |

Related issue number

Checks

I've signed off every commit(by using the -s flag, i.e., git commit -s) in this PR.
I've run pre-commit jobs to lint the changes in this PR. (pre-commit setup)
I've included any doc changes needed for https://docs.ray.io/en/master/.
- I've added any new APIs to the API Reference. For example, if I added a
  method in Tune, I've added it in doc/source/tune/api/ under the
  corresponding .rst file.
I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/
Testing Strategy
- Unit tests
- Release tests
- This PR is not tested :(

Signed-off-by: Balaji Veeramani <[email protected]>

gemini-code-assist

Code Review

This pull request significantly improves the robustness of test_hanging_detector_detects_issues by replacing a complex, custom log-capturing mechanism with the standard pytest caplog fixture. This change simplifies the test, makes it easier to understand, and less prone to errors. The addition of the restore_data_context fixture is also a great improvement for test isolation.

I have one suggestion to make the test assertion even more specific and robust.

python/ray/data/tests/test_issue_detection_manager.py

omatthew98

Thanks!

Signed-off-by: Balaji Veeramani <[email protected]>

This reverts commit c338363. Signed-off-by: Balaji Veeramani <[email protected]>

…roject#57567)   ## Why are these changes needed?  `test_hanging_detector_detects_issues` checks that Ray Data emits a log if one task takes a lot longer than the others. The issue is that the test doesn't capture the log output correctly, and so the test fails even though Ray data correctly emits the log. To make this test more robust, this PR uses pytest's `caplog` fixture to capture the logs rather than a bespoke custom handler. ``` [2025-10-08T09:00:41Z] > assert hanging_detected, log_output | [2025-10-08T09:00:41Z] E AssertionError: | [2025-10-08T09:00:41Z] E assert False | [2025-10-08T09:00:41Z] | [2025-10-08T09:00:41Z] python/ray/data/tests/test_issue_detection_manager.py:153: AssertionError | ``` ## Related issue number  ## Checks - [ ] I've signed off every commit(by using the -s flag, i.e., `git commit -s`) in this PR. - [ ] I've run pre-commit jobs to lint the changes in this PR. ([pre-commit setup](https://docs.ray.io/en/latest/ray-contribute/getting-involved.html#lint-and-formatting)) - [ ] I've included any doc changes needed for https://docs.ray.io/en/master/. - [ ] I've added any new APIs to the API Reference. For example, if I added a method in Tune, I've added it in `doc/source/tune/api/` under the corresponding `.rst` file. - [ ] I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/ - Testing Strategy - [ ] Unit tests - [ ] Release tests - [ ] This PR is not tested :( --------- Signed-off-by: Balaji Veeramani <[email protected]> Signed-off-by: Josh Kodi <[email protected]>

Initial commit

0b90305

Signed-off-by: Balaji Veeramani <[email protected]>

bveeramani requested a review from a team as a code owner October 8, 2025 20:05

bveeramani assigned iamjustinhsu Oct 8, 2025

gemini-code-assist bot reviewed Oct 8, 2025

View reviewed changes

python/ray/data/tests/test_issue_detection_manager.py Show resolved Hide resolved

omatthew98 approved these changes Oct 8, 2025

View reviewed changes

iamjustinhsu approved these changes Oct 8, 2025

View reviewed changes

aslonnie added the go add ONLY when ready to merge, run all tests label Oct 8, 2025

ray-gardener bot added the data Ray Data-related issues label Oct 9, 2025

bveeramani added 2 commits October 9, 2025 08:40

Initial commit

c338363

Signed-off-by: Balaji Veeramani <[email protected]>

Revert "Initial commit"

79bb8d7

This reverts commit c338363. Signed-off-by: Balaji Veeramani <[email protected]>

bveeramani merged commit 070820e into master Oct 10, 2025
6 checks passed

bveeramani deleted the refacot-issue-detection branch October 10, 2025 22:29

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Data] Make `test_hanging_detector_detects_issues` more robust #57567

[Data] Make `test_hanging_detector_detects_issues` more robust #57567

Uh oh!

bveeramani commented Oct 8, 2025

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

Uh oh!

omatthew98 left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

[Data] Make test_hanging_detector_detects_issues more robust #57567

[Data] Make test_hanging_detector_detects_issues more robust #57567

Uh oh!

Conversation

bveeramani commented Oct 8, 2025

Why are these changes needed?

Related issue number

Checks

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

omatthew98 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

[Data] Make `test_hanging_detector_detects_issues` more robust #57567

[Data] Make `test_hanging_detector_detects_issues` more robust #57567