[wip][ci] Run cross platform tests #11955

trivialfis · 2026-01-27T08:24:01Z

related: #11944

failure logs:
https://github.com/dmlc/xgboost/actions/runs/21430698435/job/61712473307

Copilot

Pull request overview

Adds CI coverage for cross-platform model portability by training an XGBoost model on Linux GPU, then validating inference on macOS using a freshly built macOS wheel.

Changes:

Add a cross-platform model train/inference script to generate and validate a portable model artifact.
Extend main.yml to build macOS wheels and run a macOS inference job using a model artifact trained on Linux GPU.
Update lint and macOS test environment configs to include the new test directory and adjust dependencies.

Reviewed changes

Copilot reviewed 5 out of 5 changed files in this pull request and generated 4 comments.

Show a summary per file

File	Description
`tests/cross-platform/test_cross_platform_model.py`	New script to train a model (GPU) and validate inference on another platform.
`ops/script/lint_python.py`	Adds the new test directory to formatting/type-check paths.
`ops/conda_env/macos_cpu_test.yml`	Updates macOS CI env deps (notably unpins dask/distributed).
`.github/workflows/python_wheels_macos.yml`	Removes standalone macOS wheel workflow (functionality moved into `main.yml`).
`.github/workflows/main.yml`	Adds macOS wheel build job and macOS inference job; uploads/downloads the cross-platform model artifact.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

.github/workflows/main.yml

Copilot · 2026-01-27T08:42:12Z

tests/cross-platform/test_cross_platform_model.py

+    accuracy = accuracy_score(y, clf.predict(X))
+
+    clf.get_booster().set_attr(expected_accuracy=str(accuracy))
+    clf.save_model(model_path)
+
+
+def test_inference(model_path: str) -> None:
+    """Load model, run inference and verify accuracy matches."""
+    X, y = get_data()
+
+    clf = xgb.XGBClassifier()
+    clf.load_model(model_path)
+
+    accuracy = accuracy_score(y, clf.predict(X))
+    ea = clf.get_booster().attr("expected_accuracy")
+    assert ea is not None
+    expected_accuracy = float(ea)
+
+    np.testing.assert_allclose(accuracy, expected_accuracy)


This test only asserts that the accuracy matches across platforms, which can miss cross-platform inference differences (e.g., prediction changes that happen to preserve overall accuracy). To make the cross-platform check meaningful, persist a stronger signal in the trained model artifact (e.g., a checksum of predictions/probabilities on a fixed dataset, or the raw prediction vector) and compare that in inference.

ops/conda_env/macos_cpu_test.yml

trivialfis · 2026-01-28T14:37:28Z

I downloaded the model and tested it on a local Linux build and a local Windows build. Both work as expected. Not sure what I'm doing wrong with the macos CI test job here

Copilot

Pull request overview

Copilot reviewed 5 out of 5 changed files in this pull request and generated 4 comments.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot · 2026-01-28T14:38:24Z

tests/cross-platform/test_cross_platform_model.py

+def test_inference(model_path: str) -> None:
+    """Load model, run inference and verify accuracy matches."""
+    X, y = get_data()
+
+    clf = xgb.XGBClassifier()


This file/function naming matches pytest discovery (test_*.py and test_*). If someone runs pytest tests (or CI broadens test paths), pytest will collect test_inference(model_path) and error because model_path is treated as a fixture that doesn't exist. Consider renaming the file/functions to avoid pytest collection, or refactor into a proper pytest test with a real fixture.

Copilot · 2026-01-28T14:38:24Z

tests/cross-platform/test_cross_platform_model.py

+        "--model-path",
+        type=str,
+        default="cross_platform_model.ubj",
+        help="Path to model file",
+    )


The linked issue describes the crash when saving in JSON and loading on macOS, but this test defaults to UBJSON (.ubj). To exercise the reported failure mode, consider using a .json model (or explicitly configuring the format) for the cross-platform artifact and inference step.

Copilot · 2026-01-28T14:38:25Z

.github/workflows/main.yml

+        run: |
+          python tests/cross-platform/test_cross_platform_model.py \
+            --inference --model-path cross_platform_model.ubj


The inference job runs the test against cross_platform_model.ubj. Since the linked crash report is for JSON model loading, consider switching this to a .json artifact (or adding a JSON variant) so CI exercises the same code path.

.github/workflows/main.yml

Co-authored-by: Copilot <[email protected]>

trivialfis requested a review from Copilot January 27, 2026 08:37

Copilot started reviewing on behalf of trivialfis January 27, 2026 08:38 View session

Copilot AI reviewed Jan 27, 2026

View reviewed changes

trivialfis mentioned this pull request Jan 27, 2026

train on CUDA and load on CPU but got an EXC_BAD_ACCESS crash #11944

Open

trivialfis requested a review from Copilot January 28, 2026 14:31

Copilot started reviewing on behalf of trivialfis January 28, 2026 14:32 View session

Copilot AI reviewed Jan 28, 2026

View reviewed changes

trivialfis and others added 6 commits January 28, 2026 22:39

[wip][ci] Run cross platform tests

de37f3b

version.

4390e21

version.

611a740

shell.

0c58c42

Update .github/workflows/main.yml

d804f24

Co-authored-by: Copilot <[email protected]>

try to specify the macos tag.

24aea18

trivialfis force-pushed the ci-cross-jobs-tests branch from 6c6b5cb to 24aea18 Compare January 28, 2026 14:39

trivialfis and others added 2 commits January 28, 2026 22:50

Use minimal env.

91ef54e

Update .github/workflows/main.yml

44c616b

Co-authored-by: Copilot <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[wip][ci] Run cross platform tests #11955

[wip][ci] Run cross platform tests #11955

trivialfis commented Jan 27, 2026 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Copilot AI Jan 27, 2026

Uh oh!

Uh oh!

trivialfis commented Jan 28, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI Jan 28, 2026

Uh oh!

Copilot AI Jan 28, 2026

Uh oh!

Copilot AI Jan 28, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Uh oh!

[wip][ci] Run cross platform tests #11955

Are you sure you want to change the base?

[wip][ci] Run cross platform tests #11955

Conversation

trivialfis commented Jan 27, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Copilot AI Jan 27, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

trivialfis commented Jan 28, 2026

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Copilot AI Jan 28, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Jan 28, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Jan 28, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

trivialfis commented Jan 27, 2026 •

edited

Loading