flowey: new xflowey vmm-tests command #1405

Open · tjones60 wants to merge 11 commits into main from xflowey_vmmtests

Conversation

tjones60
Contributor

A new xflowey command that builds everything necessary to run the VMM tests.

@tjones60 requested review from a team as code owners on May 22, 2025
@justus-camp-microsoft
Contributor

Oh wow, this is exciting - I've been wanting to look into implementing this.

@tjones60
Contributor Author

For example, to build everything for ARM64 and copy it to a Windows directory:

cargo xflowey vmm-tests --target windows-aarch64 --dir /mnt/e/vmm_tests_aarch64 --build-only --copy-extras --install-missing-deps --unstable-whp

Then run on the destination machine:

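# Configure logging/backtraces and point the tests at the copied content; WSLENV forwards these variables into WSL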
$env:OPENVMM_LOG='debug,mesh_node=info'
$env:RUST_BACKTRACE='1'
$env:RUST_LOG='trace,mesh_node=info'
$env:TEST_OUTPUT_PATH='C:\vmm_tests\test_results'
$env:VMM_TESTS_CONTENT_DIR='C:\vmm_tests'
$env:VMM_TEST_IMAGES='C:\vmm_tests\images'
$env:WSLENV='WT_SESSION:WT_PROFILE_ID::OPENVMM_LOG:RUST_BACKTRACE:RUST_LOG:TEST_OUTPUT_PATH:VMM_TESTS_CONTENT_DIR:VMM_TEST_IMAGES'
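# Run the archived VMM tests out of the content directory with cargo-nextest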
.\cargo-nextest.exe nextest run `
	--profile default `
	--config-file 'C:\vmm_tests\nextest.toml' `
	--workspace-remap 'C:\vmm_tests\' `
	--archive-file 'C:\vmm_tests\vmm-tests-archive.tar.zst' `
	--filter-expr 'all()'

(I will document this and maybe include the command as a script file in the output dir)

@daniel5151
Contributor

daniel5151 commented May 27, 2025

Out of curiosity - is the idea of having a flowey-backed petri_artifact_resolver still something that's on folks' radar?

This approach, where flowey drives nextest execution, has the unfortunate property of having to write and maintain an entirely new in-house CLI for executing tests (i.e: the new xflowey vmm-tests CLI), with all the overhead of maintaining the end-user docs, as well as ensuring the semantic mapping between the CLI and the underlying nextest filters / artifact dependencies is kept up-to-date as more tests are written*. Also, note that I've actually gone down this route before (back when I had that cargo xjob experiment that never went anywhere), so this is stuff that I've already run into / thought about previously.

The inverse approach, where users invoke cargo nextest run directly, and the test runner "shells-out" to flowey in order to JIT resolve required test artifacts, should be a lot easier to scale and maintain long-term, both from a user and maintainer perspective. Maintainers don't need to write a bespoke CLI (and corresponding docs), nor do they need to explicitly maintain any semantic maps between their bespoke CLI and the underlying tests / test-filters. Users can leverage existing open-source docs on using nextest / composing nextest filters, and sidestep needing to learn yet another project-specific tool.

There's also the "middle ground" approach, where a petri_artifact_resolver generates / reads from a test_artifact_manifest.json/toml file on a particular cargo nextest run invocation - a k/v list mapping artifacts to backing files (e.g: with the schema "key": "path/to/artifact" | "$unresolved") - and a cargo xflowey resolve-test-artifacts tool is then used to resolve the required artifacts from the test_artifact_manifest file. This approach has the added benefit of making it really easy to slot in your own "private" artifacts (i.e: just swap in the path to your custom artifact - no "magic path" nonsense required).
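e.g., a manifest following that schema might look something like this (the artifact keys and paths here are purely illustrative, not actual artifact names):

{
  "openvmm": "target/x86_64-pc-windows-msvc/release/openvmm.exe",
  "pipette_windows": "$unresolved",
  "vmm_test_images": "$unresolved"
}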

Anyways, just some food for thought. If this is the easiest on-ramp to enable simple local test runs, then obviously #shipit


*then again, you're already doing that in the CI case anyways, so as long as you've got good code-sharing between those two paths, maybe it's not a big deal.

@jstarks
Member

jstarks commented May 27, 2025

Having cargo nextest run transparently build artifacts is a bit awkward, because you'd be running a bunch of tests in parallel, which would then need to coordinate. Plus, this would pollute the test log output terribly, especially if any of the build steps fail (which they often do during development).

So, I think we'd still want a wrapper command, even if we did more to automate the discovery of what needed to be built.

@daniel5151
Contributor

daniel5151 commented May 27, 2025

Sounds like the hybrid, multi-command approach of having a test_artifact_manifest.json/toml file would be a good balance then?

i.e: a user workflow would look like:

  1. attempt to run a test (via cargo nextest run [user-test-filter])
  2. initial test run will notice that the test_artifact_manifest.json/toml hasn't been generated / contains "unresolved" entries, resulting in the test run terminating early (with a helpful error message)
  3. user then runs cargo xflowey resolve-test-artifacts path/to/test_artifact_manifest to build + resolve the missing artifacts
  4. user re-runs original command, with tests successfully running now that all artifacts have been built

If a "one click" wrapper is needed, it can certainly be implemented on top of this workflow, but it really seems unfortunate to have yet another bespoke, complex in-house CLI for folks to memorize (vs. the relatively "simple" resolve-test-artifacts CLI I'm suggesting here).
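Concretely, the loop might look something like this (the filter expression and error text are illustrative, not actual output):

cargo nextest run 'test(vmm_tests)'
# run exits early: test_artifact_manifest.json still contains "$unresolved" entries
cargo xflowey resolve-test-artifacts path/to/test_artifact_manifest.json
# builds the missing artifacts and fills in their manifest entries with real paths
cargo nextest run 'test(vmm_tests)'
# all artifacts resolved - tests run normally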

@tjones60
Contributor Author

Thanks for your input, Daniel - I think some sort of JSON manifest with an automatic artifact resolver would be great, and maybe I'll work on that next. For now, though, this is a good first step toward making it easier to run the VMM tests. Either way, we'll want a wrapper command that does everything, so that improvement could slot into this design without much change to the interface.

@daniel5151
Contributor

Hey, you know me - I've got no skin in the game here, just sharing my thoughts from afar 😉

The only other food-for-thought question I'll leave is wrt the long-term maintenance overhead of this current approach.

As is, this approach seems to be taking a "procedural" approach to defining test classes + their corresponding artifact requirements, which means that as y'all onboard more tests, the wrapper will also need to be updated / tested to work correctly, right? Does that mean that subsequent PRs that introduce new tests will require manual, pre-commit validation from the author that they work correctly with xflowey vmm-tests? Is that just something that will be manually enforced?

If there's already buy-in with folks on the team that this will be part of the vmm_test authoring workflow, then sure, that seems workable. If not... it would be unfortunate if someone became the "xflowey vmm-tests cop", manually fixing up the infrastructure from time to time.


I'll also note that - by going with a manifest-based artifact resolver - you can always add something along the lines of a gh_actions_artifact_resolver for use in CI (or even locally?), which would allow CI vmm-test runs to switch over to a "declarative" artifact wiring model. i.e: instead of needing to have the VMM tests job explicitly invoke an init_vmm_test_env job - the job could instead defer that logic to a runtime artifact resolver that can dynamically download artifacts generated by previous jobs in the pipeline run.

This is, of course, another stretch goal, but the point is that if you can execute on switching vmm-tests artifacts over to a declarative model for both local and CI scenarios, you no longer need to worry (as much) about manually maintaining all these various test "classes" and "filters" and "artifact sets". Instead - you simply declare what tests you want to run... and then the resolver figures everything out for you.

@tjones60 force-pushed the xflowey_vmmtests branch from e8e50e1 to fff13c1 on May 30, 2025