
Test runner. #1999

Merged
merged 11 commits into main from test_runner
Oct 31, 2024

Conversation

chriseth
Member

@chriseth chriseth commented Oct 31, 2024

Simple command line utility that can run tests in a .asm file.

Collaborator

@georgwiese georgwiese left a comment

What do you think of moving the test runner to the pipeline crate? Then we can use it to simplify tests like this:

for (name, (_, val)) in analyzed
    .definitions
    .iter()
    .filter(|(n, _)| n.starts_with("test::test_") || n.contains("::test::test_"))
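The predicate in that snippet can be exercised on its own. A minimal sketch in plain Rust, independent of powdr (`is_test_symbol` is a hypothetical helper name, not powdr's API):

```rust
/// Hypothetical helper mirroring the filter above: a symbol counts as a test
/// if it sits in a `test` module and its function name starts with `test_`.
fn is_test_symbol(name: &str) -> bool {
    name.starts_with("test::test_") || name.contains("::test::test_")
}

fn main() {
    // Qualifies: `test_add` inside a `test` module.
    assert!(is_test_symbol("std::math::fp2::test::test_add"));
    // Qualifies: top-level `test` module.
    assert!(is_test_symbol("test::test_basic"));
    // Does not qualify: not inside a `test` module.
    assert!(!is_test_symbol("std::math::fp2::add"));
    // Does not qualify: function name lacks the `test_` prefix.
    assert!(!is_test_symbol("std::math::fp2::test::check_add"));
}
```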
Collaborator

Could you rename the tests in std/math/fp{2,4}.asm to start with test_?

}
print!("Running test: {name}...");
let function = symbols.lookup(name, &None).unwrap();
evaluator::evaluate_function_call::<F>(function, vec![], &mut symbols).unwrap();
Collaborator

I think we could provide a better error message (e.g. mention the name of the failing test) and maybe continue running the other tests?

Right now, a failing test looks like this:

Running test: std::math::fp2::test::test_add...  ok
Running test: std::math::fp2::test::test_mul...  ok
thread 'main' panicked at cli/src/test_runner.rs:58:80:
called `Result::unwrap()` on an `Err` value: FailedAssertion("Wrong subtraction result")
note: run with `RUST_BACKTRACE=1` environment variable to display a backtrace
Running test: std::math::fp2::test::test_sub...% 

(test_sub failed)

Member Author

Oh interesting! I thought we implemented the built-in panic by using panic!(), but we actually return an Error. How convenient!

cli/src/main.rs Outdated
Comment on lines 380 to 383
/// Also run the tests inside the standard library.
#[arg(long)]
#[arg(default_value_t = false)]
include_std_tests: bool,
Collaborator

Do we need this flag? It seems like a hack to be able to run all std tests (by passing any file to powdr test <file> --include-std-tests) before we have #2000, but the cleaner solution would seem to be to implement #2000 and do powdr test std?

Member Author

It filters tests based on their namespace. Since std is always included, we need to filter the tests out. If you run powdr test std with the filter active, it will not run any tests.
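The filtering described here can be sketched as follows (hypothetical names, not the actual powdr implementation):

```rust
/// Hypothetical sketch of the flag's behavior: tests in the always-present
/// `std` namespace are skipped unless `include_std_tests` is set.
fn should_run(test_name: &str, include_std_tests: bool) -> bool {
    include_std_tests || !test_name.starts_with("std::")
}

fn main() {
    // Tests outside std always run.
    assert!(should_run("my_project::test::test_foo", false));
    // std tests are filtered out by default...
    assert!(!should_run("std::math::fp2::test::test_add", false));
    // ...and run only when the flag is passed.
    assert!(should_run("std::math::fp2::test::test_add", true));
}
```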

Collaborator

Hmm, but does this need to leak to the CLI? Or could we just say we never want to run the std tests, unless we pass std explicitly?

"std::protocols::fingerprint::test::test_fingerprint",
vec![],
fn std_tests() {
let test_count = 9;
Collaborator

Is it important to assert that there are 9 tests? In what scenario would it not run all tests? I don't see a good reason, and it will be slightly annoying that we have to change this number when adding a new test.

Member Author

It's just a safeguard against missing some tests because of the naming scheme or something.

Comment on lines 401 to 411
assert_eq!(
    test_count,
    run_tests(&std_analyzed::<GoldilocksField>(), true).unwrap()
);
assert_eq!(
    test_count,
    run_tests(&std_analyzed::<Bn254Field>(), true).unwrap()
);
assert_eq!(
    test_count,
    run_tests(&std_analyzed::<BabyBearField>(), true).unwrap(),
Collaborator

We also have BabyBear and KoalaBear, actually!

Collaborator

Also, this approach assumes that all tests should be run on all fields. I think we might have tests in the future that are only supposed to pass on some fields. But we can also fix that when we get there.

Member Author

Sure, but then the test function should return early. We might find a more convenient way, though.

match evaluator::evaluate_function_call::<F>(function, vec![], &mut symbols) {
    Err(e) => {
        let msg = e.to_string();
        println!("{padding}failed\n {msg}");
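The continue-on-failure pattern discussed in this thread can be sketched independently of powdr's evaluator (hypothetical `run_tests` signature; the real runner calls into the evaluator instead of plain closures):

```rust
/// Hypothetical runner: execute every test, report failures by name, and
/// keep going instead of panicking on the first error.
fn run_tests(tests: &[(&str, fn() -> Result<(), String>)]) -> Result<usize, String> {
    let mut failures = 0;
    for (name, test) in tests {
        print!("Running test: {name}...");
        match test() {
            Ok(()) => println!("  ok"),
            Err(e) => {
                println!("  failed\n  {e}");
                failures += 1;
            }
        }
    }
    if failures == 0 {
        // On success, return the number of tests that ran.
        Ok(tests.len())
    } else {
        Err(format!("{failures} test(s) failed"))
    }
}

fn main() {
    let tests: &[(&str, fn() -> Result<(), String>)] = &[
        ("test::test_ok", || Ok(())),
        ("test::test_sub", || Err("Wrong subtraction result".to_string())),
    ];
    // With one failing test, the runner still visits every test, then errors.
    assert!(run_tests(tests).is_err());
    // With only passing tests, the count of executed tests is returned.
    assert_eq!(run_tests(&tests[..1]).unwrap(), 1);
}
```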
Collaborator

beautiful 😍

Collaborator

@georgwiese georgwiese left a comment

LGTM

@chriseth chriseth added this pull request to the merge queue Oct 31, 2024
Merged via the queue into main with commit b2a48d7 Oct 31, 2024
14 checks passed
@chriseth chriseth deleted the test_runner branch October 31, 2024 19:23