feat!: introduce confidence scores for check facts #620

behnazh-w · 2024-01-31T03:00:56Z

Confidence score and justification type

This PR allows specifying confidence scores for check results, which is especially useful when a check reports multiple candidate results. All of these confidence scores are added to the check tables in the database. However, the fact that has the highest confidence is shown in the HTML/JSON report only.
The justifications are no longer required to be added manually to the CheckResultData. Instead, they are curated directly from the results in the table. If a column has specified JustificationType in the column mapping, it will be picked up automatically and rendered as plain text or href depending on the specified type. If a check fails or is skipped, we show a default Not Available. justification. This allows to create HTML/JSON reports from the database reproducibly.

Refactoring

For this feature to work, the following refactorings are done as well:

The checks are changed to add multiple facts to a result table.
The confidence score is added to CheckFacts ORM mapping, which all the check mappings inherit from. That caused some circular dependency issues for the Expectation ORM mapping. I have refactored and improved this mapping accordingly.
I have improved the check table columns that needed to be added as justification. For example, I have added asset_url to ProvenanceAvailableFacts. That required adding a new SLSAProvenanceData wrapper class for GitHub release provenances.

Documentation

I have added elaborate instructions for adding a new check and explained the current interface.

Signed-off-by: behnazh-w <[email protected]>

docs/source/pages/developers_guide/index.rst

src/macaron/slsa_analyzer/checks/check_result.py

docs/source/pages/developers_guide/index.rst

src/macaron/slsa_analyzer/checks/check_result.py

docs/source/pages/developers_guide/index.rst

src/macaron/slsa_analyzer/checks/check_result.py

src/macaron/slsa_analyzer/analyze_context.py

src/macaron/slsa_analyzer/checks/check_result.py

nicallen · 2024-02-05T01:25:47Z

Under this new schema, the confidence score is recorded for a check_facts entry, which currently contains multiple pieces of information that would be forced to share a confidence score. For example, the build_as_code_check entries contain both ci_service_name and deploy_command which are derived by different means and hence may have different levels of confidence, which cannot be expressed in the current schema. This could be fixed this by splitting the build_as_code_check and similar tables into smaller tables with more granularity of information. Are you intending to do that in later changes?

src/macaron/slsa_analyzer/specs/ci_spec.py

behnazh-w · 2024-02-06T00:41:40Z

Under this new schema, the confidence score is recorded for a check_facts entry, which currently contains multiple pieces of information that would be forced to share a confidence score. For example, the build_as_code_check entries contain both ci_service_name and deploy_command which are derived by different means and hence may have different levels of confidence, which cannot be expressed in the current schema. This could be fixed this by splitting the build_as_code_check and similar tables into smaller tables with more granularity of information. Are you intending to do that in later changes?

For this particular example, I think of deploy_command as the main result of the build_as_code_check , which can have different confidence scores based on the analysis. The ci_service_name fact, however, is an informational column that shows in which CI configuration the deploy command was found. So, I don't see it necessary to split the table in this case. But if a check can generate more than one fact, and each fact needs to have a different score, I agree that splitting to two smaller tables would solve the problem.

Signed-off-by: behnazh-w <[email protected]>

docs/source/pages/developers_guide/index.rst

Signed-off-by: behnazh-w <[email protected]>

docs/source/pages/developers_guide/index.rst

nathanwn

I think the PR should be good to merge.
Thanks for the PR.

docs/source/pages/developers_guide/index.rst

src/macaron/database/table_definitions.py

Signed-off-by: behnazh-w <[email protected]>

benmss

Looks good to me.

This PR changes the data model and allows specifying confidence scores for check results, which is especially useful when a check reports multiple candidate results. All of these confidence scores are added to the check tables in the database. However, the fact that has the highest confidence is shown in the HTML/JSON report only. The justifications are no longer required to be added manually to the CheckResultData. Instead, they are curated directly from the results in the table. If a column has specified JustificationType in the column mapping, it will be picked up automatically and rendered as plain text or href depending on the specified type. If a check fails or is skipped, we show a default Not Available. justification. This allows to create HTML/JSON reports from the database reproducibly. Signed-off-by: behnazh-w <[email protected]>

oracle-contributor-agreement bot added the OCA Verified All contributors have signed the Oracle Contributor Agreement. label Jan 31, 2024

feat: introduce confidence scores for check facts

7f92fe2

Signed-off-by: behnazh-w <[email protected]>

behnazh-w force-pushed the behnazh/improve-build-as-code branch from 9968c46 to 7f92fe2 Compare January 31, 2024 04:32

docs: add instructions for writing new checks

9aedac6

Signed-off-by: behnazh-w <[email protected]>

behnazh-w force-pushed the behnazh/improve-build-as-code branch from 01d446e to 9aedac6 Compare February 1, 2024 03:49

docs: update the data model diagram

aa93b0c

Signed-off-by: behnazh-w <[email protected]>

behnazh-w marked this pull request as ready for review February 1, 2024 05:32

behnazh-w requested a review from tromai as a code owner February 1, 2024 05:32

behnazh-w requested review from nicallen, nathanwn and benmss February 1, 2024 05:33

benmss reviewed Feb 1, 2024

View reviewed changes

docs/source/pages/developers_guide/index.rst Outdated Show resolved Hide resolved

benmss reviewed Feb 2, 2024

View reviewed changes

docs/source/pages/developers_guide/index.rst Outdated Show resolved Hide resolved

benmss reviewed Feb 2, 2024

View reviewed changes

docs/source/pages/developers_guide/index.rst Outdated Show resolved Hide resolved

benmss reviewed Feb 2, 2024

View reviewed changes

docs/source/pages/developers_guide/index.rst Outdated Show resolved Hide resolved

benmss reviewed Feb 2, 2024

View reviewed changes

docs/source/pages/developers_guide/index.rst Outdated Show resolved Hide resolved

nathanwn reviewed Feb 2, 2024

View reviewed changes

src/macaron/slsa_analyzer/checks/check_result.py Show resolved Hide resolved

nathanwn reviewed Feb 2, 2024

View reviewed changes

docs/source/pages/developers_guide/index.rst Outdated Show resolved Hide resolved

nathanwn reviewed Feb 2, 2024

View reviewed changes

docs/source/pages/developers_guide/index.rst Outdated Show resolved Hide resolved

nathanwn reviewed Feb 2, 2024

View reviewed changes

docs/source/pages/developers_guide/index.rst Outdated Show resolved Hide resolved

nathanwn reviewed Feb 2, 2024

View reviewed changes

docs/source/pages/developers_guide/index.rst Outdated Show resolved Hide resolved

nathanwn reviewed Feb 2, 2024

View reviewed changes

docs/source/pages/developers_guide/index.rst Outdated Show resolved Hide resolved

nathanwn reviewed Feb 2, 2024

View reviewed changes

docs/source/pages/developers_guide/index.rst Outdated Show resolved Hide resolved

nathanwn reviewed Feb 2, 2024

View reviewed changes

docs/source/pages/developers_guide/index.rst Outdated Show resolved Hide resolved

nathanwn reviewed Feb 2, 2024

View reviewed changes

docs/source/pages/developers_guide/index.rst Outdated Show resolved Hide resolved

nathanwn reviewed Feb 2, 2024

View reviewed changes

docs/source/pages/developers_guide/index.rst Outdated Show resolved Hide resolved

nathanwn reviewed Feb 2, 2024

View reviewed changes

docs/source/pages/developers_guide/index.rst Outdated Show resolved Hide resolved

nathanwn reviewed Feb 2, 2024

View reviewed changes

src/macaron/slsa_analyzer/checks/check_result.py Show resolved Hide resolved

nathanwn reviewed Feb 2, 2024

View reviewed changes

docs/source/pages/developers_guide/index.rst Show resolved Hide resolved

nathanwn reviewed Feb 2, 2024

View reviewed changes

docs/source/pages/developers_guide/index.rst Outdated Show resolved Hide resolved

nathanwn reviewed Feb 2, 2024

View reviewed changes

docs/source/pages/developers_guide/index.rst Outdated Show resolved Hide resolved

nathanwn reviewed Feb 2, 2024

View reviewed changes

src/macaron/slsa_analyzer/checks/check_result.py Show resolved Hide resolved

nathanwn reviewed Feb 2, 2024

View reviewed changes

src/macaron/slsa_analyzer/analyze_context.py Show resolved Hide resolved

nathanwn reviewed Feb 2, 2024

View reviewed changes

src/macaron/slsa_analyzer/analyze_context.py Outdated Show resolved Hide resolved

nathanwn reviewed Feb 2, 2024

View reviewed changes

src/macaron/slsa_analyzer/checks/check_result.py Outdated Show resolved Hide resolved

nathanwn reviewed Feb 2, 2024

View reviewed changes

src/macaron/slsa_analyzer/checks/check_result.py Show resolved Hide resolved

nathanwn reviewed Feb 6, 2024

View reviewed changes

src/macaron/slsa_analyzer/specs/ci_spec.py Outdated Show resolved Hide resolved

chore: address PR comments

d25bf57

Signed-off-by: behnazh-w <[email protected]>

behnazh-w force-pushed the behnazh/improve-build-as-code branch from bc4374a to d25bf57 Compare February 11, 2024 22:29

oracle deleted a comment from behnazh-w Feb 12, 2024

nathanwn reviewed Feb 13, 2024

View reviewed changes

docs/source/pages/developers_guide/index.rst Outdated Show resolved Hide resolved

docs: fix the formatting issue in docs

4661c00

Signed-off-by: behnazh-w <[email protected]>

behnazh-w changed the title ~~feat: introduce confidence scores for check facts~~ feat!: introduce confidence scores for check facts Feb 13, 2024

nathanwn reviewed Feb 13, 2024

View reviewed changes

docs/source/pages/developers_guide/index.rst Outdated Show resolved Hide resolved

nathanwn approved these changes Feb 13, 2024

View reviewed changes

benmss reviewed Feb 14, 2024

View reviewed changes

docs/source/pages/developers_guide/index.rst Outdated Show resolved Hide resolved

benmss reviewed Feb 14, 2024

View reviewed changes

docs/source/pages/developers_guide/index.rst Outdated Show resolved Hide resolved

benmss reviewed Feb 14, 2024

View reviewed changes

docs/source/pages/developers_guide/index.rst Outdated Show resolved Hide resolved

benmss reviewed Feb 14, 2024

View reviewed changes

docs/source/pages/developers_guide/index.rst Outdated Show resolved Hide resolved

benmss reviewed Feb 14, 2024

View reviewed changes

src/macaron/database/table_definitions.py Outdated Show resolved Hide resolved

chore: improve the docs

08eb1ce

Signed-off-by: behnazh-w <[email protected]>

benmss approved these changes Feb 14, 2024

View reviewed changes

nicallen approved these changes Feb 15, 2024

View reviewed changes

behnazh-w merged commit db3231f into staging Feb 15, 2024

nathanwn deleted the behnazh/improve-build-as-code branch February 15, 2024 01:59

tromai mentioned this pull request Feb 27, 2024

Add developer docs for the check interface #535

Closed

feat!: introduce confidence scores for check facts #620

feat!: introduce confidence scores for check facts #620

Uh oh!

Conversation

behnazh-w commented Jan 31, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Confidence score and justification type

Refactoring

Documentation

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

nicallen commented Feb 5, 2024

Uh oh!

Uh oh!

behnazh-w commented Feb 6, 2024

Uh oh!

Uh oh!

Uh oh!

nathanwn left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

benmss left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

behnazh-w commented Jan 31, 2024 •

edited

Loading