refactor(api): refactor purl API query code #2900

hogo6002 · 2024-11-26T05:32:55Z

Converted PURL queries to package queries in do_query() to remove unnecessary code. Querying the Datastore directly with PURLs gives the same results as querying by package name, so we can simplify things by treating PURLs like regular package queries after extracting the package/ecosystem/version.

This will also resolve issue #2842 by rewriting the PURL-to-ecosystem logic. For most ecosystems, we can get the name from purl.type, but for Linux distributions, we need to use purl.namespace.

andrewpollock

Nice simplification!

osv/purl_helpers.py

michaelkedar

Nice!

osv/purl_helpers.py

oliverchang

very nice! Is there a way we can do some kind of testing to make sure we're not introducing any regressions on matching behaviour?

i.e. some kind of comprehensive comparison between test (with this deployed) and prod (without this deployed) for specific ecosystems?

osv/purl_helpers.py

hogo6002 · 2024-11-27T04:08:49Z

very nice! Is there a way we can do some kind of testing to make sure we're not introducing any regressions on matching behaviour?

We have integration tests for the API (LONG_TESTS), which I have manually checked. These tests do not produce any 500 errors. Also, after checking in the test instance, it will be tested by performance testing, and I will monitor for any new 500 errors. I think this change should be safe.

hogo6002 · 2024-11-27T05:01:53Z

gcp/api/fixtures/IntegrationTests_api_query_response.txt

-    '200:{"commit": "d374094d8c49b6b7d288f307e11217ec5a502391", "package": {"purl": "pkg:pypi/[email protected]", "version": "0.8.5"}}\n',
-    '200:{"commit": "d374094d8c49b6b7d288f307e11217ec5a502391", "package": {"purl": "pkg:pypi/[email protected]"}}\n',
-    '200:{"commit": "d374094d8c49b6b7d288f307e11217ec5a502391", "package": {"version": "0.8.5"}}\n',
+[   '400:{"commit": "d374094d8c49b6b7d288f307e11217ec5a502391", "package": '


This long test has been skipped and hasn't been updated in a long time (it is also failing for the master branch). The script is generated using the command TESTS_GENERATE=1. Most of the 400 failures are caused by querying both the PURL and package name at the same time, which is invalid.

We could also re-enable long tests on the pipeline (run after merge), it doesn't run too long (maybe 1 min)

gcp/api/server.py

osv/purl_helpers.py

refactor(api): refactor api query for purl

320917b

hogo6002 requested review from oliverchang, andrewpollock and another-rex November 26, 2024 05:32

add dependency to root pyproject.toml

686fa73

andrewpollock reviewed Nov 26, 2024

View reviewed changes

osv/purl_helpers.py Outdated Show resolved Hide resolved

oliverchang requested a review from michaelkedar November 26, 2024 22:22

michaelkedar reviewed Nov 26, 2024

View reviewed changes

osv/purl_helpers.py Outdated Show resolved Hide resolved

osv/purl_helpers.py Outdated Show resolved Hide resolved

hogo6002 added 2 commits November 27, 2024 14:10

refactor parse_purl and update intergration test

04662aa

add comment for golang purl parsing

e624c2c

hogo6002 requested review from michaelkedar and andrewpollock November 27, 2024 03:21

hogo6002 added 3 commits November 27, 2024 14:22

fix lint

11f7c57

add packageURL dependency to worker and website

9ed4e26

merge from master and resolve conflicts

9943bb9

oliverchang reviewed Nov 27, 2024

View reviewed changes

osv/purl_helpers.py Outdated Show resolved Hide resolved

update to namedtuple and update test generate

2dd312d

hogo6002 commented Nov 27, 2024

View reviewed changes

another-rex reviewed Nov 27, 2024

View reviewed changes

gcp/api/server.py Show resolved Hide resolved

gcp/api/server.py Show resolved Hide resolved

gcp/api/server.py Outdated Show resolved Hide resolved

osv/purl_helpers.py Outdated Show resolved Hide resolved

andrewpollock approved these changes Nov 27, 2024

View reviewed changes

osv/purl_helpers.py Outdated Show resolved Hide resolved

osv/purl_helpers.py Outdated Show resolved Hide resolved

throw error in parse_purl

fb0b5d4

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

refactor(api): refactor purl API query code #2900

refactor(api): refactor purl API query code #2900

hogo6002 commented Nov 26, 2024

andrewpollock left a comment

michaelkedar left a comment

oliverchang left a comment •

edited

Loading

hogo6002 commented Nov 27, 2024

hogo6002 Nov 27, 2024

refactor(api): refactor purl API query code #2900

Are you sure you want to change the base?

refactor(api): refactor purl API query code #2900

Conversation

hogo6002 commented Nov 26, 2024

andrewpollock left a comment

Choose a reason for hiding this comment

michaelkedar left a comment

Choose a reason for hiding this comment

oliverchang left a comment • edited Loading

Choose a reason for hiding this comment

hogo6002 commented Nov 27, 2024

hogo6002 Nov 27, 2024

Choose a reason for hiding this comment

oliverchang left a comment •

edited

Loading