filemanager: add crawl function #852

mmalenic · 2025-02-04T02:59:53Z

I think the filemanager API should have a crawl function. This could be based on the existing inventory function, but it might be more convenient to use list operations directly. Records often become out of date due to missing features or bugs, so as well as ingesting new objects, the crawl function could correct records that are inconsistent. Placing this inside the filemanager itself allows the S3 API code to be re-used, rather than tying the logic to update scripts and individual API calls.
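As a rough illustration of the reconciliation idea, here is a minimal sketch in Python. It assumes the bucket listing (for example, from an S3 list operation) and the stored records have already been loaded into memory. `ObjectState` and `plan_crawl` are hypothetical names for illustration only, not part of the filemanager, and comparing ETags is just one possible consistency check.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ObjectState:
    """Hypothetical minimal view of an object: its key and ETag."""

    key: str
    e_tag: str


def plan_crawl(listed, recorded):
    """Compare listed objects against stored records.

    Returns three sorted lists of keys:
    (to_ingest, to_correct, to_mark_deleted).
    """
    listed_by_key = {obj.key: obj for obj in listed}
    recorded_by_key = {rec.key: rec for rec in recorded}

    # Present in the bucket but unknown to the database: ingest.
    to_ingest = sorted(listed_by_key.keys() - recorded_by_key.keys())

    # Known to the database but no longer in the bucket: mark as deleted.
    to_mark_deleted = sorted(recorded_by_key.keys() - listed_by_key.keys())

    # Present in both, but the stored ETag disagrees with the listing: correct.
    to_correct = sorted(
        key
        for key in listed_by_key.keys() & recorded_by_key.keys()
        if listed_by_key[key].e_tag != recorded_by_key[key].e_tag
    )

    return to_ingest, to_correct, to_mark_deleted
```

A real crawl would page through the listing and apply the resulting plan to the database; this sketch only shows the comparison step.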

Related to #844.

alexiswl · 2025-02-05T05:28:14Z

@mmalenic could this be a workaround in the interim? https://stackoverflow.com/a/37475908/6946787

mmalenic · 2025-02-05T06:14:39Z

Possibly. All the filemanager needs to link the objects is the umccr-org:OrcaBusFileManagerIngestId tag, which is present on the objects and holds the value of the ingest_id. So if copying from source to destination also copies the tags, and that triggers a new Object Created event, then the filemanager should pick up the existing tag.
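To make the tag lookup concrete, here is a small Python sketch of reading the ingest id from an object's tags. It assumes the tags are given as a list of `Key`/`Value` dictionaries, which is the shape of the `TagSet` in an S3 GetObjectTagging response. The helper name `ingest_id_from_tags` is hypothetical, and this is not the filemanager's actual code.

```python
# Tag key taken from the comment above.
INGEST_ID_TAG = "umccr-org:OrcaBusFileManagerIngestId"


def ingest_id_from_tags(tag_set):
    """Return the ingest id stored in the object's tags, or None if absent."""
    for tag in tag_set:
        if tag.get("Key") == INGEST_ID_TAG:
            return tag.get("Value")
    return None
```

If the copy preserves tags, the copied object would return the same ingest id as the source, which is what lets the two records be linked.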

mmalenic · 2025-02-05T06:43:57Z

Sorry, to clarify: the tag with the ingest id is already present, so I think this should work as long as the copy creates a new Object Created event.

alexiswl · 2025-02-05T08:10:16Z

No stress. For these runs we're compressing them, so they will generate new objects anyway.

mmalenic self-assigned this Feb 4, 2025

mmalenic added the feature and filemanager labels Feb 4, 2025

mmalenic mentioned this issue Feb 12, 2025

feat: filemanager crawl #859

Merged

mmalenic closed this as completed in #859 Feb 13, 2025
