Scope out how we might organize and vend extract files #2454

chouinar · 2024-10-11T14:49:02Z

Summary

The legacy website lets users download XML extracts of opportunity data (https://www.grants.gov/xml-extract), and we should support this as well. Extracts, especially if they contain the same data as our search endpoints, reduce the traffic the search endpoint needs to handle, because we can point anyone who wants to scrape our data to an extract file instead.

We already built a quick initial script for generating the files, but we don't store them anywhere meaningful or have a way to look them up. This ticket should focus on figuring that out.

A few high-level thoughts:

The files will be stored on S3.
Whatever approach we use for vending other files (S3 presigned URLs or a CDN link), we should use here as well.
We'll likely want a table that records which files we've generated, as well as an API that queries that table for the latest records.
We may have multiple file formats for a given extract batch (CSV and JSON for now).
It's plausible that we will have more extracts in the future, so try to come up with an approach that can handle multiple sets of files (i.e. the endpoint might require you to tell it which files you want to look up).
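
To make the table-plus-lookup idea concrete, here is a minimal sketch using an in-memory SQLite database. Everything named here is an assumption for illustration only: the table `extract_file`, its columns, the `extract_type` values, and the example bucket path are hypothetical, and the real implementation would likely use the project's own database and models.

```python
import sqlite3

# Hypothetical schema: table and column names are illustrative only.
SCHEMA = """
CREATE TABLE extract_file (
    extract_file_id INTEGER PRIMARY KEY,
    extract_type    TEXT NOT NULL,  -- which set of files, e.g. 'opportunities'
    file_format     TEXT NOT NULL,  -- 'csv' or 'json' for now
    batch_id        TEXT NOT NULL,  -- groups the formats generated in one run
    s3_path         TEXT NOT NULL,  -- where the file was written
    created_at      TEXT NOT NULL   -- ISO 8601 UTC timestamp
)
"""


def make_db():
    """Create an in-memory database holding the hypothetical table."""
    conn = sqlite3.connect(":memory:")
    conn.execute(SCHEMA)
    return conn


def record_file(conn, extract_type, file_format, batch_id, s3_path, created_at):
    """Record one generated file."""
    conn.execute(
        "INSERT INTO extract_file "
        "(extract_type, file_format, batch_id, s3_path, created_at) "
        "VALUES (?, ?, ?, ?, ?)",
        (extract_type, file_format, batch_id, s3_path, created_at),
    )


def latest_files(conn, extract_type):
    """Return one entry per format from the newest batch of the given type."""
    rows = conn.execute(
        """
        SELECT file_format, s3_path, created_at
        FROM extract_file
        WHERE extract_type = ?
          AND batch_id = (
              SELECT batch_id FROM extract_file
              WHERE extract_type = ?
              ORDER BY created_at DESC, extract_file_id DESC
              LIMIT 1
          )
        ORDER BY file_format
        """,
        (extract_type, extract_type),
    ).fetchall()
    keys = ("file_format", "s3_path", "created_at")
    return [dict(zip(keys, row)) for row in rows]
```

Requiring the caller to pass `extract_type` is one way to satisfy the "multiple sets of files" point: adding a new extract later means adding rows with a new type rather than changing the schema or the endpoint.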

Acceptance criteria

Scope out a plan
Run the idea past design. Even if they don't yet have time for designs, make sure their idea of a page for extracts matches what we might do for an API
Write up the follow-up implementation tickets

mikehgrantsgov · 2024-11-05T15:37:48Z

Here are some tickets that can be spawned from this:

DB:
Create a table to track generated files, including metadata like file type, creation date, and possibly a batch ID.
API:
Build an API to query the database for the latest file records.
Note: support multiple file formats (CSV, JSON) and explore future formats.
Design:
Placeholder to collaborate with design to ensure the user interface aligns.
Future scalability spike?
Plan for future expansion to handle multiple sets of files and additional formats?
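
As a rough illustration of how one batch could yield several formats, the sketch below renders the same records to CSV and JSON and derives an object key for each file. The key layout (`extracts/<type>/<batch>/...`), the function names, and the sample field names are all hypothetical; nothing here reflects the existing generation script.

```python
import csv
import io
import json


def render_csv(records, fieldnames):
    """Render records as CSV text with a header row."""
    buf = io.StringIO()
    writer = csv.DictWriter(buf, fieldnames=fieldnames, lineterminator="\n")
    writer.writeheader()
    writer.writerows(records)
    return buf.getvalue()


def render_json(records, fieldnames):
    """Render records as a JSON array, keeping only the listed fields."""
    return json.dumps([{k: r[k] for k in fieldnames} for r in records], indent=2)


# Supporting a new format means registering one more renderer here.
RENDERERS = {"csv": render_csv, "json": render_json}


def build_batch(extract_type, batch_id, records, fieldnames):
    """Return {object_key: file_contents} for every registered format."""
    files = {}
    for fmt, render in RENDERERS.items():
        # Hypothetical key layout; the real S3 structure is still to be decided.
        key = f"extracts/{extract_type}/{batch_id}/{extract_type}.{fmt}"
        files[key] = render(records, fieldnames)
    return files
```

Keeping the renderers in a single mapping means the batch ID and key naming stay identical across formats, so a lookup by batch returns a consistent set of files.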

mikehgrantsgov · 2024-11-05T16:44:48Z

@chouinar should the tickets above be created? I can create them if you think we are ready. Should any person or team be tagged for input beforehand?

chouinar · 2024-11-07T19:10:02Z

@mikehgrantsgov - go ahead and create the tickets. Some of the design work will be a TODO, but it can't hurt to set them up.

widal001 · 2024-11-08T22:27:02Z

Beep boop: Automatically setting the point and sprint values for this issue in project HHS/13 because they were unset when the issue was closed.

mikehgrantsgov · 2024-11-08T22:27:39Z

Created the following:
#2791
#2792
#2793
#2794
#2795

chouinar added this to Simpler.Grants.gov Product Backlog Oct 11, 2024

github-project-automation bot moved this to Icebox in Simpler.Grants.gov Product Backlog Oct 11, 2024

mikehgrantsgov self-assigned this Nov 5, 2024

mikehgrantsgov moved this from Icebox to Todo in Simpler.Grants.gov Product Backlog Nov 5, 2024

mikehgrantsgov moved this from Todo to In Progress in Simpler.Grants.gov Product Backlog Nov 5, 2024

mikehgrantsgov moved this from In Progress to Done in Simpler.Grants.gov Product Backlog Nov 8, 2024

mikehgrantsgov closed this as completed by moving to Done in Simpler.Grants.gov Product Backlog Nov 8, 2024

dbateyko mentioned this issue Feb 3, 2025

Creating an archive of Full Announcement attachments #3742

Open

1 task
