Provisioning of tasks into the database #44

tgeoghegan · 2022-03-28T19:59:47Z

David raised an interesting question in #37: should an aggregator instance receive, from configuration, the list of task IDs for which it handles requests and work, or should it just look up all the tasks defined in the database?

Either way, this raises the question: how do the tasks get written into the database to begin with? And how do the secrets (like HPKE private keys) get into Kubernetes secrets? We will need tools and a process for doing this.

tgeoghegan · 2022-03-28T20:03:37Z

My preference would be to avoid doing this in something like the old prio-server deploy-tool. Principally, I think we should avoid ever having to expose secrets like this to operator machines. Maybe we could stand up a service that acts as the Divvi Up control plane, and creating a new task would be a matter of making an authenticated API request asking it to make a new task and perform the necessary key generation. Then we could either have human operators do that, or rig up something in GitHub Actions that would automatically make the task provisioning requests based on a config file checked into janus-ops.

This control plane service could eventually grow into the service that handles customer accounts and self-service task onboarding.

divergentdave · 2022-06-16T16:08:54Z

An alternate intermediate solution would be to write a standalone binary that provisions a task in the database, but then run it inside the cluster as a Job. We could write a template job manifest file, fill in arguments with parameters like TaskId, and then apply the completed manifest by hand. This would be easy to get up and running, and still generate and handle secrets on the cluster.
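A minimal sketch of the template-filling step, assuming a plain-text Job manifest template. The image name, the `provision-task` container name, and the `--task-id` and `--role` flags are hypothetical placeholders, not names from Janus. Only non-secret parameters are substituted, so secrets would still be generated by the binary running in the cluster.

```python
import hashlib
from string import Template

# Hypothetical Job manifest template. The image, container name, and flags
# are illustrative placeholders, not an existing Janus interface.
JOB_TEMPLATE = Template("""\
apiVersion: batch/v1
kind: Job
metadata:
  name: provision-task-$job_suffix
spec:
  backoffLimit: 0
  template:
    spec:
      restartPolicy: Never
      containers:
        - name: provision-task
          image: $image
          args: ["--task-id", "$task_id", "--role", "$role"]
""")


def render_job_manifest(task_id: str, role: str, image: str) -> str:
    """Fill in the non-secret task parameters of the Job manifest."""
    # Kubernetes object names must be lowercase alphanumerics, '-' or '.',
    # so derive the name suffix from a hash of the task ID instead of
    # embedding the ID verbatim.
    job_suffix = hashlib.sha256(task_id.encode()).hexdigest()[:16]
    return JOB_TEMPLATE.substitute(
        job_suffix=job_suffix, image=image, task_id=task_id, role=role
    )


if __name__ == "__main__":
    print(render_job_manifest(
        "example-task-id", "leader", "example.invalid/provision-task:latest"
    ))
```

The printed manifest could then be reviewed and applied by hand, for example by piping it to `kubectl apply -f -`.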

branlwyd · 2022-06-16T16:16:38Z

I think anything we do that touches real user data should use such a strategy (or, in general, should use a strategy where we generate secrets in the cluster).

Initial interop testing & other deployments that only process fake or otherwise-non-sensitive data can generate secrets anywhere, including on an operator workstation, IMO.

tgeoghegan · 2022-12-14T20:53:39Z

Another responsibility of this control plane service would be coordinating the other aggregator for a task. Our API for provisioning a task with Janus as the leader would also allow subscribers to configure a helper aggregator, perhaps by specifying an endpoint URL. The helper would then need to support an API for task provisioning so that we could send it the task parameters as well as secret values like the VDAF verify key and an authentication token.
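A rough sketch of what such a provisioning request to the helper might carry, assuming a JSON body. Every field name here is hypothetical, and the 16-byte verify key length is only a placeholder, since the required length depends on the VDAF in use.

```python
import base64
import json
import secrets


def build_helper_provisioning_request(
    task_id: str, leader_endpoint: str, verify_key_len: int = 16
) -> dict:
    """Assemble a hypothetical task provisioning request body for the helper."""
    # Fresh random secrets; in the design discussed here these would be
    # generated by the control plane rather than on an operator machine.
    verify_key = secrets.token_bytes(verify_key_len)
    return {
        "task_id": task_id,
        "leader_endpoint": leader_endpoint,
        # Secret values that both aggregators need to share.
        "vdaf_verify_key": base64.urlsafe_b64encode(verify_key)
        .rstrip(b"=")
        .decode(),
        "aggregator_auth_token": secrets.token_urlsafe(32),
    }


if __name__ == "__main__":
    body = build_helper_provisioning_request(
        "example-task-id", "https://leader.example.invalid/"
    )
    # Show only the field names; the values are secrets and should not be logged.
    print(json.dumps(sorted(body)))
```

Because the body contains secret values, it would have to travel over an authenticated, encrypted channel to the helper's provisioning endpoint.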

tgeoghegan · 2023-06-16T03:57:16Z

The ideas here have been refined into the proposal in #1486. I've filed issues in the production readiness milestone tracking specific implementation details. This issue doesn't track anything actionable and contains outdated discussion, so I'm closing it.

tgeoghegan mentioned this issue Nov 18, 2022

How to provision new tasks on the fly #760

Closed

tgeoghegan added this to the Production readiness milestone Jan 23, 2023

tgeoghegan mentioned this issue Apr 20, 2023

Revisit request authentication #472

Closed

tgeoghegan closed this as completed Jun 16, 2023

tgeoghegan added the byohelper label (needed for BYOHelper and self-service task provisioning, #1486) Jun 16, 2023
