I'll throw in a bit more context so this can work as a good-first-issue.
Hydro-serving can shadow data across multiple model variants in a serving application.
For example, a 5% canary test can look like this:
Application ‘A’
|
| - Variant 1: model ‘a’ version 1. weight=95
| - Variant 2: model ‘a’ version 2. weight=5
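
Concretely, the weights mean that roughly 95% of responses come from version 1 and 5% from version 2. Here is a minimal sketch of such a weighted draw (the variant names and dict layout are made up for illustration, not Hydro-serving's actual data model):

```python
import random
from collections import Counter

# Hypothetical representation of the two variants above.
variants = [
    {"name": "model 'a' version 1", "weight": 95},
    {"name": "model 'a' version 2", "weight": 5},
]

def pick_variant(variants):
    """Pick one variant at random, proportionally to its weight."""
    return random.choices(variants, weights=[v["weight"] for v in variants], k=1)[0]

# Over many requests the traffic split approaches 95/5.
print(Counter(pick_variant(variants)["name"] for _ in range(10_000)))
```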
How shadowing is done:
Whenever a serving application endpoint receives a request, it shadows the received data to all model variants for processing.
Only after all model variants have produced their outputs do we randomly choose the output of one of them, according to the weights associated with each model variant.
Thus, we shadow incoming data to all model variants but return the output of only a single one.
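
A rough sketch of this current flow, assuming each variant call is an async task (the helper names, simulated latencies, and dict layout below are hypothetical, not Hydro-serving's code):

```python
import asyncio
import random

async def run_variant(name, latency_s):
    # Stand-in for sending the shadowed request to one model variant.
    await asyncio.sleep(latency_s)
    return f"output of {name}"

async def serve_request_current(variants):
    # Shadow the request to every variant and wait for ALL of them to finish.
    outputs = await asyncio.gather(
        *(run_variant(v["name"], v["latency_s"]) for v in variants)
    )
    # Only then pick one output at random, proportionally to the weights.
    return random.choices(outputs, weights=[v["weight"] for v in variants], k=1)[0]

variants = [
    {"name": "model 'a' version 1", "weight": 95, "latency_s": 0.01},
    {"name": "model 'a' version 2", "weight": 5,  "latency_s": 0.20},
]
# The response always takes ~0.20s here, because gather waits for the slowest variant.
print(asyncio.run(serve_request_current(variants)))
```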
Since we wait for all model variants to finish their computations, the reported latency is wrong: it is the maximum latency across all model variants.
To improve throughput and measure latency correctly for each model variant, we need to stop waiting for all model variants to produce their outputs and instead choose which model's output will be returned before the outputs are computed.
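
A sketch of the proposed flow, under the same hypothetical setup as above: the winner is chosen up front, the response goes out as soon as the winner finishes, and the remaining shadow tasks complete in the background so their outputs and per-variant latencies can still be recorded.

```python
import asyncio
import random

async def run_variant(name, latency_s):
    # Stand-in for sending the shadowed request to one model variant.
    await asyncio.sleep(latency_s)
    return f"output of {name}"

async def serve_request_proposed(variants):
    # 1. Decide which variant's output will be returned BEFORE anything runs.
    winner = random.choices(variants, weights=[v["weight"] for v in variants], k=1)[0]
    # 2. Still shadow the request to every variant.
    tasks = {
        v["name"]: asyncio.create_task(run_variant(v["name"], v["latency_s"]))
        for v in variants
    }
    # 3. Await only the winner: request latency is now the winner's latency,
    #    not the maximum over all variants.
    result = await tasks[winner["name"]]
    print("responded with:", result)  # the response can go out at this point
    # 4. The other shadow tasks keep running; we await them here only so this
    #    stand-alone demo exits cleanly.
    await asyncio.gather(*tasks.values())
    return result

variants = [
    {"name": "model 'a' version 1", "weight": 95, "latency_s": 0.01},
    {"name": "model 'a' version 2", "weight": 5,  "latency_s": 0.20},
]
asyncio.run(serve_request_proposed(variants))
```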
Improve A/B execution by choosing the return value BEFORE execution happens.