"Current" recall is not actually current but cumulative so far #108

daverigby · 2024-06-14T16:48:19Z

We now report the current latency and recall during the run phase - e.g.

Performing Run phase  6990/10000 ━━━━━━━━━━━━━━━━━━━━━━━━━━━╸━━━━━━━━━━━━  70% elapsed: 0:01:19 remaining: 00:34 Current latency: p5=41ms, p95=52ms | Current recall: p50=0.99, p5=0.97

The current latency figures come from locust itself, which calcualates this from the last 10s worth of samples - so it is indeed a "current" metric.

However the recall figures are recorded by VSB itself from a single HdrHistogram instance, and hence they are the cumulative recall values so far.

This is misleading, as they won't readily adopt to changes to the performance of recall during the experiment.

To address this we need to do something similar to _cache_response_times - retain N copies of the histogram from the last N seconds and use the difference between T=now and T-10 to display them.

The text was updated successfully, but these errors were encountered:

Add progress bars for each of the main phases of an experiment - Setup, Populate and Run. For Populate and Run we require the total number of records / queries to show a useful end progress so far. For the Run phase we include the current latency and recall(*) values, these require additional metrics of how many records have been upserted so far (note that we generally upsert in batches, so the existing number of Population requests is not sufficient. (*) For recall we cannot calculate the current metric (last 10s), as we only have a single histogram accumulating the results - instead this shows the overall recall so far. There's an improvement raised to fix this (#108).

Add progress bars for each of the main phases of an experiment - Setup, Populate and Run. For Populate and Run we require the total number of records / queries to show a useful end progress so far. For the Populate phase we include the current rate of upsert. For the Run phase we include the current latency and recall(*) values, these require additional metrics of how many records have been upserted so far (note that we generally upsert in batches, so the existing number of Population requests is not sufficient. (*) For recall we cannot calculate the current metric (last 10s), as we only have a single histogram accumulating the results - instead this shows the overall recall so far. There's an improvement raised to fix this (#108).

daverigby added the bug Something isn't working label Jun 14, 2024

daverigby added this to the Phase 2: More workloads, more databases milestone Jun 14, 2024

daverigby mentioned this issue Jun 14, 2024

Add progress bars for main phases #109

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

"Current" recall is not actually current but cumulative so far #108

"Current" recall is not actually current but cumulative so far #108

daverigby commented Jun 14, 2024

"Current" recall is not actually current but cumulative so far #108

"Current" recall is not actually current but cumulative so far #108

Comments

daverigby commented Jun 14, 2024