Restore: rework progress aggregation #4225

Michal-Leszczynski · 2025-01-23T14:11:54Z

This PR does the following:

reworks progress aggregation to rely on iter.Seq instead of direct DB access, which in turn allows for adding unit tests for progress aggregation
fills the RestoredBytes and RestoreDuration introduced in Swagger: scylla-manager, move towards just restored bytes #4214

Note, the RestoredBytes and RestoreDuration need to be added to the sctool progress display as a part of a separate PR.

This column might be interesting for backup which performs deduplication, but it does not make any sense for restore. It wasn't even a part of restore progress API, so it's safe to remove it.

This column wasn't needed with the sync load&stream Scylla API (restored is equal to either 0 or downloaded), but it is cleaner to have it and it can be useful for native restore Scylla API.

Followup to the e112747. In the past we treated RestoredStartedAt/RestoreCompletedAt as the time frame for l&s, but we should look at it more inclusively, so as the time frame of restoring bytes, which for Rclone API + l&s approach starts with the download.

Previously, the code aggregating restore progress and the code querying the run progress for aggregation were tightly tangled. This resulted in difficulties in writing unit tests and a more confusing code overall. This commit changes progress aggregation to use iterator over run progresses, which can be easily mocked in unit tests.

Followup to the e112747.

It is important to specify now when testing.

Thanks to the previous commit, it is now possible to add unit tests for progress aggregation!

VAveryanov8

Nice usage of iterators and tests!

Michal-Leszczynski · 2025-01-27T10:16:02Z

Thanks @VAveryanov8!
I'm just wondering, which approach to iterator that can encounter an error do you prefer?

The one from this PR - iterator stores the error internally and it needs to be checked manually after iteration (similar to gocql iterator). It allows to hide the error handling from other parts of the code (might be good or bad).
Instead of iter.Seq[K], use iter.Seq2[K, error] which allows for handling errors by the underlying code.

I think that the second is probably better in general, but the first approach seamed more tailored to this PR use case.

VAveryanov8 · 2025-01-27T10:30:11Z

The one from this PR - iterator stores the error internally and it needs to be checked manually after iteration (similar to gocql iterator). It allows to hide the error handling from other parts of the code (might be good or bad).
Instead of iter.Seq[K], use iter.Seq2[K, error] which allows for handling errors by the underlying code.
I think that the second is probably better in general, but the first approach seamed more tailored to this PR use case.

Good question! I don't have a strong opinion on this - iterators are quite new to the language and there are not that many projects that already using them.
In general second approach is more flexible (err can be skipped and etc), but in this particular case I think what you did looks better.

Michal-Leszczynski added 7 commits January 23, 2025 15:04

refactor(schema): remove unused restore_run_progress skipped column

bea4928

This column might be interesting for backup which performs deduplication, but it does not make any sense for restore. It wasn't even a part of restore progress API, so it's safe to remove it.

feat(schema): add restore_run_progress restored column

925b932

This column wasn't needed with the sync load&stream Scylla API (restored is equal to either 0 or downloaded), but it is cleaner to have it and it can be useful for native restore Scylla API.

feat(restore): fill HostProgress RestoredBytes/Duration

284fb80

Followup to the e112747.

feat(restore): allow for specifying now in progress aggregation

5d19e3f

It is important to specify now when testing.

feat(restore): add unit tests for progress aggregation

b4cb53d

Thanks to the previous commit, it is now possible to add unit tests for progress aggregation!

Michal-Leszczynski force-pushed the ml/rework-progress branch from eba968f to b4cb53d Compare January 23, 2025 15:32

Michal-Leszczynski marked this pull request as ready for review January 24, 2025 09:59

Michal-Leszczynski requested a review from karol-kokoszka as a code owner January 24, 2025 09:59

Michal-Leszczynski requested a review from VAveryanov8 January 24, 2025 09:59

VAveryanov8 approved these changes Jan 27, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Restore: rework progress aggregation #4225

Restore: rework progress aggregation #4225

Michal-Leszczynski commented Jan 23, 2025

VAveryanov8 left a comment

Michal-Leszczynski commented Jan 27, 2025

VAveryanov8 commented Jan 27, 2025

Restore: rework progress aggregation #4225

Are you sure you want to change the base?

Restore: rework progress aggregation #4225

Conversation

Michal-Leszczynski commented Jan 23, 2025

VAveryanov8 left a comment

Choose a reason for hiding this comment

Michal-Leszczynski commented Jan 27, 2025

VAveryanov8 commented Jan 27, 2025