[s3 cache] Faster download with parallelism #5604

Open
bpaquet wants to merge 1 commit into master from parallel_download
Conversation

@bpaquet (Contributor) commented Dec 17, 2024

No description provided.

@bpaquet force-pushed the parallel_download branch 2 times, most recently from e47360e to 8cee779 on December 17, 2024 at 21:06
@bpaquet changed the title from "Implement a parallel download on S3" to "[s3 cache] Faster download with parallelism" on Dec 17, 2024
@bpaquet (Contributor, Author) commented Dec 17, 2024

I will deploy this PR internally to check how it behaves at scale, but a review is always welcome.

@bpaquet marked this pull request as ready for review on December 17, 2024 at 21:11
@tonistiigi (Member) left a comment:

So how much memory does this use?

I wonder if instead of trying to get this functionality behind a ReadAt call (where it really does not belong, because ReadAt is only for a specific range), this return value could instead implement https://pkg.go.dev/io#WriterTo, which could be detected on the caller side where it writes the blob to disk (maybe even via WriteAt()).
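For context, the caller-side detection being suggested might look roughly like the sketch below; writeBlob, dest, and blob are hypothetical names rather than buildkit's actual code, and io.Copy already performs this WriterTo type assertion internally.

package blobcache

import (
	"io"
	"os"
)

// writeBlob is a hypothetical helper on the consuming side: if the blob
// reader also implements io.WriterTo, let it drive the write (for example by
// streaming parallel part downloads straight to the destination file);
// otherwise fall back to a plain sequential copy.
func writeBlob(dest *os.File, blob io.Reader) (int64, error) {
	if wt, ok := blob.(io.WriterTo); ok {
		return wt.WriteTo(dest)
	}
	return io.Copy(dest, blob)
}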

	input *S3DownloaderInput
}

type S3DownloaderInput struct {
Member: These don't need to be public if only used by private functions.

Author: Done

func newDownloader(input *S3DownloaderInput) *S3Downloader {
	if input.Parallelism == 0 {
		input.Parallelism = 8
Member: You can test this, but for registry layers, our default is 4.

Author: These defaults are not really used; they are overridden in s3.go, where the default value is 4. I changed them anyway.

@bpaquet (Contributor, Author) commented Dec 18, 2024

> So how much memory does this use?
>
> I wonder if instead of trying to get this functionality behind a ReadAt call (where it really does not belong, because ReadAt is only for a specific range), this return value could instead implement https://pkg.go.dev/io#WriterTo, which could be detected on the caller side where it writes the blob to disk (maybe even via WriteAt()).

If we assume the solver reads faster than we can download from S3, we will use parallelism * part size, so 20 MB by default.

We could also build a wrapper to "convert" the WriterAt from the SDK into a ReaderAt, but that seems more complicated to me than this solution, especially from a memory allocation point of view. We can still consider it if you think it's better.
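As a back-of-envelope check on that bound, using the defaults implied in this thread (parallelism 4 and roughly 5 MiB parts; the constants below are illustrative, not taken from the PR):

package blobcache

// Rough upper bound on buffered memory per blob being read, assuming only
// in-flight chunks are kept in memory.
const (
	defaultParallelism   = 4
	defaultPartSizeBytes = 5 * 1024 * 1024
	maxBufferedBlobBytes = defaultParallelism * defaultPartSizeBytes // ~20 MiB
)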

if downloadPartSizeInt <= 0 {
	return Config{}, errors.Errorf("download_part_size must be a positive integer")
}
downloadParallism = downloadPartSizeInt

Suggested change:
-downloadParallism = downloadPartSizeInt
+downloadPartSize = downloadPartSizeInt

Right?

Comment on lines +133 to +136
for i := 0; i < r.totalChunk; i++ {
	r.inChan <- i
}
for k := 0; k < r.input.Parallelism; k++ {

Nit: since Go 1.22, you can use the shorter range-over-integer variant:

Suggested change:
-for i := 0; i < r.totalChunk; i++ {
-	r.inChan <- i
-}
-for k := 0; k < r.input.Parallelism; k++ {
+for i := range r.totalChunk {
+	r.inChan <- i
+}
+for range r.input.Parallelism {

See this playground link
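Since the playground link itself is not reproduced here, a minimal standalone example of the Go 1.22 range-over-integer form (the loop bodies and variable names are illustrative only):

package main

import "fmt"

func main() {
	totalChunk := 3
	parallelism := 2

	// Equivalent to: for i := 0; i < totalChunk; i++
	for i := range totalChunk {
		fmt.Println("queue chunk", i)
	}
	// Equivalent to: for k := 0; k < parallelism; k++ when k is unused
	for range parallelism {
		fmt.Println("start worker")
	}
}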

Comment on lines +178 to +187
for {
	if r.chunks[r.currentChunk].done {
		break
	}
	err := <-r.outChan
	if err != nil {
		r.Close()
		return 0, err
	}
}

I would most probably replace done with an unbuffered chan struct{} and go with (untested):

Suggested change:
-for {
-	if r.chunks[r.currentChunk].done {
-		break
-	}
-	err := <-r.outChan
-	if err != nil {
-		r.Close()
-		return 0, err
-	}
-}
+readyOrConsume:
+for {
+	select {
+	case <-r.chunks[r.currentChunk].done:
+		// this chunk is ready, we can process it
+		break readyOrConsume
+	case err := <-r.outChan:
+		// a download reported an error; propagate it
+		if err != nil {
+			r.Close()
+			return 0, err
+		}
+	}
+}

And close the done channel from the downloadChunk function before returning.

Which IMO looks more idiomatic and removes the race on done (even if I assume this particular race wouldn't have any side effect).
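A sketch of the "close done before returning" part of this suggestion; chunkStatus, downloadChunk, and fetch here are illustrative stand-ins, not the PR's actual code:

package blobcache

// The download goroutine closes done once the chunk's buffer is filled, which
// unblocks the select above; a closed channel is always ready to receive
// from, so later reads of the same chunk do not block either.
type chunkStatus struct {
	buffer []byte
	done   chan struct{}
}

func downloadChunk(cs *chunkStatus, fetch func() ([]byte, error)) error {
	data, err := fetch()
	if err != nil {
		return err // the worker reports this on the error channel instead
	}
	cs.buffer = data
	close(cs.done) // signal "this chunk is ready"
	return nil
}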

if err != nil {
	return err
}
n, err := io.Copy(&r.chunks[chunk], resp.Body)


Nit: for the sake of performance / brevity, I would suggest going with a one-time copy of the result of io.ReadAll(resp.Body) into r.chunks[chunk]. This would avoid the need to reference r.chunks[chunk] to get a valid io.Writer implementation, along with the need for chunkStatus to even implement io.Writer. Also that would remove the need for a writeOffset.

No strong opinion, however, if you think it is clearer as it is.
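For illustration, the io.ReadAll variant could look roughly like this; chunkPart and fillChunk are simplified stand-ins for the PR's chunkStatus and download code, and whether the bytes are assigned or copied into a pre-allocated buffer is an implementation detail:

package blobcache

import "io"

// chunkPart is an illustrative stand-in for the PR's chunkStatus.
type chunkPart struct {
	buffer []byte
}

// fillChunk reads the whole part body once and keeps the resulting slice, so
// the chunk type no longer needs to implement io.Writer or track a write
// offset.
func fillChunk(part *chunkPart, body io.Reader) error {
	data, err := io.ReadAll(body)
	if err != nil {
		return err
	}
	part.buffer = data // a single assignment replaces the io.Copy into a Writer
	return nil
}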

size   int64
start  int64
buffer []byte
io.Writer

Quick note: this actually declares an anonymous struct field of type io.Writer, which I believe is not what you're looking for, since this field's Write method is shadowed by that of chunkStatus below anyway.

If what you're trying to do is to make explicit, and enforce, the fact that chunkStatus implements io.Writer, the more idiomatic way is to add this line somewhere in the file (typically right below the struct declaration itself):

var _ io.Writer = &chunkStatus{}

But Go's interface implementation being implicit, this is not even necessary.
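As a self-contained illustration of the difference (withEmbedded and chunkLike are stand-ins, not the PR's types):

package main

import (
	"fmt"
	"io"
)

// withEmbedded shows what embedding io.Writer actually does: it adds an
// anonymous field of interface type, nil unless explicitly set, and it is
// shadowed as soon as the outer type defines its own Write method.
type withEmbedded struct {
	io.Writer
}

// chunkLike implements io.Writer directly, no embedding needed.
type chunkLike struct {
	buf []byte
}

func (c *chunkLike) Write(p []byte) (int, error) {
	c.buf = append(c.buf, p...)
	return len(p), nil
}

// Optional compile-time check that chunkLike satisfies io.Writer.
var _ io.Writer = &chunkLike{}

func main() {
	var w withEmbedded
	fmt.Println(w.Writer == nil) // true: calling w.Write here would panic

	c := &chunkLike{}
	io.WriteString(c, "hello")
	fmt.Println(string(c.buf)) // hello
}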

}

type s3Reader struct {
	content.ReaderAt

Same as for io.Writer above.

@@ -141,20 +145,48 @@ func getConfig(attrs map[string]string) (Config, error) {
uploadParallelism = uploadParallelismInt
}

downloadParallism := 4

Also ultra nitpick:

Suggested change:
-downloadParallism := 4
+downloadParallelism := 4
