
[DO NOT MERGE] Review of already merged code #1

Open · wants to merge 24 commits into base: empty-branch

Conversation

@arcusfelis (Collaborator) commented Sep 5, 2024

We have to do the coding in the main branch, so that gh-pages get built.

Still, we could do a basic code review.

Here is a diff PR.

You can review it, just do not merge it, please.

So, here is how it works.
On gh-pages, 404.html handles any not-found pages by default.
In 404.html we have the code for the control interface. The control interface lets you install/uninstall the worker and view logs from the worker.

404.html is hit the first time a user follows a link to a file in the archive. It installs the service worker and reloads the page.
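As a rough illustration, the bootstrap in 404.html could look something like the sketch below (the worker filename sw.js and the two function names are placeholders, not the actual code in this repo):

```js
// Runs from 404.html: if no service worker controls this page yet,
// register one and reload so the worker can serve the requested file.
async function installWorkerAndReload() {
  if (!('serviceWorker' in navigator)) {
    document.body.textContent = 'Service workers are not supported in this browser.';
    return;
  }
  // "sw.js" is a placeholder name for the worker script.
  await navigator.serviceWorker.register('sw.js');
  // Wait until the worker is active, then retry the original URL.
  await navigator.serviceWorker.ready;
  location.reload();
}

// Control interface: "uninstall" is just unregistering the worker again.
async function uninstallWorker() {
  for (const registration of await navigator.serviceWorker.getRegistrations()) {
    await registration.unregister();
  }
}
```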

Once the service worker is installed, it handles the fetch event.

The fetch event lets the worker handle requests to the domain, in a way similar to how a proxy works.
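The basic proxy-like shape is roughly this (resolveArchive and serveFromArchive are hypothetical helper names used only for the sketch):

```js
// sw.js: the worker sees every request made by pages in its scope.
self.addEventListener('fetch', (event) => {
  // resolveArchive() is a hypothetical helper that maps a request URL to
  // an archive URL plus a file path inside that archive, or returns null.
  const target = resolveArchive(new URL(event.request.url));
  if (target) {
    // Answer from the cached archive instead of going to the network.
    event.respondWith(serveFromArchive(target.archiveUrl, target.filePath));
  }
  // If we do not call respondWith(), the browser asks GitHub Pages as usual.
});
```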
So, if we see in the fetch handler that the user wants to read something from the archive, we do the following (sketches of the main steps follow the list):

  • download the tar.gz archive.
  • use DecompressionStream to decompress the gzip layer. It is a standard from 2023, just in time for us :)
  • since the archive is a tar.gz, use tar-stream (or the tar-web repack of it) to read the tar.
  • We could just put the files from the tar into the Service Worker Cache, which has put and match methods...
  • ...except the version that uses tar-web to read file contents and put them into the cache takes 25 seconds... oops.
  • So we use tee to make two readable streams instead of one.
  • One stream is used to collect filenames, file sizes and file offsets in the tar; the resulting JSON is stored.
  • The other stream is used to get the tar blob as binary data, but we split that blob into 1-megabyte parts, so we do not suddenly allocate 170 MB of memory.
  • We use the Cache API to write the tar parts and files.json, so around 20 writes to the cache instead of 9000 :)
  • In the fetch handler we decide whether the file is in the archive, get its size and offset, figure out which parts we need, and read the data. We then create a Response with that data. If the file is supposed to be in the archive but is missing, we answer "oops, not found"; otherwise we just let the browser ask GitHub Pages and ignore the request in the worker.
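For illustration, the install-time pipeline could be sketched roughly like this (the cache name "archives", the ?part=/?meta= key scheme and the indexTarEntries helper are assumptions made for the sketch; the real code uses tar-stream/tar-web for the indexing part):

```js
const PART_SIZE = 1024 * 1024; // split the decompressed tar into 1 MB parts

async function cacheArchive(archiveUrl) {
  const response = await fetch(archiveUrl);
  // Undo the gzip layer with the built-in DecompressionStream.
  const tarStream = response.body.pipeThrough(new DecompressionStream('gzip'));
  // tee(): one branch builds the file index, the other stores the raw bytes.
  const [indexBranch, dataBranch] = tarStream.tee();

  const cache = await caches.open('archives');

  // Branch 1: build a map from file name to { offset, size } in the tar.
  // indexTarEntries() stands in for the tar-stream / tar-web based reader.
  const indexPromise = indexTarEntries(indexBranch);

  // Branch 2: cut the tar into fixed-size parts and write each part to the cache.
  const partsPromise = (async () => {
    let pending = new Blob([]);
    let partNumber = 0;
    const writePart = (blob) =>
      cache.put(`${archiveUrl}?part=${partNumber++}`, new Response(blob));
    const reader = dataBranch.getReader();
    for (;;) {
      const { value, done } = await reader.read();
      if (done) break;
      pending = new Blob([pending, value]);
      while (pending.size >= PART_SIZE) {
        await writePart(pending.slice(0, PART_SIZE));
        pending = pending.slice(PART_SIZE);
      }
    }
    if (pending.size > 0) await writePart(pending);
  })();

  const [index] = await Promise.all([indexPromise, partsPromise]);
  // One more write: the JSON index of filenames, sizes and offsets.
  await cache.put(
    `${archiveUrl}?meta=files.json`,
    new Response(JSON.stringify(index), { headers: { 'Content-Type': 'application/json' } })
  );
}
```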
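And the read side inside the fetch handler might look roughly like this (same assumed PART_SIZE and cache layout as the sketch above; the 404 text mirrors the description):

```js
async function serveFromArchive(archiveUrl, filePath) {
  const cache = await caches.open('archives');

  // Look the file up in the stored index of { offset, size } entries.
  const indexResponse = await cache.match(`${archiveUrl}?meta=files.json`);
  const index = await indexResponse.json();
  const entry = index[filePath];
  if (!entry) {
    return new Response('oops, not found', { status: 404 });
  }

  // Work out which fixed-size parts cover [offset, offset + size).
  const firstPart = Math.floor(entry.offset / PART_SIZE);
  const lastPart = Math.floor((entry.offset + entry.size - 1) / PART_SIZE);

  const blobs = [];
  for (let part = firstPart; part <= lastPart; part++) {
    const partResponse = await cache.match(`${archiveUrl}?part=${part}`);
    blobs.push(await partResponse.blob());
  }

  // Cut the file's exact byte range out of the concatenated parts.
  const startInFirstPart = entry.offset - firstPart * PART_SIZE;
  const fileBlob = new Blob(blobs).slice(startInFirstPart, startInFirstPart + entry.size);
  // The real code would also pick a Content-Type based on the file extension.
  return new Response(fileBlob);
}
```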

@NelsonVides left a comment

Amazing, I didn't know you could do so many things on the browser. Very well done, readable code and everything! 😄

@NelsonVides

Oh and that PR description, gorgeous!

NelsonVides added a commit to esl/MongooseIM that referenced this pull request Sep 6, 2024
Optimize (compress) ct_report logs on CI and read them using a service worker in Browser

This PR addresses MIM-2283 "Optimize (compress) ct_report logs somehow on CI"

Proposed changes include:

  • Compress files into tar.gz.
  • Use https://github.com/esl/html-zip-reader to read the archive and serve it using a service worker.
  • The logic for the service worker can be reviewed in this PR: esl/html-zip-reader#1
  • We can maybe stop using s3-parallel-put (unless it is smaller than the awscli tools), but in a separate PR :)