Retention fix distributed #770

nikhilsinhaparseable · 2024-04-20T03:29:29Z

fix for retention for distributed deployment
fix for stream info api

Fixes #768

Description

This PR has:

been tested to ensure log ingestion and log query works.
added comments explaining the "why" and the intent of the code wherever would not be obvious for an unfamiliar reader.
added documentation for new or modified features or behaviors.

… mismatch; data directory creation should not happen in case of deployment mismatch; staging should be overwritten in case of new staging

… mismatch; data directory creation should not happen in case of deployment mismatch; staging should be overwritten in case of new staging; default staging and data directory should not be created if env var has different path

…en staging and remote

Co-authored-by: Nitish Tiwari <[email protected]>

…blehq#629)

Bumps [h2](https://github.com/hyperium/h2) from 0.3.17 to 0.3.24. - [Release notes](https://github.com/hyperium/h2/releases) - [Changelog](https://github.com/hyperium/h2/blob/v0.3.24/CHANGELOG.md) - [Commits](hyperium/h2@v0.3.17...v0.3.24) --- updated-dependencies: - dependency-name: h2 dependency-type: indirect ... Signed-off-by: dependabot[bot] <[email protected]> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

…ehq#623) fixes parseablehq#615

Standardise the error message and also use the new logg.ing domain for short URLs.

…eablehq#634) Add P_MODE with options `ingest`, `query` and `all`. The default mode is `all`. More changes are still required for both modes to work well; they will be added in subsequent PRs. Fixes parseablehq#617
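A minimal sketch of how a `P_MODE` value could be mapped to a mode with `all` as the default. The `Mode` enum, `resolve_mode` function, the case-insensitive matching and the error text are assumptions for illustration, not the code from this PR.

```rust
use std::str::FromStr;

/// Hypothetical server mode type; only the three option names come from the PR text.
#[derive(Debug, Clone, Copy, PartialEq, Eq, Default)]
pub enum Mode {
    Ingest,
    Query,
    /// `all` is the default mode.
    #[default]
    All,
}

impl FromStr for Mode {
    type Err = String;

    fn from_str(s: &str) -> Result<Self, Self::Err> {
        // Assumption: values are matched case-insensitively after trimming.
        match s.trim().to_ascii_lowercase().as_str() {
            "ingest" => Ok(Mode::Ingest),
            "query" => Ok(Mode::Query),
            "all" => Ok(Mode::All),
            other => Err(format!("invalid P_MODE value: {other}")),
        }
    }
}

/// Resolve the mode from an optional raw value,
/// for example `std::env::var("P_MODE").ok().as_deref()`.
pub fn resolve_mode(raw: Option<&str>) -> Result<Mode, String> {
    match raw {
        // Variable not set: fall back to the default mode.
        None => Ok(Mode::default()),
        Some(value) => value.parse(),
    }
}
```

Taking the raw value as a parameter, rather than reading the environment inside the function, keeps the parsing easy to test.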

Create the parquet file by grouping all arrow files (in staging) for the duration provided in the env variable P_STORAGE_UPLOAD_INTERVAL. Also check that the arrow files vector is not empty, then sort the arrow files and create the key for the parquet file from the last file in the sorted arrow files vector. Fixes parseablehq#616
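The "check non-empty, sort, take the last file" step can be sketched as below. The function name and the choice to derive the key by swapping the file extension are assumptions for illustration; the actual key format is not shown in this PR text.

```rust
use std::path::PathBuf;

/// Hypothetical helper: pick the parquet path for one group of arrow files.
/// Returns `None` when the group is empty, so no parquet file is created.
pub fn parquet_path_for_group(mut arrow_files: Vec<PathBuf>) -> Option<PathBuf> {
    if arrow_files.is_empty() {
        return None;
    }
    // Sort so that the "last" file is well defined regardless of listing order.
    arrow_files.sort();
    let last = arrow_files.last()?;
    // Assumption: the parquet key reuses the last arrow file's name
    // with its extension replaced.
    Some(last.with_extension("parquet"))
}
```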

Previously, in Query Mode, all log stream endpoints were allowed. It is better that only the ingester is allowed to create streams. Fixes parseablehq#641

Signed-off-by: Nitish Tiwari <[email protected]>

Earlier, a separate scheduler was initialized for each stream at load time or whenever a retention period was set. Now, a single scheduler is initialized, which checks the retention config of all the streams and performs the retention cleanup. Fixes parseablehq#636
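One pass of such a single scheduler could look roughly like the sketch below. The `Retention` struct, the day-number representation of partitions and the `day < cutoff` boundary are assumptions for illustration, not the implementation in this PR.

```rust
use std::collections::HashMap;

/// Hypothetical per-stream retention setting: keep data for this many days.
pub struct Retention {
    pub days: u32,
}

/// One pass of a single shared retention task over all streams.
/// `today` and the partition values are day numbers counted from an arbitrary epoch.
/// Returns the (stream, day) partitions that should be cleaned up, sorted.
pub fn expired_partitions(
    configs: &HashMap<String, Retention>,
    partitions: &HashMap<String, Vec<u32>>,
    today: u32,
) -> Vec<(String, u32)> {
    let mut expired = Vec::new();
    for (stream, retention) in configs {
        // Anything strictly older than the cutoff is considered expired.
        let cutoff = today.saturating_sub(retention.days);
        if let Some(days) = partitions.get(stream) {
            for &day in days {
                if day < cutoff {
                    expired.push((stream.clone(), day));
                }
            }
        }
    }
    // Sort for a deterministic result, since HashMap iteration order is unspecified.
    expired.sort();
    expired
}
```

Streams without a retention config never appear in `configs`, so the pass simply skips them.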

…blehq#653) added const of 60 secs to be used for local to storage sync fixes parseablehq#651

fixes: parseablehq#312

fixes parseablehq#650

…parseablehq#661) fixes parseablehq#654

fixes parseablehq#660

includes console release v0.4.0

It is better that users generate it themselves as needed. Signed-off-by: Nitish Tiwari <[email protected]>

…snapshot" (parseablehq#666) Reverts parseablehq#661 because with this change we're backward incompatible with older versions where .stream.json doesn't contain retention field. This reverts commit 121bf01.

Signed-off-by: Nitish Tiwari <[email protected]>

Changes done in this PR:
1. Adds the first_event_at property to the stats API (taken from the min value of p_timestamp of the first parquet file listed in the first manifest file from the snapshot in stream.json) and writes it to the stream.json file when get stats is requested.
2. Updates first_event_at in case of retention.
Fixes: parseablehq#587

…ablehq#730)

delete ingester if offline

* fix: stats response * fix: s3 get objects Refactor object storage to filter objects by starts_with_pattern

…#734)
* Add node_url field to Cli struct and update related code
* Update required flag for Node URL in CLI
* updated logic to have server address (ip:port) in parquet file name similar to other json files
Co-authored-by: Nikhil Sinha <[email protected]>

* remove staging query from the query result (for distributed)
* Refactor get_schema method to handle missing schema in object storage

Bumps [h2](https://github.com/hyperium/h2) from 0.3.24 to 0.3.26. - [Release notes](https://github.com/hyperium/h2/releases) - [Changelog](https://github.com/hyperium/h2/blob/v0.3.26/CHANGELOG.md) - [Commits](hyperium/h2@v0.3.24...v0.3.26) --- updated-dependencies: - dependency-name: h2 dependency-type: indirect ... Signed-off-by: dependabot[bot] <[email protected]> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* fix analytics for the cluster
* Add active_ingesters and inactive_ingesters metrics
* updated ingesters' count and event related metrics

This PR adds fixes for:
1. Default role not assigned to the OAuth user if group does not exist
2. Use user name instead of id
fixes parseablehq#638 fixes parseablehq#868

Signed-off-by: Nitish Tiwari <[email protected]>

fix for staging size metrics not resetting to 0 even when local to storage sync is completed and no arrow/parquet file is left in staging folder

* Refactor object storage to use filter_func instead of starts_with_pattern in get_objects method
* Refactor fetch_schema method to use object storage instead of HTTP requests
* Refactor metadata.rs and storage.rs
* refactor ingest logic
* fetch stream info from store if stream info is not present in memory. error if stream info does not exist in S3 and memory

Co-authored-by: Nikhil Sinha <[email protected]>

…seablehq#746)

1. fixed banner spacing
2. modified server mode: All to Standalone, Ingest to Distributed (Ingest), Query to Distributed (Query)
3. updated server mode in about API response
4. updated logic for env var P_INGESTOR_URL to use HOSTNAME and PORT from env
5. remove put cache api from querier
6. added put cache api to ingestor
7. renamed ingester to ingestor
8. corrected cache flow for ingestors and standalone
9. removed query, other logstream apis for ingestors

also fixed P_INGESTOR_URL fetch from env variables

With this PR, when delete stream is called, the querier deletes the stream folder from storage and then calls the delete stream API on each ingestor. Finally, each ingestor deletes the stream from its local map.

This PR ensures all metadata and data files (json and parquet) use a simple sha256 based hash name mechanism. Each ingestor allocates itself a unique hash which is used in all file names relevant to that ingestor. This hash is persisted in metadata file content also and is supposed to be the same for the lifecycle of the ingestor. --------- Co-authored-by: Nikhil Sinha <[email protected]>

Caching for distributed mode can be enabled from the querier UI. The querier calls the PUT /cache API on all ingestors. An ingestor first checks whether the stream exists; if it is not found in the local map, the ingestor checks S3 and creates the stream. It then checks whether the caching env vars are set. If so, it adds the cache_enabled flag to STREAM_INFO and updates the stream's stream.json in S3. Fixes: parseablehq#764
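The ingestor-side flow described above can be sketched with in-memory stand-ins. The `Ingestor` struct, its fields and the `CacheError` variants are hypothetical; in particular, returning an error when the caching env vars are not set is an assumption, since the PR text only describes the case where they are set.

```rust
use std::collections::{HashMap, HashSet};

#[derive(Debug, PartialEq, Eq)]
pub enum CacheError {
    StreamNotFound,
    CacheNotConfigured,
}

/// Hypothetical stand-in for an ingestor's state.
#[derive(Default)]
pub struct Ingestor {
    /// In-memory stream info: stream name -> cache_enabled flag.
    pub stream_info: HashMap<String, bool>,
    /// Stand-in for the streams that exist in object storage (S3).
    pub remote_streams: HashSet<String>,
    /// Stand-in for "the caching env vars are set".
    pub cache_configured: bool,
}

impl Ingestor {
    /// Handle a PUT /cache request for one stream.
    pub fn put_cache(&mut self, stream: &str) -> Result<(), CacheError> {
        // Step 1: look in the local map, then fall back to object storage.
        if !self.stream_info.contains_key(stream) {
            if !self.remote_streams.contains(stream) {
                return Err(CacheError::StreamNotFound);
            }
            // Found in storage only: create the stream locally, cache off.
            self.stream_info.insert(stream.to_string(), false);
        }
        // Step 2: caching must be configured on this ingestor.
        if !self.cache_configured {
            return Err(CacheError::CacheNotConfigured);
        }
        // Step 3: set the flag. A real implementation would also
        // write the updated stream.json back to object storage here.
        self.stream_info.insert(stream.to_string(), true);
        Ok(())
    }
}
```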

fix for stream info api

nikhilsinhaparseable and others added 30 commits January 15, 2024 18:30

fix for parseablehq#573 corrected error message in case of deployment…

e70e144

… mismatch; data directory creation should not happen in case of deployment mismatch; staging should be overwritten in case of new staging

cargo fmt and clippy fix

f8780ee

changed logic to throw exception in case of deployment mismatch betwe…

6b21462

…en staging and remote

Merge branch 'parseablehq:main' into main

505bf01

temp: allow configurable buffer size while ingestion (parseablehq#624)

e575a41

Co-authored-by: Nitish Tiwari <[email protected]>

Revert "Mutex Poison Error if a large amount of data is sent" (parsea…

c6e71fd

…blehq#629)

fix: proper error Message if storage is in an invalid state (parseabl…

9b6e179

…ehq#623) fixes parseablehq#615

fix: clean up storage validation logic further (parseablehq#633)

007fd88

Standardise the error message and also use the new logg.ing domain for short URLs.

fix: allow only GET /logstream in query mode (parseablehq#643)

fe860bd

Previously, in Query Mode, all log stream endpoints were allowed. It is better that only the ingester is allowed to create streams. Fixes parseablehq#641

Update CNAME

3e017bd

Signed-off-by: Nitish Tiwari <[email protected]>

update helm chart to correct domain (parseablehq#647)

14d1da2

remove collector from helm chart repo (parseablehq#648)

1ef78ea

fix: removed use of environment var P_STORAGE_UPLOAD_INTERVAL (parsea…

f873fd1

…blehq#653) added const of 60 secs to be used for local to storage sync fixes parseablehq#651

add license details with cargo-about (parseablehq#658)

e742a2e

feat: support OpenTelemetry log flattening (parseablehq#657)

b806697

fixes: parseablehq#312

fix: absolute url for windows (parseablehq#655)

9b192a1

fixes parseablehq#650

fix: avoid removal of retention configuration while updating snapshot (…

8f180a8

…parseablehq#661) fixes parseablehq#654

fix: retention cleanup on server startup (parseablehq#662)

0e7fe60

fixes parseablehq#660

prepare for release v0.8.0 (parseablehq#663)

31f6efc

includes console release v0.4.0

update cargo-about details (parseablehq#664)

7c060c6

delete parseable-license.html (parseablehq#665)

c8bbaa6

It is better that users generate it themselves as needed. Signed-off-by: Nitish Tiwari <[email protected]>

Revert "fix: avoid removal of retention configuration while updating …

898773b

…snapshot" (parseablehq#666) Reverts parseablehq#661 because with this change we're backward incompatible with older versions where .stream.json doesn't contain retention field. This reverts commit 121bf01.

prepare for bugfix release v0.8.1 (parseablehq#668)

5ca0ab5

do not lock PRs after merge (parseablehq#674)

f522d95

Signed-off-by: Nitish Tiwari <[email protected]>

Eshanatnight and others added 29 commits April 20, 2024 00:20

feat: implement distributed system with ingest and query modes (parse…

ae08d56

…ablehq#730)

fix: delete node endpoint (parseablehq#731)

2dea510

delete ingester if offline

fix: stats response (parseablehq#733)

a686d81

* fix: stats response * fix: s3 get objects Refactor object storage to filter objects by starts_with_pattern

update to latest Console and Rust version (parseablehq#739)

f828cbb

Rm staging query result (parseablehq#740)

8c7bb86

* remove staging query from the query result (for distributed)
* Refactor get_schema method to handle missing schema in object storage

fix analytics for the cluster (parseablehq#732)

4aac841

* fix analytics for the cluster
* Add active_ingesters and inactive_ingesters metrics
* updated ingesters' count and event related metrics

Fix: OAuth User to Role Mapping Fix (parseablehq#742)

87e9a08

This PR adds fixes for:
1. Default role not assigned to the OAuth user if group does not exist
2. Use user name instead of id
fixes parseablehq#638 fixes parseablehq#868

cleaned up readme for better readability (parseablehq#738)

14b12f1

Update README.md

30c90b5

Signed-off-by: Nitish Tiwari <[email protected]>

removing ingester liveness check at query server start (parseablehq#744)

df03bc7

fix for staging size metrics not resetting to 0 (parseablehq#743)

acbc92d

fix for staging size metrics not resetting to 0 even when local to storage sync is completed and no arrow/parquet file is left in staging folder

fix: querier to use manifest list from all ingestors (parseablehq#747)

8ff0e45

rename P_NODE_URL to P_INGESTOR_URL (parseablehq#748)

bc58ca3

Co-authored-by: Nikhil Sinha <[email protected]>

fix extract_cookie function to handle grpc cookies in the header (par…

4fe0249

…seablehq#746)

fix for GET /cache API (parseablehq#754)

fa3174a

also fixed P_INGESTOR_URL fetch from env variables

update to latest console v0.6.1 (parseablehq#755)

09606b9

fixed liveness and readiness api to use base path (parseablehq#757)

6fd9c36

fix: readiness api to use correct JSON path (parseablehq#758)

b9057c5

fix: add delete stream API in distributed mode (parseablehq#766)

86620bd

With this PR, when delete stream is called, the querier deletes the stream folder from storage and then calls the delete stream API on each ingestor. Finally, each ingestor deletes the stream from its local map.

fix: updated code to fix retention for standalone deployments

918c3b0

fix for retention for distributed deployments - WIP

96ef239

fix for retention for distributed deployment

052c6de

fix for stream info api

fix for get stream info API

d0d0ac8

nikhilsinhaparseable closed this Apr 20, 2024
