Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[SUPPORT] Wrong file path for commit-marker-file in Consistent-Bucket-Index layout #12338

Open
TheR1sing3un opened this issue Nov 26, 2024 · 0 comments
Labels

Comments

@TheR1sing3un
Copy link
Member

In Consistent-Bucket-Index file layout:
there are two type of file for each versioned bucket metadata:

  1. {instant}.hashing_metadata: created in each bucket resizing.
  2. {instant}.commit: created after instant is completed.
image

All these two type of files should all be stored in this path:
image
When loadMetadata, will load all these two type of files from above path.
image
But, I find that the {instant}.commit file will not be stored in this hash-metadata path.
image
This file will be stored in data path, which is {basePath}/{partitionPath}/{instant}.commit. So when ConsistentBucketIndexUtils load metadata for a partition, all files of this type will not be loaded because they are stored in the wrong location.

For example:

.hashing_metadata

image

.commit

image

Tips before filing an issue

  • Have you gone through our FAQs?

  • Join the mailing list to engage in conversations and get faster support at [email protected].

  • If you have triaged this as a bug, then file an issue directly.

Describe the problem you faced

A clear and concise description of the problem.

To Reproduce

Steps to reproduce the behavior:

Expected behavior

A clear and concise description of what you expected to happen.

Environment Description

  • Hudi version :

  • Spark version :

  • Hive version :

  • Hadoop version :

  • Storage (HDFS/S3/GCS..) :

  • Running on Docker? (yes/no) :

Additional context

Add any other context about the problem here.

Stacktrace

Add the stacktrace of the error.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
Status: Awaiting Triage
Development

No branches or pull requests

2 participants