Skip to content

[docs] add docs to integrate Fluss + Iceberg via Flink with AWS Glue and Hive#3424

Open
qzyu999 wants to merge 1 commit into
apache:mainfrom
qzyu999:issue-2616
Open

[docs] add docs to integrate Fluss + Iceberg via Flink with AWS Glue and Hive#3424
qzyu999 wants to merge 1 commit into
apache:mainfrom
qzyu999:issue-2616

Conversation

@qzyu999
Copy link
Copy Markdown

@qzyu999 qzyu999 commented Jun 3, 2026

Purpose

Linked issue: close #2616

This pull request introduces comprehensive integration guides for using AWS Glue and Hive Metastore catalogs when tiering Fluss streaming data to Apache Iceberg. This completes the Iceberg Data Lake Catalogs documentation suite under docs/streaming-lakehouse/integrate-data-lakes/catalogs/.

Brief change log

  • Added AWS Glue Catalog integration guide (website/docs/streaming-lakehouse/integrate-data-lakes/catalogs/glue.md): Documents AWS IAM policy template, required catalog runtime JAR dependencies, server.yaml cluster configurations, Flink tiering service commands, and Amazon Athena query verification.
  • Added Hive Metastore integration guide (website/docs/streaming-lakehouse/integrate-data-lakes/catalogs/hive.md): Documents Hive Metastore Thrift connection options, required Hadoop client and Hive runtime classpath dependencies, HADOOP_CLASSPATH configuration, Flink tiering commands, and Spark SQL query verification.
  • Updated main Iceberg integration guide (website/docs/streaming-lakehouse/integrate-data-lakes/formats/iceberg.md): Added catalog-specific cross-links for hive (linking to Hive Metastore), glue (linking to AWS Glue), and rest (linking to Lakekeeper).

Note: Changes are based on the existing lakekeeper.md as a template, and references were based on existing code and online/offline documentation. The actual AWS Glue/HMS implementations have not yet been tested by the developer.

Tests

  • Built the entire documentation site locally using npm run build to verify the page output and ensure there are no broken links (meeting Docusaurus build validation).
  • Verified the rendering, layouts, and link references using the local development server at http://localhost:3000/docs/next/streaming-lakehouse/integrate-data-lakes/formats/iceberg/.

API and Format

This is a documentation-only change. It does not affect any public API or storage formats.

Documentation

This pull request introduces new documentation guides under the Docusaurus website subfolder. No changes were made to code-level Javadocs.

  • Generative AI disclosure:
    • Yes (Antigravity AI Assistant, reviewed by human developer)

…Data Lake Catalogs section which includes Lakekeeper
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

[Lake/Iceberg] create doc to integrate fluss + Iceberg via flink with aws glue

1 participant