[docs] add docs to integrate Fluss + Iceberg via Flink with AWS Glue and Hive by qzyu999 · Pull Request #3424 · apache/fluss

qzyu999 · 2026-06-03T04:43:39Z

Purpose

Linked issue: close #2616

This pull request introduces comprehensive integration guides for using AWS Glue and Hive Metastore catalogs when tiering Fluss streaming data to Apache Iceberg. This completes the Iceberg Data Lake Catalogs documentation suite under docs/streaming-lakehouse/integrate-data-lakes/catalogs/.

Brief change log

Added AWS Glue Catalog integration guide (website/docs/streaming-lakehouse/integrate-data-lakes/catalogs/glue.md): Documents AWS IAM policy template, required catalog runtime JAR dependencies, server.yaml cluster configurations, Flink tiering service commands, and Amazon Athena query verification.
Added Hive Metastore integration guide (website/docs/streaming-lakehouse/integrate-data-lakes/catalogs/hive.md): Documents Hive Metastore Thrift connection options, required Hadoop client and Hive runtime classpath dependencies, HADOOP_CLASSPATH configuration, Flink tiering commands, and Spark SQL query verification.
Updated main Iceberg integration guide (website/docs/streaming-lakehouse/integrate-data-lakes/formats/iceberg.md): Added catalog-specific cross-links for hive (linking to Hive Metastore), glue (linking to AWS Glue), and rest (linking to Lakekeeper).

Note: Changes are based on the existing lakekeeper.md as a template, and references were based on existing code and online/offline documentation. The actual AWS Glue/HMS implementations have not yet been tested by the developer.

Tests

Built the entire documentation site locally using npm run build to verify the page output and ensure there are no broken links (meeting Docusaurus build validation).
Verified the rendering, layouts, and link references using the local development server at http://localhost:3000/docs/next/streaming-lakehouse/integrate-data-lakes/formats/iceberg/.

API and Format

This is a documentation-only change. It does not affect any public API or storage formats.

Documentation

This pull request introduces new documentation guides under the Docusaurus website subfolder. No changes were made to code-level Javadocs.

Generative AI disclosure:
- Yes (Antigravity AI Assistant, reviewed by human developer)

…Data Lake Catalogs section which includes Lakekeeper

docs: add Hive Metastore and AWS Glue catalog guides into a separate …

bcd2c93

…Data Lake Catalogs section which includes Lakekeeper

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[docs] add docs to integrate Fluss + Iceberg via Flink with AWS Glue and Hive#3424

[docs] add docs to integrate Fluss + Iceberg via Flink with AWS Glue and Hive#3424
qzyu999 wants to merge 1 commit into
apache:mainfrom
qzyu999:issue-2616

qzyu999 commented Jun 3, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

qzyu999 commented Jun 3, 2026

Purpose

Brief change log

Tests

API and Format

Documentation

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant