Skip to content

*: add analytics service docs#22556

Draft
lilin90 wants to merge 7 commits intopingcap:feature/preview-cloud-lakefrom
lilin90:cloud-lake-dir
Draft

*: add analytics service docs#22556
lilin90 wants to merge 7 commits intopingcap:feature/preview-cloud-lakefrom
lilin90:cloud-lake-dir

Conversation

@lilin90
Copy link
Member

@lilin90 lilin90 commented Mar 12, 2026

What is changed, added or deleted? (Required)

  • Newly added 1000+ documents for cloud analytics service

Which TiDB version(s) do your changes apply to? (Required)

Tips for choosing the affected version(s):

By default, CHOOSE MASTER ONLY so your changes will be applied to the next TiDB major or minor releases. If your PR involves a product feature behavior change or a compatibility change, CHOOSE THE AFFECTED RELEASE BRANCH(ES) AND MASTER.

For details, see tips for choosing the affected versions.

  • master (the latest development version)
  • v9.0 (TiDB 9.0 versions)
  • v8.5 (TiDB 8.5 versions)
  • v8.1 (TiDB 8.1 versions)
  • v7.5 (TiDB 7.5 versions)
  • v7.1 (TiDB 7.1 versions)
  • v6.5 (TiDB 6.5 versions)
  • v6.1 (TiDB 6.1 versions)
  • v5.4 (TiDB 5.4 versions)

What is the related PR or file link(s)?

  • This PR is translated from:
  • Other reference link(s):

Do your changes match any of the following descriptions?

  • Delete files
  • Change aliases
  • Need modification after applied to another branch
  • Might cause conflicts after applied to another branch

@lilin90 lilin90 self-assigned this Mar 12, 2026
@lilin90 lilin90 added the area/tidb-cloud This PR relates to the area of TiDB Cloud. label Mar 12, 2026
@ti-chi-bot
Copy link

ti-chi-bot bot commented Mar 12, 2026

Skipping CI for Draft Pull Request.
If you want CI signal for your change, please convert it to an actual PR.
You can still manually trigger a test run with /test all

@ti-chi-bot ti-chi-bot bot added the do-not-merge/work-in-progress Indicates that a PR should not merge because it is a work in progress. label Mar 12, 2026
@ti-chi-bot
Copy link

ti-chi-bot bot commented Mar 12, 2026

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by:
Once this PR has been reviewed and has the lgtm label, please ask for approval from lilin90. For more information see the Code Review Process.
Please ensure that each of them provides their approval before proceeding.

The full list of commands accepted by this bot can be found here.

Details Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@ti-chi-bot ti-chi-bot bot added missing-translation-status This PR does not have translation status info. size/XXL Denotes a PR that changes 1000+ lines, ignoring generated files. labels Mar 12, 2026
@lilin90 lilin90 added the translation/doing This PR's assignee is translating this PR. label Mar 12, 2026
@ti-chi-bot ti-chi-bot bot removed the missing-translation-status This PR does not have translation status info. label Mar 12, 2026
@gemini-code-assist
Copy link
Contributor

Summary of Changes

Hello, I'm Gemini Code Assist1! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request significantly expands the documentation for the TiDB Cloud Lake analytics service. It introduces a vast collection of new guides, tutorials, and SQL reference content to provide users with comprehensive resources for understanding and utilizing the platform's capabilities. The additions aim to enhance user experience by offering detailed explanations across various functional areas.

Highlights

  • Extensive Documentation Added: Over 1000 new documents have been added, covering the TiDB Cloud Lake analytics service, including guides, tutorials, and SQL reference materials.
  • Comprehensive Feature Coverage: The new documentation details various aspects of the analytics service, from getting started and administration to security, data loading, AI/ML integration, performance optimization, and data management.
  • Detailed SQL Reference: A complete SQL reference section has been introduced, categorizing data types, statements (DDL, DML, Query Syntax, Administration), and functions (Bitmap, Conversion, Conditional, Numeric, Date & Time, Interval, String, Aggregate, Window, Geospatial, Full-Text Search, Structured & Semi-Structured, Vector, Hash, UUID, IP Address, Context, System, Table, Sequence, Data Anonymization, Test, Other).

🧠 New Feature in Public Preview: You can now enable Memory to help Gemini Code Assist learn from your team's feedback. This makes future code reviews more consistent and personalized to your project's style. Click here to enable Memory in your admin console.

Changelog
  • TOC-tidb-cloud-lake.md
    • Added a new Table of Contents file for the TiDB Cloud Lake documentation, structuring all new guides, tutorials, and SQL references.
  • tidb-cloud-lake/_index.md
    • Added the main index file for TiDB Cloud Lake documentation, providing an overview and learning path container for the service.
  • tidb-cloud-lake/guides/access-control.md
    • Added documentation for Access Control, explaining RBAC, DAC, Privileges, Roles, and Ownership in Databend.
  • tidb-cloud-lake/guides/aggregating-index.md
    • Added documentation for Aggregating Index, detailing its purpose, how it works, setup, supported operations, refresh strategies, and best practices.
  • tidb-cloud-lake/guides/ai-ml-integration.md
    • Added an overview of AI & ML Integration, covering external functions and the MCP Server for natural language data interaction.
  • tidb-cloud-lake/guides/ai-powered-features.md
    • Added documentation for AI-Powered Features, including AI Chat for assistance and an AI-Powered SQL Assistant.
  • tidb-cloud-lake/guides/audit-trail.md
    • Added documentation for Audit Trail, explaining available audit tables and use cases for security monitoring, compliance, and operational monitoring.
  • tidb-cloud-lake/guides/authenticate-with-aws-iam-role.md
    • Added documentation on how to authenticate with AWS IAM Role for secure S3 access in Databend Cloud.
  • tidb-cloud-lake/guides/automate-data-loading-with-tasks.md
    • Added documentation for Automating Data Loading with Tasks, including examples for scheduled copy and stream-triggered merge.
  • tidb-cloud-lake/guides/bendsql.md
    • Added comprehensive documentation for BendSQL, covering installation, user authentication, connection methods, settings, and utility commands.
  • tidb-cloud-lake/guides/cluster-key.md
    • Added documentation for Cluster Key, explaining its role in query acceleration, how it works, setup, and best practices.
  • tidb-cloud-lake/guides/compliance-security.md
    • Added documentation on Databend Security Design, covering access control, data protection, secure connectivity, encryption, and compliance certifications.
  • tidb-cloud-lake/guides/connect-to-databend-using-golang.md
    • Added documentation for connecting to Databend using Golang.
  • tidb-cloud-lake/guides/connect-to-databend-using-java.md
    • Added documentation for connecting to Databend using Java.
  • tidb-cloud-lake/guides/connect-to-databend-using-node-js.md
    • Added documentation for connecting to Databend using Node.js.
  • tidb-cloud-lake/guides/connect-to-databend-using-python.md
    • Added documentation for connecting to Databend using Python.
  • tidb-cloud-lake/guides/connect-to-databend-using-rust.md
    • Added documentation for connecting to Databend using Rust.
  • tidb-cloud-lake/guides/connect-to-databend.md
    • Added an overview for connecting to Databend, summarizing quick selection guides for SQL clients, drivers, and visualization tools.
  • tidb-cloud-lake/guides/connecting-to-databend-cloud-with-aws-privatelink.md
    • Added documentation for connecting to Databend Cloud with AWS PrivateLink, detailing benefits and setup steps.
  • tidb-cloud-lake/guides/continuous-data-pipelines.md
    • Added an overview of Continuous Data Pipelines, explaining streams and tasks for CDC flows.
  • tidb-cloud-lake/guides/dashboards.md
    • Added documentation for Dashboards, covering creation, chart types, and sharing options.
  • tidb-cloud-lake/guides/data-integration-overview.md
    • Added an overview of Data Integration, detailing supported data sources, key concepts, and task management.
  • tidb-cloud-lake/guides/data-lifecycle.md
    • Added documentation for Data Lifecycle in Databend, outlining object management, storing, querying, working with, and removing data.
  • tidb-cloud-lake/guides/data-management.md
    • Added an overview of Data Management, categorizing features like Data Lifecycle, Recovery, Protection, and Recycle.
  • tidb-cloud-lake/guides/data-protection.md
    • Added documentation for Data Protection in Databend Cloud, highlighting CDP features like Network Policies, Access Control, Time Travel, and Fail-safe.
  • tidb-cloud-lake/guides/data-purge-and-recycle.md
    • Added documentation for Data Purge and Recycle, explaining VACUUM commands and data retention settings.
  • tidb-cloud-lake/guides/data-recovery.md
    • Added documentation for Data Recovery, covering Time Travel, Fail-safe, and BendSave for backup and restore.
  • tidb-cloud-lake/guides/dbeaver.md
    • Added documentation for connecting to Databend using DBeaver, including prerequisites and connection steps for self-hosted and cloud instances.
  • tidb-cloud-lake/guides/deepnote.md
    • Added documentation for integrating Deepnote with Databend Cloud.
  • tidb-cloud-lake/guides/editions.md
    • Added documentation detailing the features of Databend Cloud's Personal, Business, and Dedicated editions.
  • tidb-cloud-lake/guides/external-ai-functions.md
    • Added documentation for External AI Functions, explaining key capabilities, how it works, and an example.
  • tidb-cloud-lake/guides/fail-safe.md
    • Added documentation for Fail-Safe, explaining its mechanisms for data recovery from object storage.
  • tidb-cloud-lake/guides/full-text-index.md
    • Added documentation for Full-Text Index, detailing its problem-solving capabilities, how it works, setup, search functions, and best practices.
  • tidb-cloud-lake/guides/geo-analytics.md
    • Added documentation for Geo Analytics, covering location tables, spatial filters, hex aggregations, traffic context, and heatmap views.
  • tidb-cloud-lake/guides/grafana.md
    • Added documentation for integrating Grafana with Databend Cloud, covering Loki Protocol and custom plugin methods.
  • tidb-cloud-lake/guides/how-data-sharing-works.md
    • Added documentation explaining how Databend Data Sharing works using ATTACH TABLE, its benefits, and underlying mechanisms.
  • tidb-cloud-lake/guides/how-fuse-engine-works.md
    • Added documentation explaining how the Fuse Engine works, covering core concepts, data writing/reading, snapshot-based features, and advanced indexing.
  • tidb-cloud-lake/guides/how-json-variant-works.md
    • Added documentation explaining how Databend JSON (Variant) works, covering storage layout, automatic indexing, and query execution.
  • tidb-cloud-lake/guides/how-optimizer-works.md
    • Added documentation explaining how the Databend Optimizer works, detailing its phases from prep and stats to refining logic and physical plan selection.
  • tidb-cloud-lake/guides/integrate-with-amazon-s3.md
    • Added documentation for integrating with Amazon S3, covering supported file formats, data source creation, and integration task setup.
  • tidb-cloud-lake/guides/integrate-with-mysql.md
    • Added documentation for integrating with MySQL, covering sync modes, prerequisites, data source creation, and integration task setup.
  • tidb-cloud-lake/guides/json-search.md
    • Added documentation for JSON & Search, covering metadata table creation and various search patterns with QUERY().
  • tidb-cloud-lake/guides/jupyter-notebook.md
    • Added documentation for integrating Jupyter Notebook with Databend, covering SQLAlchemy, ipython-sql, and Python Binding Library methods.
  • tidb-cloud-lake/guides/lakehouse-etl.md
    • Added documentation for Lakehouse ETL, covering stage creation, file inspection, COPY INTO unified tables, streams, and tasks.
  • tidb-cloud-lake/guides/load-avro.md
    • Added documentation for loading Avro files into Databend from remote HTTP URLs.
  • tidb-cloud-lake/guides/load-csv.md
    • Added documentation for loading CSV files into Databend from an internal stage.
  • tidb-cloud-lake/guides/load-from-bucket.md
    • Added documentation for loading data from an Amazon S3 bucket into Databend.
  • tidb-cloud-lake/guides/load-from-local-file.md
    • Added documentation for loading data from local files into Databend using BendSQL.
  • tidb-cloud-lake/guides/load-from-remote-file.md
    • Added documentation for loading data from remote files into Databend using glob patterns.
  • tidb-cloud-lake/guides/load-from-stage.md
    • Added documentation for loading data from user, internal, and external stages into Databend.
  • tidb-cloud-lake/guides/load-ndjson.md
    • Added documentation for loading NDJSON files into Databend from an internal stage.
  • tidb-cloud-lake/guides/load-orc.md
    • Added documentation for loading ORC files into Databend from an S3 external stage.
  • tidb-cloud-lake/guides/load-parquet.md
    • Added documentation for loading Parquet files into Databend from an internal stage.
  • tidb-cloud-lake/guides/load-semi-structured-formats.md
    • Added an overview of loading semi-structured formats, detailing supported file types and their guides.
  • tidb-cloud-lake/guides/load-tsv.md
    • Added documentation for loading TSV files into Databend from an internal stage.
  • tidb-cloud-lake/guides/load-with-addax.md
    • Added documentation for loading data with Addax, including DatabendReader & DatabendWriter plugins.
  • tidb-cloud-lake/guides/load-with-airbyte.md
    • Added documentation for loading data with Airbyte, including Databend destination configuration.
  • tidb-cloud-lake/guides/load-with-datax.md
    • Added documentation for loading data with DataX, including the DatabendWriter plugin.
  • tidb-cloud-lake/guides/load-with-dbt.md
    • Added documentation for loading data with dbt, including dbt-databend-cloud installation and project setup.
  • tidb-cloud-lake/guides/load-with-debezium.md
    • Added documentation for loading data with Debezium, including debezium-server-databend installation and usage.
  • tidb-cloud-lake/guides/load-with-flink-cdc.md
    • Added documentation for loading data with Flink CDC.
  • tidb-cloud-lake/guides/load-with-kafka.md
    • Added documentation for loading data with Kafka, covering databend-kafka-connect and bend-ingest-kafka.
  • tidb-cloud-lake/guides/load-with-tapdata.md
    • Added documentation for loading data with Tapdata, including agent deployment and connection setup.
  • tidb-cloud-lake/guides/load-with-vector.md
    • Added documentation for loading data with Vector, including Databend sink configuration and Nginx access log example.
  • tidb-cloud-lake/guides/manage-costs.md
    • Added documentation for Managing Costs, covering spending limits and granting access to finance personnel.
  • tidb-cloud-lake/guides/masking-policy.md
    • Added documentation for Masking Policy, explaining its functionality, workflow, and management commands.
  • tidb-cloud-lake/guides/mcp-client-integration.md
    • Added documentation for MCP Client Integration, covering setup with various clients and available tools.
  • tidb-cloud-lake/guides/mcp-server.md
    • Added documentation for MCP Server for Databend, explaining its capabilities and how to build a ChatBI tool.
  • tidb-cloud-lake/guides/metabase.md
    • Added documentation for integrating Metabase with Databend, covering driver installation and connection setup.
  • tidb-cloud-lake/guides/monitor-usage.md
    • Added documentation for Monitoring Usage, detailing metrics, SQL history, task history, and audit logs.
  • tidb-cloud-lake/guides/multimodal-data-analytics.md
    • Added an overview of Multimodal Data Analytics, outlining guides for SQL, JSON, Vector, Geo, and Lakehouse ETL.
  • tidb-cloud-lake/guides/network-policy.md
    • Added documentation for Network Policy, explaining its function, setup, and management.
  • tidb-cloud-lake/guides/ngram-index.md
    • Added documentation for Ngram Index, detailing its use for fast pattern matching in LIKE queries, comparison with full-text index, and examples.
  • tidb-cloud-lake/guides/organization-members.md
    • Added documentation for Organization & Members, explaining organization concepts, member management, and invitations.
  • tidb-cloud-lake/guides/ownership.md
    • Added documentation for Ownership, explaining how it works, granting, and revoking ownership.
  • tidb-cloud-lake/guides/password-policy.md
    • Added documentation for Password Policy, explaining its function, setup, and management.
  • tidb-cloud-lake/guides/performance-optimization.md
    • Added an overview of Performance Optimization features, including various indexing technologies.
  • tidb-cloud-lake/guides/platforms-regions.md
    • Added documentation for Platforms & Regions, providing information on supported cloud platforms and regions.
  • tidb-cloud-lake/guides/pricing-billing.md
    • Added documentation for Pricing & Billing, detailing cost management and support service levels.
  • tidb-cloud-lake/guides/privileges.md
    • Added documentation for Privileges, explaining their role in access control, management, and various types of privileges.
  • tidb-cloud-lake/guides/query-avro-files-in-stage.md
    • Added documentation for querying Avro files directly from a stage, including syntax, features, and type mapping.
  • tidb-cloud-lake/guides/query-csv-files-in-stage.md
    • Added documentation for querying CSV files directly from a stage, including syntax and tutorial.
  • tidb-cloud-lake/guides/query-ndjson-files-in-stage.md
    • Added documentation for querying NDJSON files directly from a stage, including syntax and tutorial.
  • tidb-cloud-lake/guides/query-parquet-files-in-stage.md
    • Added documentation for querying Parquet files directly from a stage, including syntax and tutorial.
  • tidb-cloud-lake/guides/query-result-cache.md
    • Added documentation for Query Result Cache, explaining usage conditions, configuration, and management.
  • tidb-cloud-lake/guides/query-staged-orc-files-in-stage.md
    • Added documentation for querying ORC files directly from a stage, including syntax and tutorial.
  • tidb-cloud-lake/guides/query-tsv-files-in-stage.md
    • Added documentation for querying TSV files directly from a stage, including syntax and tutorial.
  • tidb-cloud-lake/guides/querying-transforming.md
    • Added documentation for Querying & Transforming data on load, covering syntax, FROM clause, and various query methods.
  • tidb-cloud-lake/guides/recovery-from-operational-errors.md
    • Added documentation for Recovery from Operational Errors, detailing solutions for dropped databases/tables, incorrect data, and schema evolution rollbacks.
  • tidb-cloud-lake/guides/redash.md
    • Added documentation for integrating Redash with Databend, covering deployment and data source configuration.
  • tidb-cloud-lake/guides/roles.md
    • Added documentation for Roles, explaining inheritance, built-in roles, default roles, and business-aligned role models.
  • tidb-cloud-lake/guides/security-reliability.md
    • Added an overview of Security & Reliability features, including access control, audit trail, network/password/masking policies, fail-safe, and error recovery.
  • tidb-cloud-lake/guides/sql-analytics.md
    • Added documentation for SQL Analytics, covering base table creation, filtering, join patterns, aggregations, window functions, aggregating index boosts, and stored procedure automation.
  • tidb-cloud-lake/guides/stage-overview.md
    • Added an overview of Stage functionality, explaining internal, external, and user stages, and management commands.
  • tidb-cloud-lake/guides/superset.md
    • Added documentation for integrating Superset with Databend Cloud, covering image building and connection setup.
  • tidb-cloud-lake/guides/support-services.md
    • Added documentation for Support Services, detailing how to get support and service levels.
  • tidb-cloud-lake/guides/tableau.md
    • Added documentation for integrating Tableau with Databend, covering JDBC interface and custom connector methods.
  • tidb-cloud-lake/guides/tidb-cloud-lake-architecture.md
    • Added documentation for TiDB Cloud Lake Architecture, detailing the Meta-Service, Compute, and Storage layers.
  • tidb-cloud-lake/guides/track-and-transform-data-via-streams.md
    • Added documentation for Tracking and Transforming Data via Streams, including append-only and standard stream examples.
  • tidb-cloud-lake/guides/track-metrics.md
    • Added documentation for Tracking Metrics with Prometheus, covering integration steps and available metrics.
  • tidb-cloud-lake/guides/transform-data-on-load.md
    • Added documentation for Transforming Data on Load, demonstrating various transformation tutorials during the loading process.
  • tidb-cloud-lake/guides/unload-csv-file.md
    • Added documentation for unloading CSV files from Databend to an external stage.
  • tidb-cloud-lake/guides/unload-data-from-databend.md
    • Added an overview of unloading data from Databend, detailing supported file formats and storage destinations.
  • tidb-cloud-lake/guides/unload-ndjson-file.md
    • Added documentation for unloading NDJSON files from Databend to an external stage.
  • tidb-cloud-lake/guides/unload-parquet-file.md
    • Added documentation for unloading Parquet files from Databend to an external stage.
  • tidb-cloud-lake/guides/unload-tsv-file.md
    • Added documentation for unloading TSV files from Databend to an external stage.
  • tidb-cloud-lake/guides/upload-to-stage.md
    • Added documentation for uploading files to stages using presigned URLs and PUT commands.
  • tidb-cloud-lake/guides/vector-search.md
    • Added documentation for Vector Search, covering embedding table preparation, cosine search, and enriching similar frames.
  • tidb-cloud-lake/guides/virtual-column.md
    • Added documentation for Virtual Column, explaining its role in JSON data acceleration, how it works, and examples.
  • tidb-cloud-lake/guides/warehouse.md
    • Added documentation for Warehouses, covering sizes, management, access control, multi-cluster features, and connection methods.
  • tidb-cloud-lake/guides/worksheet.md
    • Added documentation for Worksheets, covering creation, editing, running SQL statements, sharing, and exporting query results.
  • tidb-cloud-lake/lake-overview.md
    • Added an overview of TiDB Cloud Lake, summarizing its features and benefits.
  • tidb-cloud-lake/lake-quick-start.md
    • Added a quick start guide for TiDB Cloud Lake.
  • tidb-cloud-lake/sql/abs.md
    • Added documentation for the ABS SQL function.
  • tidb-cloud-lake/sql/acos.md
    • Added documentation for the ACOS SQL function.
  • tidb-cloud-lake/sql/add-months.md
    • Added documentation for the ADD_MONTHS SQL function.
  • tidb-cloud-lake/sql/add-time-interval.md
    • Added documentation for various ADD TIME INTERVAL SQL functions (ADD_YEARS, ADD_QUARTERS, etc.).
  • tidb-cloud-lake/sql/add.md
    • Added documentation for the ADD SQL function, an alias for PLUS.
  • tidb-cloud-lake/sql/administration-commands.md
    • Added an overview of Administration Commands, categorizing system monitoring, access control, configuration, function management, storage maintenance, and dynamic execution.
  • tidb-cloud-lake/sql/age.md
    • Added documentation for the AGE SQL function.
  • tidb-cloud-lake/sql/aggregate-functions.md
    • Added an overview of Aggregate Functions, categorized by basic, conditional, statistical, percentile, array, string, JSON, argument selection, funnel analysis, and anonymization.
  • tidb-cloud-lake/sql/aggregating-index.md
    • Added an overview of Aggregating Index operations and related topics.
  • tidb-cloud-lake/sql/alter-cluster-key.md
    • Added documentation for the ALTER CLUSTER KEY SQL command.
  • tidb-cloud-lake/sql/alter-function-sql.md
    • Added documentation for the ALTER FUNCTION SQL command for external functions.
  • tidb-cloud-lake/sql/alter-function.md
    • Added documentation for the ALTER FUNCTION SQL command for user-defined functions.
  • tidb-cloud-lake/sql/alter-network-policy.md
    • Added documentation for the ALTER NETWORK POLICY SQL command.
  • tidb-cloud-lake/sql/alter-notification-integration.md
    • Added documentation for the ALTER NOTIFICATION INTEGRATION SQL command.
  • tidb-cloud-lake/sql/alter-password-policy.md
    • Added documentation for the ALTER PASSWORD POLICY SQL command.
  • tidb-cloud-lake/sql/alter-table.md
    • Added comprehensive documentation for the ALTER TABLE SQL command, covering column operations, table comments, Fuse Engine options, external table connections, and swapping tables.
  • tidb-cloud-lake/sql/alter-task.md
    • Added documentation for the ALTER TASK SQL command.
  • tidb-cloud-lake/sql/alter-user.md
    • Added documentation for the ALTER USER SQL command.
  • tidb-cloud-lake/sql/alter-view.md
    • Added documentation for the ALTER VIEW SQL command.
  • tidb-cloud-lake/sql/alter-warehouse.md
    • Added documentation for the ALTER WAREHOUSE SQL command.
  • tidb-cloud-lake/sql/alter-workload-group.md
    • Added documentation for the ALTER WORKLOAD GROUP SQL command.
  • tidb-cloud-lake/sql/any-value.md
    • Added documentation for the ANY_VALUE SQL function.
  • tidb-cloud-lake/sql/apache-hive-tables.md
    • Added documentation for Apache Hive Tables, covering quick start, metadata refresh, data type mapping, and limitations.
  • tidb-cloud-lake/sql/apache-icebergtm-tables.md
    • Added documentation for Apache Iceberg™ Tables, covering quick start, datatype mapping, catalog management, caching, and writing to tables.
  • tidb-cloud-lake/sql/approx-count-distinct.md
    • Added documentation for the APPROX_COUNT_DISTINCT SQL function.
  • tidb-cloud-lake/sql/arg-max.md
    • Added documentation for the ARG_MAX SQL function.
  • tidb-cloud-lake/sql/arg-min.md
    • Added documentation for the ARG_MIN SQL function.
  • tidb-cloud-lake/sql/arithmetic-operators.md
    • Added documentation for Arithmetic Operators.
  • tidb-cloud-lake/sql/array-agg.md
    • Added documentation for the ARRAY_AGG SQL function.
  • tidb-cloud-lake/sql/array-aggregate.md
    • Added documentation for the ARRAY_AGGREGATE SQL function.
  • tidb-cloud-lake/sql/array-any.md
    • Added documentation for the ARRAY_ANY SQL function.
  • tidb-cloud-lake/sql/array-append.md
    • Added documentation for the ARRAY_APPEND SQL function.
  • tidb-cloud-lake/sql/array-approx-count-distinct.md
    • Added documentation for the ARRAY_APPROX_COUNT_DISTINCT SQL function.
  • tidb-cloud-lake/sql/array-avg.md
    • Added documentation for the ARRAY_AVG SQL function.
  • tidb-cloud-lake/sql/array-compact.md
    • Added documentation for the ARRAY_COMPACT SQL function.
  • tidb-cloud-lake/sql/array-concat.md
    • Added documentation for the ARRAY_CONCAT SQL function.
  • tidb-cloud-lake/sql/array-construct.md
    • Added documentation for the ARRAY_CONSTRUCT SQL function.
  • tidb-cloud-lake/sql/array-contains.md
    • Added documentation for the ARRAY_CONTAINS SQL function.
  • tidb-cloud-lake/sql/array-count.md
    • Added documentation for the ARRAY_COUNT SQL function.
  • tidb-cloud-lake/sql/array-distinct.md
    • Added documentation for the ARRAY_DISTINCT SQL function.
  • tidb-cloud-lake/sql/array-except.md
    • Added documentation for the ARRAY_EXCEPT SQL function.
  • tidb-cloud-lake/sql/array-filter.md
    • Added documentation for the ARRAY_FILTER SQL function.
  • tidb-cloud-lake/sql/array-flatten.md
    • Added documentation for the ARRAY_FLATTEN SQL function.
  • tidb-cloud-lake/sql/array-functions.md
    • Added an overview of Array Functions, categorized by creation, access, modification, combination, set operations, processing, aggregations, and formatting.
  • tidb-cloud-lake/sql/array-generate-range.md
    • Added documentation for the ARRAY_GENERATE_RANGE SQL function.
  • tidb-cloud-lake/sql/array-get.md
    • Added documentation for the ARRAY_GET SQL function, an alias for GET.
  • tidb-cloud-lake/sql/array-indexof.md
    • Added documentation for the ARRAY_INDEXOF SQL function.
  • tidb-cloud-lake/sql/array-insert.md
    • Added documentation for the ARRAY_INSERT SQL function.
  • tidb-cloud-lake/sql/array-intersection.md
    • Added documentation for the ARRAY_INTERSECTION SQL function.
  • tidb-cloud-lake/sql/array-kurtosis.md
    • Added documentation for the ARRAY_KURTOSIS SQL function.
  • tidb-cloud-lake/sql/array-max.md
    • Added documentation for the ARRAY_MAX SQL function.
  • tidb-cloud-lake/sql/array-median.md
    • Added documentation for the ARRAY_MEDIAN SQL function.
  • tidb-cloud-lake/sql/array-min.md
    • Added documentation for the ARRAY_MIN SQL function.
  • tidb-cloud-lake/sql/array-overlap.md
    • Added documentation for the ARRAY_OVERLAP SQL function.
  • tidb-cloud-lake/sql/array-prepend.md
    • Added documentation for the ARRAY_PREPEND SQL function.
  • tidb-cloud-lake/sql/array-reduce.md
    • Added documentation for the ARRAY_REDUCE SQL function.
  • tidb-cloud-lake/sql/array-remove-first.md
    • Added documentation for the ARRAY_REMOVE_FIRST SQL function.
  • tidb-cloud-lake/sql/array-remove-last.md
    • Added documentation for the ARRAY_REMOVE_LAST SQL function.
  • tidb-cloud-lake/sql/array-remove.md
    • Added documentation for the ARRAY_REMOVE SQL function.
  • tidb-cloud-lake/sql/array-reverse.md
    • Added documentation for the ARRAY_REVERSE SQL function.
  • tidb-cloud-lake/sql/array-size.md
    • Added documentation for the ARRAY_SIZE SQL function.
  • tidb-cloud-lake/sql/array-skewness.md
    • Added documentation for the ARRAY_SKEWNESS SQL function.
  • tidb-cloud-lake/sql/array-slice.md
    • Added documentation for the ARRAY_SLICE SQL function.
  • tidb-cloud-lake/sql/array-sort.md
    • Added documentation for the ARRAY_SORT SQL function.
  • tidb-cloud-lake/sql/array-sql.md
    • Added documentation for the ARRAY SQL data type.
  • tidb-cloud-lake/sql/array-stddev-pop.md
    • Added documentation for the ARRAY_STDDEV_POP SQL function.
  • tidb-cloud-lake/sql/array-stddev-samp.md
    • Added documentation for the ARRAY_STDDEV_SAMP SQL function.
  • tidb-cloud-lake/sql/array-string.md
    • Added documentation for the ARRAY_TO_STRING SQL function.
  • tidb-cloud-lake/sql/array-sum.md
    • Added documentation for the ARRAY_SUM SQL function.
  • tidb-cloud-lake/sql/array-unique.md
    • Added documentation for the ARRAY_UNIQUE SQL function.
  • tidb-cloud-lake/sql/arrays-zip.md
    • Added documentation for the ARRAYS_ZIP SQL function.
  • tidb-cloud-lake/sql/as-array.md
    • Added documentation for the AS_ARRAY SQL function.
  • tidb-cloud-lake/sql/as-binary.md
    • Added documentation for the AS_BINARY SQL function.
  • tidb-cloud-lake/sql/as-boolean.md
    • Added documentation for the AS_BOOLEAN SQL function.
Using Gemini Code Assist

The full guide for Gemini Code Assist can be found on our documentation page, here are some quick tips.

Invoking Gemini

You can request assistance from Gemini at any point by creating a comment using either /gemini <command> or @gemini-code-assist <command>. Below is a summary of the supported commands on the current page.

Feature Command Description
Code Review /gemini review Performs a code review for the current pull request in its current state.
Pull Request Summary /gemini summary Provides a summary of the current pull request in its current state.
Comment @gemini-code-assist Responds in comments when explicitly tagged, both in pull request comments and review comments.
Help /gemini help Displays a list of available commands.

Customization

To customize Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a .gemini/ folder in the base of the repository. Detailed instructions can be found here.

Limitations & Feedback

Gemini Code Assist may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counter productive. You can react with 👍 and 👎 on @gemini-code-assist comments. If you're interested in giving your feedback about your experience with Gemini Code Assist for GitHub and other Google products, sign up here.

You can also get AI-powered code generation, chat, as well as code reviews directly in the IDE at no cost with the Gemini Code Assist IDE Extension.

Footnotes

  1. Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double check it and use code with caution.

Copy link
Contributor

@gemini-code-assist gemini-code-assist bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

The pull request introduces new documentation for TiDB Cloud Lake, covering various guides, tutorials, and SQL references. The review comments highlight several issues related to inconsistent product naming (replacing "Databend" with "TiDB Cloud Lake"), incorrect relative links, a typo in "atabend", redundant phrasing, and a stray backtick in the Table of Contents. These issues require correction to ensure clarity, consistency, and proper rendering of the documentation.

lilin90 added 5 commits March 12, 2026 18:02
Standardize and fix internal documentation links and labels for the TiDB Cloud Lake docs. Updated TOC entries to shorter titles, fixed several relative links to point to /tidb-cloud-lake paths (load-from-local-file, load-from-remote-file, and file format references), cleaned up minor whitespace in the input-output-file-formats doc, and added new variables (lake, lake-short) in variables.json.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

area/tidb-cloud This PR relates to the area of TiDB Cloud. do-not-merge/work-in-progress Indicates that a PR should not merge because it is a work in progress. size/XXL Denotes a PR that changes 1000+ lines, ignoring generated files. translation/doing This PR's assignee is translating this PR.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant