duckdb · szarnyasg · Sep 9, 2024 · Mar 26, 2024 · Sep 9, 2024
diff --git a/docs/data/parquet/tips.md b/docs/data/parquet/tips.md
@@ -51,4 +51,18 @@ COPY
     (FORMAT PARQUET, ROW_GROUP_SIZE 100_000);
 ```
 
-See the [Performance Guide on file formats]({% link docs/guides/performance/file_formats.md %}#parquet-file-sizes) for more tips.
+### The `ROW_GROUPS_PER_FILE` Option
+
+The `ROW_GROUPS_PER_FILE` parameter creates a new Parquet file if the current one has a specified number of row groups.
+
+```sql
+COPY
+    (FROM generate_series(100_000))
+    TO 'output-directory'
+    (FORMAT PARQUET, ROW_GROUP_SIZE 20_000, ROW_GROUPS_PER_FILE 2);
+```
+
+> If multiple threads are active, the number of row groups in a file may slightly exceed the specified number of row groups to limit the amount of locking – similarly to the behaviour of [`FILE_SIZE_BYTES`](../../sql/statements/copy#copy--to-options).
+> However, if `PER_THREAD_OUTPUT` is set, only one thread writes to each file, and it becomes accurate again.
+
+See the [Performance Guide on “File Formats”]({% link docs/guides/performance/file_formats.md %}#parquet-file-sizes) for more tips.
diff --git a/docs/sql/statements/copy.md b/docs/sql/statements/copy.md
@@ -273,7 +273,7 @@ The below options are applicable when writing `Parquet` files.
 | `FIELD_IDS` | The `field_id` for each column. Pass `auto` to attempt to infer automatically. | `STRUCT` | (empty) |
 | `ROW_GROUP_SIZE_BYTES` | The target size of each row group. You can pass either a human-readable string, e.g., `2MB`, or an integer, i.e., the number of bytes. This option is only used when you have issued `SET preserve_insertion_order = false;`, otherwise, it is ignored. | `BIGINT` | `row_group_size * 1024` |
 | `ROW_GROUP_SIZE` | The target size, i.e., number of rows, of each row group. | `BIGINT` | 122880 |
-
+| `ROW_GROUPS_PER_FILE` | Create a new Parquet file if the current one has a specified number of row groups. If multiple threads are active, the number of row groups in a file may slightly exceed the specified number of row groups to limit the amount of locking – similarly to the behaviour of `FILE_SIZE_BYTES`. However, if `per_thread_output` is set, only one thread writes to each file, and it becomes accurate again. | `BIGINT` |  (empty) |
 Some examples of `FIELD_IDS` are:
 
 Assign `field_ids` automatically: