Teach Datafusion to project only accessed struct leaves in row filter pushdown #20854

Open

friendlymatthew wants to merge 3 commits into apache:main from pydantic:friendlymatthew/struct-leaf-projection

Conversation

@friendlymatthew (Contributor):

Which issue does this PR close?

Rationale for this change

This PR refines how the FilterCandidateBuilder projects struct columns during Parquet row filter pushdown.

Previously, a filter like s['value'] > 10 would cause the reader to decode all leaf columns of a struct s, because PushdownChecker only tracked the root column index and expanded it to every leaf. This wastes I/O and decode time on fields the filter never touches.

Now, the builder resolves only the matching Parquet leaf columns. It does this by building a pruned filter schema that reflects exactly what the Parquet reader produces when projecting a subset of struct leaves, ensuring the expression evaluates against the correct types.
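The old vs. new behavior can be sketched with a toy leaf table (names and shapes are illustrative, not DataFusion's actual types):

```rust
// Toy model of leaf selection. The schema has a regular column `id`
// (leaf 0) and a struct `s` with leaves `s.value` (1) and `s.extra` (2).
const LEAVES: &[(&str, &str)] = &[
    ("id", "id"),
    ("s", "s.value"),
    ("s", "s.extra"),
];

// Old behavior: expand a root column to every leaf beneath it.
fn leaves_for_root(root: &str) -> Vec<usize> {
    LEAVES
        .iter()
        .enumerate()
        .filter(|(_, (r, _))| *r == root)
        .map(|(i, _)| i)
        .collect()
}

// New behavior: resolve only the leaf matching the accessed field path.
fn leaves_for_path(path: &str) -> Vec<usize> {
    LEAVES
        .iter()
        .enumerate()
        .filter(|(_, (_, p))| *p == path)
        .map(|(i, _)| i)
        .collect()
}

fn main() {
    // A filter on s['value'] used to decode both of s's leaves...
    assert_eq!(leaves_for_root("s"), vec![1, 2]);
    // ...but it only needs one of them.
    assert_eq!(leaves_for_path("s.value"), vec![1]);
}
```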

@github-actions github-actions bot added the datasource Changes to the datasource crate label Mar 10, 2026
@friendlymatthew force-pushed the friendlymatthew/struct-leaf-projection branch from d24acc3 to 15b2586 on March 10, 2026 18:41
@friendlymatthew left a comment:

self review

};

self.required_columns.push(idx);
self.struct_field_accesses.push(StructFieldAccess {
Contributor Author:

Struct field accesses are tracked in a separate vec rather than being pushed into required_columns.

This is intentional, since required_columns feeds into leaf_indices_for_roots, which expands a root index to all its leaves. By keeping the struct accesses separate, we can resolve them to only the specific leaves needed via resolve_struct_field_leaves.

schema_descr,
);
leaf_indices.extend_from_slice(&struct_leaf_indices);
leaf_indices.sort_unstable();
Contributor:

Will this cause errors if the sorting is different from the root_indices?

Contributor Author:

To my knowledge, no.

leaf_indices is only used to build a ProjectionMask::leaves:

// Use leaf indices: when nested columns are involved, we must specify
// leaf (primitive) column indices in the Parquet schema so the decoder
// can properly project and filter nested structures.
projection_mask: ProjectionMask::leaves(
metadata.file_metadata().schema_descr(),
candidate.projection.leaf_indices.iter().copied(),

ProjectionMask does not care about order. It builds a boolean mask of size vec![false; num_columns] and sets entries via mask[leaf_idx] = true.

Other call sites that use leaf_indices don't depend on order either.
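A minimal sketch of that boolean-mask behavior (modeled on the description above, not ProjectionMask's real implementation):

```rust
// Build a boolean mask over all leaf columns; setting an entry is
// idempotent, so the order of `leaf_indices` never matters.
fn build_mask(num_columns: usize, leaf_indices: &[usize]) -> Vec<bool> {
    let mut mask = vec![false; num_columns];
    for &idx in leaf_indices {
        mask[idx] = true;
    }
    mask
}

fn main() {
    // Different orderings of the same leaves produce the same mask.
    assert_eq!(build_mask(4, &[2, 0]), build_mask(4, &[0, 2]));
    assert_eq!(build_mask(4, &[0, 2]), vec![true, false, true, false]);
}
```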

Contributor:

OK, then is that change needed? I.e., the sort and dedup?

Contributor Author:

leaf_indices and root_indices serve different purposes. Leaf indices become the ProjectionMask, telling the Parquet decoder which physical leaf columns to read from disk. Root indices (plus struct field accesses) become the filter schema, telling Arrow what schema to use when reconstructing the record batch.

Arrow just takes whatever decoded leaves are available and assembles them into the schema it was given. So suppose you had leaf_indices = [2] with root_indices = [1]. The mask says "decode leaf 2" and the schema says "give me struct column 1, pruned to just this specific field".

Contributor Author:

sort and dedup are necessary since we concatenate leaf_indices_for_roots and resolve_struct_field_leaves. The first iterates Parquet leaves 0..N, collecting those belonging to regular (non-struct) columns; the second does the same for struct field accesses. Both produce individually sorted output, but when a struct column appears before a regular column in the schema, the struct's leaf indices are numerically lower.

Dedup is needed because the same struct field can appear multiple times in a filter expression like get_field(s, 'val') > 5 AND get_field(s, 'val') < 100, producing duplicate entries in struct_field_accesses. Without dedup, we'd double-count the compressed size of that column.
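A sketch of merging the two leaf lists (index values are illustrative; the real ones come from the Parquet schema descriptor):

```rust
// Concatenate the regular-column leaves with the struct-field leaves,
// then sort and dedup the combined list.
fn merge_leaf_indices(regular: &[usize], struct_leaves: &[usize]) -> Vec<usize> {
    let mut all: Vec<usize> = regular.iter().chain(struct_leaves).copied().collect();
    // A struct earlier in the schema has numerically lower leaves than a
    // regular column after it, so the concatenation may be unsorted.
    all.sort_unstable();
    // The same field referenced twice in one filter yields duplicates.
    all.dedup();
    all
}

fn main() {
    // Struct `s` holds leaves 0..=1, regular column `x` is leaf 2, and
    // `s.val` (leaf 0) is referenced twice in the filter expression.
    assert_eq!(merge_leaf_indices(&[2], &[0, 0]), vec![0, 2]);
}
```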

Comment on lines +351 to +352
self.projected_columns = true;
return Some(TreeNodeRecursion::Jump);
Contributor:

I know this is pre-existing, but what is this code path supposed to catch? Under what circumstances would there be a column that doesn't exist in the file schema at this point, and why is it a "projected" column?

Contributor Author:

This can happen during schema evolution (a column that was added after the file was written) or with partition columns that are projected onto the scan but don't exist physically on disk.

In either case, we can't push the filter down because the decoder has no data for this column.

I'll add a comment explaining this or do it in a follow-up PR.

Contributor:

I don't think that's true. We resolve partition columns to literals before we reach this point. And any missing columns would have also been replaced with null literals.

Comment on lines +523 to +524
/// Field names forming the path into the struct.
/// e.g., `["value"]` for `s['value']`, `["outer", "inner"]` for `s['outer']['inner']`.
Contributor:

I assume we don't support stuff like array_has_any(get_field(s, 'items'), 5)?

Contributor Author:

We do support it! Here's a test that repros: 44a02f3

Comment on lines +570 to +574
fn resolve_struct_field_leaves(
accesses: &[StructFieldAccess],
file_schema: &Schema,
schema_descr: &SchemaDescriptor,
) -> Vec<usize> {
Contributor:

Let's make sure we share this logic for projections.

More generally: there should be a single place where there is a function along the lines of:

fn build_parquet_read_plan(expr: &Arc<dyn PhysicalExpr>) -> ParquetReadPlan {
  ...
}

struct ParquetReadPlan {
   // leaf projections
   projection_mask: ProjectionMask,
   // the schema to read back with
   schema: SchemaRef,
   // the transformed expression (do we need this?)
   expr: Arc<dyn PhysicalExpr>
}

Contributor Author:

I think this makes a lot of sense. I've charted a big-picture idea of what this refactor will look like: #20913

/// filter expression.
///
/// For regular (non-struct) columns, the full field type is used.
/// For struct columns accessed via `get_field`, a pruned struct type is created
Contributor:

Instead of a pruned struct type, why not transform the expression from `get_field(s, 'f')` -> `Alias(Column("s.f", 123), "get_field(s, 'f')")` or something like that? Then we don't need to manipulate the data (assemble a struct with pruned fields)

Contributor Author:

The constraint is what Arrow's Parquet reader produces. ProjectionMask::leaves still returns a nested StructArray with only the projected fields, not flat top-level columns. So Column("s.f") wouldn't match the reader output without an extra flattening step in ArrowPredicate::evaluate.

The pruned struct approach keeps the schema in sync with what the reader naturally returns. We'd need to invent a way to get flat columns from the Arrow Parquet reader to make that work.

Contributor:

Okay interesting. I would have thought that ProjectionMask::leaves returned a flat column, not a nested struct. Thanks for explaining 😄

Development

Successfully merging this pull request may close these issues.

Read single struct fields in ParquetOpener

3 participants