Skip to content

[VL] Read Parquet file generated by parquet-thrift failed #11865

@zhouyuan

Description

@zhouyuan

Backend

VL (Velox)

Bug description

https://github.com/apache/gluten/actions/runs/23858922685/job/69563146517?pr=11860

2026-04-01T17:52:37.3856475Z Error Source: RUNTIME
2026-04-01T17:52:37.3856711Z Error Code: INVALID_STATE
2026-04-01T17:52:37.3857455Z Reason: Converted type VARCHAR is not allowed for requested type ROW<nestedintscolumn:ARRAY<INTEGER>,nestedstringcolumn:VARCHAR>
2026-04-01T17:52:37.3858333Z Retriable: False
2026-04-01T17:52:37.3859258Z Expression: !requestedType || isCompatible( requestedType, isRepeated, [](const TypePtr& type) { return type->kind() == TypeKind::VARCHAR; })
2026-04-01T17:52:37.3860821Z Context: Split Hive: file:///opt/shims/spark34/spark_home/sql/core/src/test/resources/test-data/parquet-thrift-compat.snappy.parquet 0 - 10550 Task Gluten_Stage_4_TID_4_VTID_73783
2026-04-01T17:52:37.3861558Z Function: convertType
2026-04-01T17:52:37.3861924Z File: /work/ep/build-velox/build/velox_ep/velox/dwio/parquet/reader/ParquetReader.cpp

Gluten version

No response

Spark version

None

Spark configurations

No response

System information

No response

Relevant logs

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugSomething isn't workingtriage

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions