[ 
https://issues.apache.org/jira/browse/IMPALA-13364?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Zoltán Borók-Nagy resolved IMPALA-13364.
----------------------------------------
    Fix Version/s: Impala 4.5.0
       Resolution: Fixed

> Schema resolution doesn't work for migrated partitioned Iceberg tables that 
> have complex types
> ----------------------------------------------------------------------------------------------
>
>                 Key: IMPALA-13364
>                 URL: https://issues.apache.org/jira/browse/IMPALA-13364
>             Project: IMPALA
>          Issue Type: Bug
>            Reporter: Zoltán Borók-Nagy
>            Assignee: Zoltán Borók-Nagy
>            Priority: Major
>              Labels: impala-iceberg
>             Fix For: Impala 4.5.0
>
>
> Schema resolution doesn't work correctly for migrated partitioned Iceberg 
> tables that have complex types.
> When we encounter a Parquet/ORC file in an Iceberg table that doesn't have 
> field IDs in the file metadata, we assume that it is an old data file written 
> before migration and that its schema is the very first one. Hence we can 
> mimic Iceberg's field ID generation to assign field IDs to the file schema 
> elements.
> This process didn't take the partition columns into account. This only 
> matters when there are complex types in the table, as partition columns are 
> always the last columns in legacy Hive tables, and field IDs are assigned via 
> a "BFS-like" traversal. I.e. if there are only primitive types in the table 
> we don't have any problems, but the children of complex-type columns are 
> assigned incorrect field IDs.
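
To illustrate the off-by-N effect described above, here is a minimal sketch in Python. It is not Impala's or Iceberg's code: the schema model (a list of (name, type) pairs, where a nested list stands for a struct), the function assign_field_ids, and the num_partition_cols parameter are all hypothetical. It assumes IDs are handed out to all fields of one level first and to their children afterwards, which is the "BFS-like" order the description refers to.

```python
def assign_field_ids(columns, num_partition_cols=0):
    """Assign IDs to all fields of a level first, then descend into children.

    num_partition_cols is a hypothetical knob: the number of IDs to skip after
    the top-level columns, standing in for partition columns that exist in the
    table schema but not in the data file.
    """
    ids = {}
    next_id = 1

    def visit(fields, prefix, reserve=0):
        nonlocal next_id
        # First pass: every field on this level gets an ID.
        for name, _ in fields:
            ids[prefix + name] = next_id
            next_id += 1
        # Skip the IDs that belong to columns missing from the file.
        next_id += reserve
        # Second pass: descend into struct children.
        for name, typ in fields:
            if isinstance(typ, list):
                visit(typ, prefix + name + ".")

    visit(columns, "", reserve=num_partition_cols)
    return ids


# Data file schema: partition column "p" is not stored in the file.
file_schema = [("id", "int"), ("s", [("a", "int"), ("b", "string")])]
# Table schema after migration: partition column comes last.
table_schema = file_schema + [("p", "int")]

table_ids = assign_field_ids(table_schema)
naive_ids = assign_field_ids(file_schema)
adjusted_ids = assign_field_ids(file_schema, num_partition_cols=1)
```

In this toy model the table schema gives s.a the ID 4, because p takes ID 3 before any nested field is visited. The naive pass over the file schema gives s.a the ID 3, so it collides with p, while reserving one ID for the partition column brings the nested IDs back in line. With only primitive columns there are no nested fields after the partition columns, so the two passes agree.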



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
