[ 
https://issues.apache.org/jira/browse/HUDI-9561?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Lin Liu updated HUDI-9561:
--------------------------
    Description: 
We tested schema evolution support in the file group (FG) reader with three test cases:

CASE1: Two base files have different schemas.

CASE2: One base file with multiple log files, where the log files have 
different schemas.

CASE3: One base file and one log file have different schemas.

These tests exposed failures in the following scenarios:
 # Adding a nested column for Hive: CASE1, CASE2
 # Adding a nested column for Java: CASE2
 # Type upgrade from int to long for Hive: CASE2
 # All test cases failed for Flink, probably due to bugs in the test itself.

The PR for running tests: [https://github.com/apache/hudi/pull/13443]
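
For readers unfamiliar with the two evolution kinds named above (adding a nested column, and widening int to long), here is a minimal sketch in plain Python. It is only an illustration under stated assumptions: the dict-based schema model and the names diff_schemas, project, and SAFE_PROMOTIONS are hypothetical and are not Hudi or Avro APIs, and it does not reproduce the failing tests.

```python
# Hypothetical, simplified schema model: a dict mapping a field name to either
# a primitive type name ("int", "long", "string") or a nested dict (a struct).
# This is NOT Hudi's or Avro's API; it only illustrates the evolution kinds.

# Widening promotions treated as safe in this sketch (an assumption).
SAFE_PROMOTIONS = {("int", "long")}


def diff_schemas(old, new, prefix=""):
    """Return (added_columns, promoted_columns) when moving from old to new."""
    added, promoted = [], []
    for name, new_type in new.items():
        path = prefix + name
        if name not in old:
            # Column exists only in the newer schema, possibly inside a struct.
            added.append(path)
            continue
        old_type = old[name]
        if isinstance(old_type, dict) and isinstance(new_type, dict):
            # Both sides are structs: recurse to find nested changes.
            sub_added, sub_promoted = diff_schemas(old_type, new_type, path + ".")
            added.extend(sub_added)
            promoted.extend(sub_promoted)
        elif old_type != new_type:
            struct_change = isinstance(old_type, dict) or isinstance(new_type, dict)
            if struct_change or (old_type, new_type) not in SAFE_PROMOTIONS:
                raise ValueError(f"unsupported change at {path}: {old_type} -> {new_type}")
            promoted.append(path)
    return added, promoted


def project(record, schema):
    """Rewrite a record written with an older schema into the reader schema,
    filling columns the writer did not know about with None."""
    out = {}
    for name, field_type in schema.items():
        value = record.get(name)
        if isinstance(field_type, dict):
            out[name] = None if value is None else project(value, field_type)
        else:
            # Python ints are unbounded, so int -> long needs no conversion here.
            out[name] = value
    return out


# Older file schema versus newer file schema, as in CASE1 or CASE3.
old_schema = {"id": "int", "address": {"city": "string"}}
new_schema = {"id": "long", "address": {"city": "string", "zip": "string"}}

changes = diff_schemas(old_schema, new_schema)
# changes == (["address.zip"], ["id"])

row = project({"id": 7, "address": {"city": "Oslo"}}, new_schema)
# row == {"id": 7, "address": {"city": "Oslo", "zip": None}}
```

In this sketch the reader always projects older records onto the newest schema, so a nested column that is missing from an older file reads back as None rather than causing a failure.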

  was:
We tested schema evolution support from FG reader through three tests:

CASE1: Two base files have different schemas.

CASE2: One base file with multiple log files, and log files have different 
schemas.

CASE3: One base file and one log file have different schemas.

Using the above tests, we found that there are some failures for:
 # Adding nested column for Hive: CASE1, CASE2 
 # Adding nested column for Java: CASE2
 # Type upgrade from int to long Hive: CASE2



The PR for running tests: https://github.com/apache/hudi/pull/13443


> Fix schema evolution gaps for FG reader
> ---------------------------------------
>
>                 Key: HUDI-9561
>                 URL: https://issues.apache.org/jira/browse/HUDI-9561
>             Project: Apache Hudi
>          Issue Type: Improvement
>            Reporter: Lin Liu
>            Assignee: Lokesh Jain
>            Priority: Major
>             Fix For: 1.1.0
>
>
> We tested schema evolution support in the file group (FG) reader with three test cases:
> CASE1: Two base files have different schemas.
> CASE2: One base file with multiple log files, where the log files have 
> different schemas.
> CASE3: One base file and one log file have different schemas.
> These tests exposed failures in the following scenarios:
>  # Adding a nested column for Hive: CASE1, CASE2
>  # Adding a nested column for Java: CASE2
>  # Type upgrade from int to long for Hive: CASE2
>  # All test cases failed for Flink, probably due to bugs in the test itself.
> The PR for running tests: [https://github.com/apache/hudi/pull/13443]



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
