[jira] [Assigned] (HUDI-1301) use spark INCREMENTAL mode query hudi dataset support schema version

Vinoth Chandar (Jira) Wed, 20 Jan 2021 21:42:05 -0800


     [ 
https://issues.apache.org/jira/browse/HUDI-1301?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]


Vinoth Chandar reassigned HUDI-1301:
------------------------------------

    Assignee: Vinoth Chandar  (was: liwei)

> use spark INCREMENTAL mode query hudi  dataset support schema version
> ---------------------------------------------------------------------
>
>                 Key: HUDI-1301
>                 URL: https://issues.apache.org/jira/browse/HUDI-1301
>             Project: Apache Hudi
>          Issue Type: Bug
>          Components: Spark Integration
>            Reporter: liwei
>            Assignee: Vinoth Chandar
>            Priority: Major
>              Labels: pull-request-available
>             Fix For: 0.7.0
>
>
> 一、issue
> 1、 at write hand ,write two commit , second commit add a column such as:
> commit1 schema and data
> id , name 
> 1, lisi
>  
> commit2  schema and data
> id, name , age
> 2, zhangsan, 18
>  
> 2、at read hand,
> read the latest commit return
> id, name , age
> 1, lisi, null
> 2, zhangsan, 18
>  
> read the first commit by set  END_INSTANTTIME_OPT_KEY to first commit, will 
> return 
> id, name , age
> 1, lisi, null
>  
> 二、solution
> we can see that read the first commit alse return "age" column. i think if   
> set  END_INSTANTTIME_OPT_KEY to first commit,  both schema and data should 
> with that commit.
>  more clearness should return 
> id, name 
> 1, lisi
>  
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

[jira] [Assigned] (HUDI-1301) use spark INCREMENTAL mode query hudi dataset support schema version

Reply via email to