[ 
https://issues.apache.org/jira/browse/HUDI-1484?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated HUDI-1484:
---------------------------------
    Labels: pull-request-available  (was: )

> Escape the partition value in HiveSyncTool
> ------------------------------------------
>
>                 Key: HUDI-1484
>                 URL: https://issues.apache.org/jira/browse/HUDI-1484
>             Project: Apache Hudi
>          Issue Type: Bug
>          Components: Writer Core
>            Reporter: pengzhiwei
>            Assignee: pengzhiwei
>            Priority: Major
>              Labels: pull-request-available
>             Fix For: 0.7.0
>
>
> Currently, Hudi URL-encodes the partition value when
> URL_ENCODE_PARTITIONING_OPT_KEY is set to true. However, HiveSyncTool does
> not decode the partition value when syncing partitions to Hive, so Hive
> encodes the already-encoded value a second time. This leads to an exception
> when querying with Hive SQL or Spark SQL.
> For example, the partition *"2020/12/20"* is encoded as *"2020%2F12%2F20"*
> by Hudi. When HiveSyncTool syncs *"2020%2F12%2F20"* to Hive, Hive encodes it
> again as *"2020%252F12%252F20"*. As a result, a query such as
> "select xx from tbl where dt = '2020/12/20'" fails with an exception.
>  
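
The double encoding described above can be reproduced with a minimal sketch like the one below. It is only an illustration: the class PartitionEscapeDemo and its helpers are hypothetical and not part of Hudi or Hive, and java.net.URLEncoder is used as a stand-in for both Hudi's URL encoding and Hive's own partition escaping (an assumption; Hive uses its own escaping routine). The last step shows the direction the issue title suggests, decoding the value before it is handed to Hive.

```java
import java.io.UnsupportedEncodingException;
import java.net.URLDecoder;
import java.net.URLEncoder;

// Hypothetical demo class; not part of Hudi or Hive.
class PartitionEscapeDemo {

    // Stand-in for the encoding applied to a partition value (assumption:
    // plain URL encoding, which turns '/' into %2F and '%' into %25).
    static String encode(String value) {
        try {
            return URLEncoder.encode(value, "UTF-8");
        } catch (UnsupportedEncodingException e) {
            throw new IllegalStateException(e); // UTF-8 is always available
        }
    }

    // Reverses one level of URL encoding.
    static String decode(String value) {
        try {
            return URLDecoder.decode(value, "UTF-8");
        } catch (UnsupportedEncodingException e) {
            throw new IllegalStateException(e);
        }
    }

    public static void main(String[] args) {
        String original = "2020/12/20";

        // Step 1: the writer encodes the partition value once.
        String storedByHudi = encode(original);

        // Step 2 (the bug): the encoded value is passed on as-is and gets
        // encoded a second time.
        String seenByHive = encode(storedByHudi);

        // Step 3 (the idea of the fix): decode first, then let the second
        // layer encode the original value exactly once.
        String afterFix = encode(decode(storedByHudi));

        System.out.println("encoded once:  " + storedByHudi);
        System.out.println("encoded twice: " + seenByHive);
        System.out.println("decode first:  " + afterFix);
    }
}
```

With the value decoded before the sync, the partition registered in Hive corresponds to the original "2020/12/20", so a predicate such as dt = '2020/12/20' can match it.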



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
