[
https://issues.apache.org/jira/browse/HUDI-1484?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
ASF GitHub Bot updated HUDI-1484:
---------------------------------
Labels: pull-request-available (was: )
> Escape the partition value in HiveSyncTool
> -------------------------------------------
>
> Key: HUDI-1484
> URL: https://issues.apache.org/jira/browse/HUDI-1484
> Project: Apache Hudi
> Issue Type: Bug
> Components: Writer Core
> Reporter: pengzhiwei
> Assignee: pengzhiwei
> Priority: Major
> Labels: pull-request-available
> Fix For: 0.7.0
>
>
> Currently Hudi will encode the partition value when
> URL_ENCODE_PARTITIONING_OPT_KEY set true. However the HiveSyncTool has not
> decode the partition value when sync partition to hive. And Hive will encode
> the partition value twice which lead to an exception when query with hive sql
> or spark sql.
> For example the partition *"2020/12/20*" will encode to *"2020%2F12%2F20"* by
> Hudi. When HiveSyncTool sync the *"2020%2F12%2F20"* to hive, Hive will encode
> it to "
> *"2020%252F12%252F20".* This will result to query exception for "select xx
> from tbl where dt = '2020/12/20'
>
--
This message was sent by Atlassian Jira
(v8.3.4#803005)