[ 
https://issues.apache.org/jira/browse/HUDI-7484?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17827385#comment-17827385
 ] 

Vova Kolmakov commented on HUDI-7484:
-------------------------------------

Hi! [~codope] Can you provide the full code sample?

This does not look like a bug in inferring the partition from 
partitionBy(). The values of the partition field are simply "2015/03/15" and 
"2015/03/16", so the slashes in them are treated as path separators. If data 
in the partitioning column may contain special characters, 
URL_ENCODE_PARTITIONING (false by default) should be enabled when writing, 
which results in a directory structure like this:

partition=2015%2F03%2F15

partition=2015%2F03%2F16
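
To illustrate the difference, here is a minimal sketch in plain Python. It is 
not Hudi's implementation: hive_style_partition_path and directory_levels are 
hypothetical helpers, and urllib's percent-encoding only stands in for 
whatever escaping Hudi applies, which may treat other characters differently.

```python
from urllib.parse import quote


def hive_style_partition_path(field, value, url_encode=False):
    """Build a hive-style 'field=value' partition path (hypothetical helper)."""
    if url_encode:
        # Percent-encode every reserved character, including '/'.
        value = quote(value, safe="")
    return f"{field}={value}"


def directory_levels(path):
    # A file system treats each '/' as a directory separator.
    return path.split("/")


plain = hive_style_partition_path("partition", "2015/03/15")
encoded = hive_style_partition_path("partition", "2015/03/15", url_encode=True)

print(directory_levels(plain))    # three nested directory levels
print(directory_levels(encoded))  # a single directory level
```

Without encoding, the value splits into three directory levels and only the 
first one carries the "partition=" prefix. With encoding, the whole value 
stays in one directory name.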

 

Partitioning by multiple fields (trip_type, rider) whose values contain no 
slashes works correctly, producing directories like:

trip_type=BLACK

         |- rider=rider-000

trip_type=UBERX

         |- rider=rider-000
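
The expected layout for several partition fields can be sketched as follows. 
This is only an illustration in plain Python, not Hudi code: 
multi_level_partition_path is a hypothetical helper and the record is made-up 
sample data matching the directories above.

```python
def multi_level_partition_path(record, partition_fields):
    """Join one hive-style 'field=value' segment per partition field."""
    return "/".join(f"{field}={record[field]}" for field in partition_fields)


# Sample record; the values contain no slashes.
record = {"trip_type": "BLACK", "rider": "rider-000"}
path = multi_level_partition_path(record, ["trip_type", "rider"])
print(path)
```

Each field contributes exactly one directory level, and every level keeps its 
"field=" prefix.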

 

> Fix partitioning style when partition is inferred from partitionBy
> ------------------------------------------------------------------
>
>                 Key: HUDI-7484
>                 URL: https://issues.apache.org/jira/browse/HUDI-7484
>             Project: Apache Hudi
>          Issue Type: Task
>            Reporter: Sagar Sumit
>            Priority: Major
>             Fix For: 1.0.0
>
>
> When inferring the partition from partitionBy() arguments with hive-style 
> partitioning enabled, we observe that the partitioning style is not 
> uniform for multi-level partitions. The directory structure is as follows:
> partition=2015
>                        |- 03
>                              |- 15
>                              |- 16



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
