[
https://issues.apache.org/jira/browse/HUDI-7484?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17827385#comment-17827385
]
Vova Kolmakov commented on HUDI-7484:
-------------------------------------
Hi! [~codope] Can you provide the full code sample?
This does not look like a bug in inferring the partition from
partitionBy(). The values of the partition field are "2015/03/15" and
"2015/03/16", so each slash in a value acts as a path separator. If the data
in the partitioning column may contain special characters, then
URL_ENCODE_PARTITIONING (false by default) should be enabled when writing,
which results in a directory structure like this:
partition=2015%2F03%2F15
partition=2015%2F03%2F16
Partitioning by multiple fields (trip_type, rider) whose values contain no
slashes works correctly, producing directories like:
trip_type=BLACK
|- rider=rider-000
trip_type=UBERX
|- rider=rider-000
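
The effect described above can be reproduced outside Hudi with a minimal Python sketch. This is not Hudi's implementation: `partition_path` and `directory_levels` are hypothetical helpers, and `urllib.parse.quote` stands in for whatever escaping Hudi applies when URL_ENCODE_PARTITIONING is enabled. It only illustrates why an unencoded slash in a partition value produces nested directories while an encoded one does not.

```python
from urllib.parse import quote


def partition_path(fields, url_encode=False):
    """Build a hive-style relative partition path from (name, value) pairs.

    Hypothetical helper for illustration only; not a Hudi API.
    """
    parts = []
    for name, value in fields:
        value = str(value)
        if url_encode:
            # Percent-encode every reserved character, including "/".
            value = quote(value, safe="")
        parts.append(f"{name}={value}")
    return "/".join(parts)


def directory_levels(path):
    """Split a relative path into the directory levels it would create."""
    return path.split("/")


raw = partition_path([("partition", "2015/03/15")])
encoded = partition_path([("partition", "2015/03/15")], url_encode=True)
multi = partition_path([("trip_type", "BLACK"), ("rider", "rider-000")])

print(directory_levels(raw))      # ['partition=2015', '03', '15']
print(directory_levels(encoded))  # ['partition=2015%2F03%2F15']
print(directory_levels(multi))    # ['trip_type=BLACK', 'rider=rider-000']
```

The first case matches the mixed-style layout in the issue description, the second matches the encoded layout, and the third matches the multi-field layout.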
> Fix partitioning style when partition is inferred from partitionBy
> ------------------------------------------------------------------
>
> Key: HUDI-7484
> URL: https://issues.apache.org/jira/browse/HUDI-7484
> Project: Apache Hudi
> Issue Type: Task
> Reporter: Sagar Sumit
> Priority: Major
> Fix For: 1.0.0
>
>
> When inferring the partition from partitionBy() arguments with hive-style
> partitioning enabled, we observe that the partitioning style is not
> uniform for multi-level partitions. The directory structure is as follows:
> partition=2015
> |- 03
>    |- 15
>    |- 16