[
https://issues.apache.org/jira/browse/HUDI-110?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17114980#comment-17114980
]
Yanjia Gary Li commented on HUDI-110:
-------------------------------------
[~shivnarayan] no, the PR is not related to this ticket.
This ticket is more like exploring a new feature and will take some time. We
can remove the bug-bash tag.
> Better defaults for Partition extractor for Spark DataSOurce and DeltaStreamer
> ------------------------------------------------------------------------------
>
> Key: HUDI-110
> URL: https://issues.apache.org/jira/browse/HUDI-110
> Project: Apache Hudi
> Issue Type: Improvement
> Components: DeltaStreamer, Spark Integration, Usability
> Reporter: Balaji Varadarajan
> Assignee: Yanjia Gary Li
> Priority: Minor
> Labels: bug-bash-0.6.0, pull-request-available
>
> Currently
> SlashEncodedDayPartitionValueExtractor is the default being used. This is not
> a common format outside Uber.
>
> Also, Spark DataSource provides partitionedBy clauses which has not been
> integrated for Hudi Data Source. We need to investigate how we can leverage
> partitionBy clause for partitioning.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)