[jira] [Commented] (HUDI-55) Investigate support for bucketed tables ala Hive #74

Nishith Agarwal (Jira) Mon, 23 Nov 2020 17:32:34 -0800


    [ 
https://issues.apache.org/jira/browse/HUDI-55?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17237777#comment-17237777
 ]


Nishith Agarwal commented on HUDI-55:
-------------------------------------

*Hive has the bucketBy feature and spark is going to add support for HIVE style 
bucketBy support for datasources and once it’s implemented - its going to 
benefit largely on the read performance. So as HUDI is having different path 
while writing parquet data, are we planning to add bucketBy functionality? 
Seems Spark is adding features on writers to be benefitted for better read 
performance, so having a different writer for HUDI, are keeping track on these 
new features happening on Spark, therefor*

> Investigate support for bucketed tables ala Hive #74
> ----------------------------------------------------
>
>                 Key: HUDI-55
>                 URL: https://issues.apache.org/jira/browse/HUDI-55
>             Project: Apache Hudi
>          Issue Type: New Feature
>          Components: Hive Integration
>            Reporter: Vinoth Chandar
>            Priority: Major
>
> https://github.com/uber/hudi/issues/74



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

[jira] [Commented] (HUDI-55) Investigate support for bucketed tables ala Hive #74

Reply via email to