[ 
https://issues.apache.org/jira/browse/HUDI-1611?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gary Li resolved HUDI-1611.
---------------------------
    Resolution: Resolved

> Allow directories to be filtered during the bootstrap of the metadata table
> ---------------------------------------------------------------------------
>
>                 Key: HUDI-1611
>                 URL: https://issues.apache.org/jira/browse/HUDI-1611
>             Project: Apache Hudi
>          Issue Type: Sub-task
>            Reporter: Prashant Wason
>            Assignee: Prashant Wason
>            Priority: Major
>              Labels: pull-request-available
>             Fix For: 0.8.0
>
>
> During the bootstrap of the Metadata Table, every directory that contains 
> the partition metadata directory is assumed to be a partition and is added 
> to the metadata table.
> In our HDFS clusters, we have directories such as .backup and .temp which 
> various teams use for non-hoodie purposes (e.g. .backup may hold a 
> snapshot of the dataset). During bootstrap, the Metadata Table ends up 
> listing all of those paths as partitions as well.
> In this patch, I would like to introduce a configuration option in 
> HoodieMetadataConfig to filter out such directories based on a regular 
> expression string. 
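
A minimal sketch of the filtering idea described above, not the code from the actual patch. The class name DirectoryFilterSketch, the method filterDirectories, and the example regex are all hypothetical; the sketch uses only java.util.regex and assumes the filter is matched against the whole directory name.

```java
import java.util.Arrays;
import java.util.List;
import java.util.regex.Pattern;
import java.util.stream.Collectors;

public class DirectoryFilterSketch {

    // Hypothetical helper: returns the directory names that do NOT match
    // the filter regex. A null or empty regex disables filtering.
    static List<String> filterDirectories(List<String> dirNames, String filterRegex) {
        if (filterRegex == null || filterRegex.isEmpty()) {
            return dirNames;
        }
        Pattern pattern = Pattern.compile(filterRegex);
        return dirNames.stream()
            // matches() requires the whole name to match the pattern
            .filter(name -> !pattern.matcher(name).matches())
            .collect(Collectors.toList());
    }

    public static void main(String[] args) {
        // Assumed layout: two real partitions plus the non-hoodie
        // directories mentioned in the issue (.backup, .temp).
        List<String> candidates =
            Arrays.asList("2021-02-10", ".backup", ".temp", "2021-02-11");

        // Example regex (an assumption): any name starting with a dot.
        List<String> partitions = filterDirectories(candidates, "\\..*");

        System.out.println(partitions); // prints [2021-02-10, 2021-02-11]
    }
}
```

Matching on the directory name rather than the full path keeps the regex short, but a real implementation may well need to match against relative paths instead; that choice is left open here.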



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
