Prashant Wason created HUDI-1611:
------------------------------------
Summary: Allow directories to be filtered during the bootstrap of
the metadata table
Key: HUDI-1611
URL: https://issues.apache.org/jira/browse/HUDI-1611
Project: Apache Hudi
Issue Type: Sub-task
Reporter: Prashant Wason
Assignee: Prashant Wason
During the bootstrap of the Metadata Table, all the directories which contain
the partition metadata directory are assumed to be partitions and are added to
the metadata table.
In our HDFS clusters, we have directories like .backup, .temp which are used by
various teams for non-hoodie purposes (e.g. .backup may be keeping a snapshot
of the dataset). During bootstrap, Metadata Table ends up containing all those
paths also as partitions.
In this patch, I would like to introduce a configuration for
HoodieMetadataConfig to filter out some directories based on a regular
expression string.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)