zhedoubushishi opened a new pull request #1870: URL: https://github.com/apache/hudi/pull/1870
## *Tips*
- *Thank you very much for contributing to Apache Hudi.*
- *Please review https://hudi.apache.org/contributing.html before opening a pull request.*

## What is the purpose of the pull request

JIRA: https://issues.apache.org/jira/browse/HUDI-808

This is an important requirement from a GDPR perspective. When performing a deletion on a metadata-only bootstrapped partition, users should be able to request that the original data also be cleaned up from the source location, because under the new bootstrapping mechanism the original data serves as the data of the original commit for Hudi.

This PR creates an option named ```hoodie.cleaner.bootstrap.source.file```. When it is set to true, users can clean the original source data of the bootstrap table. It defaults to false.

## Brief change log

Create an option named ```hoodie.cleaner.bootstrap.source.file```. When it is set to true, users can clean the original source data of the bootstrap table. It defaults to false. Corresponding unit tests for this option are added in TestCleaner.java.

## Verify this pull request

This change added tests and can be verified as follows:
- *Added two test cases in ```TestCleaner.java``` to verify the change.*

## Committer checklist
- [x] Has a corresponding JIRA in PR title & commit
- [x] Commit message is descriptive of the change
- [ ] CI is green
- [ ] Necessary doc changes done or have another open PR
- [ ] For large changes, please consider breaking it into sub-tasks under an umbrella JIRA.
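
For illustration, the `hoodie.cleaner.bootstrap.source.file` option described above could be set and read roughly as follows. This is a minimal sketch using only `java.util.Properties`. The class name and both helper methods are hypothetical and not part of this PR, and how the properties actually reach the Hudi write config is not shown. Only the key name and the `false` default are taken from the description.

```java
import java.util.Properties;

public class BootstrapCleanerConfigSketch {
    // Key name as given in the PR description.
    static final String CLEAN_BOOTSTRAP_SOURCE_FILE = "hoodie.cleaner.bootstrap.source.file";

    // Hypothetical helper: builds properties with the option set explicitly.
    static Properties cleanerProps(boolean cleanSourceFiles) {
        Properties props = new Properties();
        props.setProperty(CLEAN_BOOTSTRAP_SOURCE_FILE, Boolean.toString(cleanSourceFiles));
        return props;
    }

    // Hypothetical helper: reads the option, falling back to "false" when it
    // is unset, which mirrors the default stated in the description.
    static boolean shouldCleanSourceFiles(Properties props) {
        return Boolean.parseBoolean(props.getProperty(CLEAN_BOOTSTRAP_SOURCE_FILE, "false"));
    }

    public static void main(String[] args) {
        // Unset: source files of the bootstrap table are left alone.
        System.out.println(shouldCleanSourceFiles(new Properties()));
        // Explicit opt-in, e.g. to satisfy a GDPR deletion request.
        System.out.println(shouldCleanSourceFiles(cleanerProps(true)));
    }
}
```

Because the default is false, source data is only removed after an explicit opt-in, which seems the safer choice for files that live outside the Hudi table path.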
