xushiyan commented on issue #1846: URL: https://github.com/apache/hudi/issues/1846#issuecomment-662038478
> @xushiyan I want to replace contents of existing table. ie read existing 10k small files from tableA and replace tableA with 20 big files @tooptoop4 as i mentioned, HoodieSnapshotCopier is meant for fresh back-up via plain copying. What you want is rewriting data, which involves a hoodie writer job, not available in HoodieSnapshotCopier. Nevertheless, it might make sense for `HoodieSnapshotExporter` to incorporate such feature in the future. cc @vinothchandar @bvaradar Seems like ad-hoc transformation/re-writing is getting more often. This resizing could be another use case? a related feature is noted here https://issues.apache.org/jira/browse/HUDI-767 ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [email protected]
