[
https://issues.apache.org/jira/browse/HUDI-2364?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
sivabalan narayanan updated HUDI-2364:
--------------------------------------
Fix Version/s: 0.11.0
> Run compaction without user schema file provided
> ------------------------------------------------
>
> Key: HUDI-2364
> URL: https://issues.apache.org/jira/browse/HUDI-2364
> Project: Apache Hudi
> Issue Type: New Feature
> Reporter: Wenning Ding
> Priority: Major
> Labels: core-flow-ds, pull-request-available, sev:normal
> Fix For: 0.11.0
>
>
> Currently to run Hudi compaction manually, customers have to pass the avsc
> file of data schema by themselves,
> e.g. in Hudi CLI,
>
> {{}}
> {code:java}
> compaction run --compactionInstant 20201203005420 \ --parallelism 2
> --sparkMemory 2G \ --schemaFilePath s3://xxx/hudi/mor_schema.avsc \
> --propsFilePath file:///home/hadoop/config.properties --retry 1
> {code}
> Let customers provide avsc file is not a good option. Some customers don’t
> know how to generate this schema file, and some customers pass the wrong
> schema file and get other exceptions. We should handle this logic inside Hudi
> if possible.
--
This message was sent by Atlassian Jira
(v8.20.1#820001)