[ 
https://issues.apache.org/jira/browse/HUDI-2364?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

sivabalan narayanan updated HUDI-2364:
--------------------------------------
    Fix Version/s: 0.11.0

> Run compaction without user schema file provided
> ------------------------------------------------
>
>                 Key: HUDI-2364
>                 URL: https://issues.apache.org/jira/browse/HUDI-2364
>             Project: Apache Hudi
>          Issue Type: New Feature
>            Reporter: Wenning Ding
>            Priority: Major
>              Labels: core-flow-ds, pull-request-available, sev:normal
>             Fix For: 0.11.0
>
>
> Currently to run Hudi compaction manually, customers have to pass the avsc 
> file of data schema by themselves,
>  e.g. in Hudi CLI,
>  
> {{}}
> {code:java}
> compaction run --compactionInstant 20201203005420 \ --parallelism 2 
> --sparkMemory 2G \ --schemaFilePath s3://xxx/hudi/mor_schema.avsc \ 
> --propsFilePath file:///home/hadoop/config.properties --retry 1
> {code}
> Let customers provide avsc file is not a good option. Some customers don’t 
> know how to generate this schema file, and some customers pass the wrong 
> schema file and get other exceptions. We should handle this logic inside Hudi 
> if possible.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

Reply via email to