[ 
https://issues.apache.org/jira/browse/HUDI-757?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17072394#comment-17072394
 ] 

Prashant Wason commented on HUDI-757:
-------------------------------------

Test run on my setup on a hudi table which exported 2761 instants into a local 
directory.

 

hudi:pwason_db.pwason_test_table->export instants --localFolder 
/home/pwason/instant_dump

...

Exported 2761 Instants

 

ls -al  /home/pwason/instant_dump/

drwxrwxr-x 2 pwason users 131072 Mar 31 20:55 .
drwxr-xr-x 17 pwason users 4096 Mar 31 19:56 ..
-rw-rw-r-- 1 pwason users 15000711 Mar 31 20:55 20200228063343.commit
-rw-rw-r-- 1 pwason users 12042716 Mar 31 20:55 20200228094924.commit
-rw-rw-r-- 1 pwason users 73320 Mar 31 20:55 20200228094925.clean
-rw-rw-r-- 1 pwason users 11420128 Mar 31 20:55 20200228211516.commit
-rw-rw-r-- 1 pwason users 73320 Mar 31 20:55 20200228211540.clean
-rw-rw-r-- 1 pwason users 7567466 Mar 31 20:55 20200228221520.commit

> Add a command to hudi-cli to export commit metadata
> ---------------------------------------------------
>
>                 Key: HUDI-757
>                 URL: https://issues.apache.org/jira/browse/HUDI-757
>             Project: Apache Hudi (incubating)
>          Issue Type: Improvement
>            Reporter: Prashant Wason
>            Priority: Minor
>   Original Estimate: 4h
>  Remaining Estimate: 4h
>
> HUDI stores commit related information in files within the .hoodie directory. 
> Each commit / delatacommit / rollback / etc creates one or more files. To 
> prevent a large number of files, older files are consolidated together and 
> moved into a commit archive which has multiple such files written together 
> using the format of HUDI Log files.
> During debugging of issues or for development of new features, it may be 
> required to refer to the metadata of older commits / cleanups / rollbacks. 
> There is no simple way to get these from a production setup especially from 
> the archive files.
> This enhancement provides a hudi cli command which allows exporting metadata 
> from HUDI commit archives.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to