sivabalan narayanan created HUDI-2125:
-----------------------------------------
Summary: Create a public dataset and guideline/playbook for use by the
public
Key: HUDI-2125
URL: https://issues.apache.org/jira/browse/HUDI-2125
Project: Apache Hudi
Issue Type: Improvement
Components: Usability
Reporter: sivabalan narayanan
Expose a public dataset, with schema details and instructions on how to use it.
For example:
* We could host a parquet dump somewhere that users could read from to generate
their own hudi tables.
* We could have a playbook for creating the different types of hudi tables
(COW/MOR) by reading from this source.
* We could add a playbook for using deltastreamer to read from this source one
file at a time and ingest it into a hudi table.
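
As a rough illustration of what such a playbook might contain, here is a
minimal PySpark-style sketch. It is not an agreed design. The column names
(id, updated_at, region), the table name, and both helper functions are
hypothetical. The loop uses the Spark datasource writer rather than
deltastreamer, purely to show the one-file-at-a-time idea. The option keys
are the standard Hudi datasource write configs.

```python
from typing import Dict, Iterable

# Short names used in this issue mapped to Hudi's table type values.
TABLE_TYPES = {"COW": "COPY_ON_WRITE", "MOR": "MERGE_ON_READ"}


def hudi_write_options(table_name: str, table_type: str, record_key: str,
                       precombine_field: str,
                       partition_field: str) -> Dict[str, str]:
    """Build Hudi datasource write options for a COW or MOR table.

    The field names passed in are assumptions about the public dataset's
    schema, which this issue does not define yet.
    """
    if table_type not in TABLE_TYPES:
        raise ValueError(f"table_type must be one of {sorted(TABLE_TYPES)}")
    return {
        "hoodie.table.name": table_name,
        "hoodie.datasource.write.table.type": TABLE_TYPES[table_type],
        "hoodie.datasource.write.operation": "upsert",
        "hoodie.datasource.write.recordkey.field": record_key,
        "hoodie.datasource.write.precombine.field": precombine_field,
        "hoodie.datasource.write.partitionpath.field": partition_field,
    }


def load_files_one_at_a_time(spark, source_files: Iterable[str],
                             target_path: str,
                             options: Dict[str, str]) -> int:
    """Read each parquet file in turn and write it to one Hudi table.

    The first file creates the table (overwrite); later files are
    upserted into it (append). Returns the number of files written.
    """
    count = 0
    for path in source_files:
        df = spark.read.parquet(path)
        mode = "overwrite" if count == 0 else "append"
        df.write.format("hudi").options(**options).mode(mode).save(target_path)
        count += 1
    return count
```

Running this against a real SparkSession would require the Hudi Spark bundle
on the classpath. Calling hudi_write_options("trips_cow", "COW", "id",
"updated_at", "region") and then the same call with "MOR" would give the two
table flavours from the same source files.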