zaminhassnain06 opened a new issue, #11403:
URL: https://github.com/apache/hudi/issues/11403
Hi
Our organization is migrating from Hudi 0.6.0 to Hudi 0.12.1 and also
updating the required spark and EMR versions. Our existing data sets (100s of
TBs of data on S3) are written using Hudi 0.6.0.
The latest version of Hudi has come way since 0.6.0, we are not sure about
how to use 0.12.1 directly.
Could someone provide the steps for upgrading from 0.6.0 to 0.12.1?
Do we have to rebuild our tables, we are more concerned about this as tables
are having billions of records ?
Should we expect following imporvements after the upgrade:
– faster upserts
– columns add/modify (schema evolution)
– clustering
– possible solution for storing history of updates performed on recrods
Thanks,
Zamin Hassnain
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]