zaminhassnain06 opened a new issue, #11403:
URL: https://github.com/apache/hudi/issues/11403

   Hi
   Our organization is migrating from Hudi 0.6.0 to Hudi 0.12.1 and also 
updating the required spark and EMR versions. Our existing data sets (100s of 
TBs of data on S3) are written using Hudi 0.6.0.
   
   The latest version of Hudi has come way since 0.6.0, we are not sure about 
how to use 0.12.1 directly.
   
   Could someone provide the steps for upgrading from 0.6.0 to 0.12.1?
   
   Do we have to rebuild our tables, we are more concerned about this as tables 
are having billions of records ?
   
   Should we expect following imporvements after the upgrade: 
         – faster upserts
   
        – columns add/modify (schema evolution)
   
        – clustering
   
        – possible solution for storing history of updates performed on recrods
   
   Thanks,
   Zamin Hassnain


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to