nsivabalan commented on issue #2675:
URL: https://github.com/apache/hudi/issues/2675#issuecomment-808828818


   Yes, your approach should work. Only change is that, we might have to fix it 
where we generate avro schema from df schema in HoodieSparkSqlWriter. Eg: 
https://github.com/nsivabalan/hudi/commit/43b3fc845a7b2ea4c68f1b3fc3e13b41bfb2d17e
   (My method to regenerateSchema is not full fledged. But it does work for mor 
w/ evolved schema for string type. actual fix should look like what you have in 
your commit) We need to fix the schema in HoodieSparkSqlWriter, bcoz, thats 
what gets serialized in commit metadata and hence. Not sure if we need to fix 
HoodieSparkUtils.createRdd(). 
   
   Please go ahead and open up a PR. Would be happy to review. 
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Reply via email to