modi95 commented on issue #1005: [HUDI-91][HUDI-12]Migrate to spark 2.4.4, 
migrate to spark-avro library instead of databricks-avro, add support for 
Decimal/Date types
URL: https://github.com/apache/incubator-hudi/pull/1005#issuecomment-552152313
 
 
   Hi Udit! Thanks for making this PR!
   
   I've been working on upgrading HUDI to Spark 2.4 internally at Uber! So I'll 
list out a few things that I had to do, so that you're not trying to 
re-discover these things yourself :) 
   
   1. Some of the `create` functions in `HoodieWrapperFileSystem` don't fully 
work with Parquet 1.10+. See 
[here](https://github.com/apache/incubator-hudi/blob/b19bed442d84c1cb1e48d184c9554920735bcb6c/hudi-common/src/main/java/org/apache/hudi/common/io/storage/HoodieWrapperFileSystem.java#L146).
 We'll need to make sure that all the `create` functions correctly call 
`wrapOutputStream`. 
   2. `hive-exec` is a fat JAR. It might be causing unit test failures, as it 
may introduce an older version of Avro onto the classpath. We're currently 
trying to figure out how to address this. Let us know if you have any 
suggestions! 
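
To make point 1 concrete, here is a rough sketch of the pattern I mean: every `create` overload funnels into one place that calls `wrapOutputStream`, so no code path can hand back an unwrapped stream. This is not the actual Hudi code. `WrapperFileSystemSketch`, `CountingOutputStream`, and `openRaw` are made-up names, the signatures are simplified to avoid a Hadoop dependency, and I'm assuming the point of the wrapper is to track bytes written.

```java
import java.io.ByteArrayOutputStream;
import java.io.FilterOutputStream;
import java.io.IOException;
import java.io.OutputStream;
import java.util.concurrent.atomic.AtomicLong;

// Hypothetical stand-in for a size-tracking stream; not Hudi's actual class.
class CountingOutputStream extends FilterOutputStream {
    private final AtomicLong bytesWritten = new AtomicLong();

    CountingOutputStream(OutputStream out) {
        super(out);
    }

    @Override
    public void write(int b) throws IOException {
        out.write(b);
        bytesWritten.incrementAndGet();
    }

    @Override
    public void write(byte[] b, int off, int len) throws IOException {
        out.write(b, off, len);
        bytesWritten.addAndGet(len);
    }

    long getBytesWritten() {
        return bytesWritten.get();
    }
}

// Hypothetical, simplified wrapper file system (assumption: String paths
// and in-memory streams instead of Hadoop's Path and FSDataOutputStream).
class WrapperFileSystemSketch {
    // Stands in for delegating to the underlying file system.
    private OutputStream openRaw(String path) {
        return new ByteArrayOutputStream();
    }

    // The single place where raw streams get wrapped.
    private CountingOutputStream wrapOutputStream(String path, OutputStream raw) {
        return new CountingOutputStream(raw);
    }

    // Shorter overloads delegate to the most specific one, so each of
    // them ends up going through wrapOutputStream.
    CountingOutputStream create(String path) {
        return create(path, true);
    }

    CountingOutputStream create(String path, boolean overwrite) {
        return create(path, overwrite, 4096);
    }

    CountingOutputStream create(String path, boolean overwrite, int bufferSize) {
        // overwrite and bufferSize are ignored in this sketch.
        return wrapOutputStream(path, openRaw(path));
    }
}
```

The real class has many more overloads, but the idea is the same: an overload that returns the underlying stream directly skips the wrapper, and anything written through it goes untracked.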
   
   Btw - I also went to UIUC! Great to meet new Illini! 

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[email protected]


With regards,
Apache Git Services
