cdmikechen opened a new issue #734: How to upsert data just with memory
URL: https://github.com/apache/incubator-hudi/issues/734
 
 
   I found the `hoodie.write.status.storage.level` configuration in 
`HoodieWriteConfig`, so I tried updating a single row in a Hoodie table (750 rows, 
400KB). However, when Spark writes the update, Hoodie still shuffles data to disk 
instead of using only memory, which makes the update take longer.
   ![image](https://github.com/cdmikechen/image/blob/master/20190614.png)
   I think that when the data is small, Hoodie should process it in memory without 
shuffling. Is there a way to keep Hoodie from shuffling data?
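
   For concreteness, here is a minimal sketch of the write options in question, assuming a PySpark job. The helper name `small_upsert_options`, the table name, the field names, and the parallelism of 1 are illustrative placeholders, not values from the actual job. Lowering `hoodie.upsert.shuffle.parallelism` is only a guess at what might reduce overhead for a small table, not a confirmed way to avoid the shuffle.

```python
def small_upsert_options(table_name, record_key, precombine_field, parallelism=1):
    """Build Hudi datasource options for upserting into a small table.

    All argument values are hypothetical; only the option keys are Hudi settings.
    """
    return {
        "hoodie.table.name": table_name,
        "hoodie.datasource.write.operation": "upsert",
        "hoodie.datasource.write.recordkey.field": record_key,
        "hoodie.datasource.write.precombine.field": precombine_field,
        # Storage level used when Hudi persists the write statuses.
        "hoodie.write.status.storage.level": "MEMORY_ONLY",
        # Assumption: fewer shuffle partitions may cost less for tiny inputs.
        "hoodie.upsert.shuffle.parallelism": str(parallelism),
    }


options = small_upsert_options("example_table", "id", "ts")
for key, value in sorted(options.items()):
    print(f"{key}={value}")
```

In a Spark job these options would be passed to the DataFrame writer through `.options(**options)` before saving in append mode to the table path. That step is not shown because it depends on the Hoodie version and on the cluster setup.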

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[email protected]


With regards,
Apache Git Services
