[GitHub] [incubator-hudi] cdmikechen opened a new issue #734: How to upsert data just with memory

GitBox Thu, 13 Jun 2019 18:29:23 -0700

cdmikechen opened a new issue #734: How to upsert data just with memory
URL: https://github.com/apache/incubator-hudi/issues/734
 
 
   I found that there is a `hoodie.write.status.storage.level` configuration in 
`HoodieWriteConfig`, so I tried to update a row in a hoodie table (750 rows 
and 400KB). But when I use Spark to update a row, hoodie still shuffles data 
to disk instead of using only memory, which takes more time.
   ![image](https://github.com/cdmikechen/image/blob/master/20190614.png)
   I think that if the data is small, hoodie should process it in memory 
without shuffling. Is there a way to stop hoodie from shuffling data?
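
   For reference, a minimal sketch of the write options such an upsert might carry. Only `hoodie.write.status.storage.level` comes from the text above; the other keys, the helper name `build_upsert_options`, and the table and field names are assumptions for illustration. `MEMORY_ONLY` is a Spark `StorageLevel` name. As far as I understand, this setting governs how the returned write-status RDD is persisted and may not affect whether Spark shuffles, so this shows the configuration being tried, not a confirmed fix.

```python
# Hypothetical helper: collects the writer options for a small upsert.
# Only "hoodie.write.status.storage.level" is taken from the issue text;
# every other key and value here is an assumption for illustration.
def build_upsert_options(table_name, record_key, precombine_field,
                         storage_level="MEMORY_ONLY", shuffle_parallelism=1):
    return {
        "hoodie.table.name": table_name,
        "hoodie.datasource.write.operation": "upsert",
        "hoodie.datasource.write.recordkey.field": record_key,
        "hoodie.datasource.write.precombine.field": precombine_field,
        # Spark StorageLevel name used when persisting write statuses.
        "hoodie.write.status.storage.level": storage_level,
        # Low parallelism for a tiny table (750 rows, about 400KB).
        "hoodie.upsert.shuffle.parallelism": str(shuffle_parallelism),
    }


# Placeholder table and field names, not taken from the issue.
options = build_upsert_options("demo_table", "id", "ts")
for key in sorted(options):
    print(f"{key}={options[key]}")
```

   The resulting dict would typically be passed to a Spark DataFrame writer through `.options(**options)`; the format name and target path depend on the Hudi version in use and are left out here.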


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[email protected]


With regards,
Apache Git Services
