runzhiwang opened a new pull request #232: [LIVY-667][WIP] Collecting a part of 
partition to the driver by batch to avoid OOM
URL: https://github.com/apache/incubator-livy/pull/232
 
 
   ## What changes were proposed in this pull request?
   
   Collect each partition to the driver in batches, rather than all at once, to avoid OOM.
   
   Background:
   1. When livy.server.thrift.incrementalCollect is enabled, the Thrift server uses 
toLocalIterator to load one partition at a time instead of the whole RDD, to 
avoid OutOfMemory errors. However, if the largest partition is too big, an 
OutOfMemory error still occurs.
   
   2. This PR collects each partition to the driver in batches, one batch at a 
time, to avoid OOM.
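   
   To illustrate the idea only, here is a minimal sketch in plain Python. It is 
not the code in this PR and does not use Spark or Livy APIs: partitions are 
simulated as ordinary iterables, and the names `iter_batches`, 
`incremental_collect`, and `batch_size` are hypothetical. The point it shows is 
that the consumer holds at most one batch of rows at a time, no matter how large 
a single partition is.

```python
from itertools import islice
from typing import Iterable, Iterator, List, TypeVar

T = TypeVar("T")


def iter_batches(partition: Iterable[T], batch_size: int) -> Iterator[List[T]]:
    """Yield successive lists of at most batch_size rows from one partition."""
    if batch_size <= 0:
        raise ValueError("batch_size must be positive")
    rows = iter(partition)
    while True:
        # Pull only the next batch_size rows; the rest stay unread.
        batch = list(islice(rows, batch_size))
        if not batch:
            return
        yield batch


def incremental_collect(partitions: Iterable[Iterable[T]], batch_size: int) -> Iterator[T]:
    """Stream rows partition by partition, materializing one batch at a time."""
    for partition in partitions:
        for batch in iter_batches(partition, batch_size):
            # Only this batch is held in memory while its rows are consumed.
            yield from batch
```

   In this sketch peak memory on the consuming side is bounded by `batch_size` 
rows rather than by the size of the largest partition, which is the property the 
PR is after.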
   
   ## How was this patch tested?
   
    Created a large amount of data in a single partition and queried all of it.
   

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services
