Github user jinxing64 commented on the issue:

    https://github.com/apache/spark/pull/21212
  
    @squito @cloud-fan @jiangxb1987
    Thanks a lot for review.
    
    > shall we also optimize the space usage for MapStatus
    
    @cloud-fan do you mean optimize space usage for MapStatus when there are 
lots of consecutive empty-blocks ?
    
    Rashid, thanks a lot for your comments. I refined the doc and did some more 
improvement for memory usage. Please take another look.
    >curious, have you observed this using a lot of memory from a heapdump or 
anything?
    
    Yes, it's from heapdump. We enabled adaptive execution and found this issue.
    
    > I wonder if using an ArrayBuffer is the smartest, maybe it'll have a ton 
of wasted space from the way it grows.
    
    I'm not sure about this either. So I just keep the ArrayBuffer here.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to