[GitHub] [spark] siknezevic commented on issue #27246: [SPARK-30536][CORE][SQL] Sort-merge join operator spilling performance improvements

GitBox Wed, 22 Jan 2020 18:53:02 -0800

siknezevic commented on issue #27246: [SPARK-30536][CORE][SQL] Sort-merge join 
operator spilling performance improvements
URL: https://github.com/apache/spark/pull/27246#issuecomment-577481768
 
 
   The small data set will fit into executor memory, so there is no need to 
spill. I tested with just few records and it works fine. This solution will be 
faster for small data set when comparing with the current Spark spilling. My 
focus are bigger data sets and this solution will greatly improve spilling 
performance in those cases.


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[email protected]


With regards,
Apache Git Services

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[GitHub] [spark] siknezevic commented on issue #27246: [SPARK-30536][CORE][SQL] Sort-merge join operator spilling performance improvements

Reply via email to