[GitHub] spark issue #21859: [SPARK-24900][SQL]Speed up sort when the dataset is smal...

cloud-fan Mon, 20 Aug 2018 18:23:48 -0700

GitHub user cloud-fan commented on the issue:

    https://github.com/apache/spark/pull/21859
  
    I don't think this optimization should be done at the SQL layer. The 
`ShuffleWriter` should treat `RangePartitioner` as a special case and consume 
the sampled data in `RangePartitioner` instead of the input iterator.
    
    That way, the SQL layer (as well as all other components) can benefit 
from it.
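
To illustrate the idea in the comment above, here is a minimal, self-contained Python sketch. It is not Spark code: `ToyRangePartitioner`, `toy_shuffle_write`, `make_reader`, `sample_limit`, and `sample_is_complete` are hypothetical names invented for this illustration, and it assumes the proposal applies when the partitioner's sample already contains the whole (small) input. The sample here is simply a prefix of the input, which is a simplification of real sampling.

```python
from bisect import bisect_left
from itertools import islice


class ToyRangePartitioner:
    """Hypothetical stand-in for a range partitioner (not Spark's API)."""

    def __init__(self, read_input, num_partitions, sample_limit):
        self.num_partitions = num_partitions
        # Pull one record more than the limit to learn whether the
        # sample already holds the entire input.
        head = list(islice(read_input(), sample_limit + 1))
        self.sample_is_complete = len(head) <= sample_limit
        # Simplification: the sample is a prefix, not a random sample.
        self.sample = head[:sample_limit]
        ordered = sorted(self.sample)
        # Choose num_partitions - 1 upper bounds from the sorted sample.
        self.bounds = [
            ordered[len(ordered) * i // num_partitions]
            for i in range(1, num_partitions)
        ] if ordered else []

    def get_partition(self, key):
        # Keys less than or equal to bounds[i] land in partition i.
        return bisect_left(self.bounds, key)


def toy_shuffle_write(read_input, partitioner):
    """Bucket records by partition, reusing the sample when it is complete."""
    if partitioner.sample_is_complete:
        records = partitioner.sample   # no second pass over the input
    else:
        records = read_input()         # fall back to the input iterator
    buckets = [[] for _ in range(partitioner.num_partitions)]
    for rec in records:
        buckets[partitioner.get_partition(rec)].append(rec)
    return buckets


def make_reader(data, passes):
    """Return an input factory that records each pass over the data."""
    def read_input():
        passes.append(1)
        return iter(data)
    return read_input


if __name__ == "__main__":
    passes = []
    reader = make_reader([5, 1, 4, 2, 3, 6], passes)
    partitioner = ToyRangePartitioner(reader, num_partitions=2, sample_limit=100)
    print(toy_shuffle_write(reader, partitioner), "passes:", len(passes))
```

In this sketch the special case lives in the writer, not in any particular caller, so every component that shuffles with the toy partitioner skips the second pass over a small input.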



