GitHub user MaxGekk opened a pull request:
https://github.com/apache/spark/pull/20793
[SPARK-23643] Shrinking the buffer in hashSeed up to size of the seed
parameter
## What changes were proposed in this pull request?
The hashSeed method allocates 64 bytes instead of 8. Other bytes are always
zeros. And they could be excluded from hash calculation because they don't
differentiate inputs.
## How was this patch tested?
By running the existing tests
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/MaxGekk/spark-1 hash-buff-size
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/spark/pull/20793.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #20793
----
commit bb40ef2e8d337508d60903a6a824b5aa45d87326
Author: Maxim Gekk <maxim.gekk@...>
Date: 2018-03-10T13:14:33Z
Shrinking the buffer up to size of the long type
----
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]