huonw opened a new pull request #24019: [SPARK-27099][SQL] Add 'xxhash64' for 
hashing arbitrary columns to Long
URL: https://github.com/apache/spark/pull/24019
 
 
   ## What changes were proposed in this pull request?
   
   This introduces a new SQL function 'xxhash64' for getting a 64-bit hash of 
an arbitrary number of columns.
   
   This is designed to exactly mimic the 32-bit `hash`, which uses
   MurmurHash3. The name is designed to be more future-proof than the
   'hash', by indicating the exact algorithm used, similar to md5 and the
   sha hashes.
   
   ## How was this patch tested?
   
   The tests for the existing `hash` function were duplicated to run with 
`xxhash64`.
   

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[email protected]


With regards,
Apache Git Services

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to