Tejas Patil created SPARK-17495:
-----------------------------------

             Summary: Hive hash implementation
                 Key: SPARK-17495
                 URL: https://issues.apache.org/jira/browse/SPARK-17495
             Project: Spark
          Issue Type: Bug
          Components: SQL
            Reporter: Tejas Patil
            Priority: Minor


Spark internally uses Murmur3Hash for partitioning. This is different from the 
one used by Hive. For queries which use bucketing this leads to different 
results if one tries the same query on both engines. For us, we want users to 
have backward compatibility to that one can switch parts of applications across 
the engines without observing regressions.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to