Rui Li created HIVE-8017:
----------------------------

             Summary: Use HiveKey instead of Byteswritable as key type of the 
pair RDD [Spark Branch]
                 Key: HIVE-8017
                 URL: https://issues.apache.org/jira/browse/HIVE-8017
             Project: Hive
          Issue Type: Bug
          Components: Spark
            Reporter: Rui Li
            Assignee: Rui Li


HiveKey should be used as the key type because it holds the hash code for 
partitioning. While BytesWritable serves partitioning well for simple cases, we 
have to use {{HiveKey.hashCode}} for more complicated ones, e.g. join, bucketed 
table, etc.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to