Eric Chu created HIVE-6210:
------------------------------

             Summary: Default serde for RCFile has changed
                 Key: HIVE-6210
                 URL: https://issues.apache.org/jira/browse/HIVE-6210
             Project: Hive
          Issue Type: Bug
          Components: File Formats
    Affects Versions: 0.12.0
            Reporter: Eric Chu


In Hive 10 when I create a table in RCFile, the serde is 
org.apache.hadoop.hive.serde2.columnar.ColumnarSerDe

In Hive 12 when I do the same thing, the serde becomes 
org.apache.hadoop.hive.serde2.columnar.LazyBinaryColumnarSerDe

Similarly, in Hive 12, when I set FILEFORMAT to RCFILE, the serde will become 
LazyBinaryColumnarSerDe, as opposed to ColumnarSerDe in previous versions. What 
is the reason behind a change? This seems like a regression bug to me.

Normally, we can work around the issue by explicitly setting the table serde to 
be org.apache.hadoop.hive.serde2.columnar.ColumnarSerDe. However, this causes a 
problem for our migration to ORC. Specifically, we have a partitioned table for 
which we want the new partitions to have locations pointing to ORC partitions, 
and the old partitions to have locations pointing to RCFILE partitions. 
Moreover, we need the ability to change the location of a partition to point to 
RCFILE partition. For this we'd do so by doing SET FILEFORMAT RCFILE. However, 
b/c of this serde problem the RCFile partition in an ORC table will have the 
wrong serde, and ALTER TABLE doesn't allow us to set serde for a partition. 



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

Reply via email to