-----------------------------------------------------------
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/24377/
-----------------------------------------------------------

(Updated Aug. 6, 2014, 9:11 a.m.)


Review request for hive.


Bugs: HIVE-7142
    https://issues.apache.org/jira/browse/HIVE-7142


Repository: hive-git


Description
-------

Currently Hive only support serialize data into UTF-8 charset bytes or 
deserialize from UTF-8 bytes, real world users may want to load different kinds 
of encoded data into hive directly. This jira is dedicated to support 
serialize/deserialize all kinds of encoded data in SerDe layer.
For user, only need to configure serialization encoding on table level by set 
serialization encoding through serde parameter, for example:
CREATE TABLE person(id INT, name STRING, desc STRING)ROW FORMAT SERDE 
'org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe' WITH 
SERDEPROPERTIES("serialization.encoding"='GBK');
or
ALTER TABLE person SET SERDEPROPERTIES ('serialization.encoding'='GBK'); 
LIMITATIONS: Only LazySimpleSerDe support "serialization.encoding" property in 
this patch.


Diffs (updated)
-----

  serde/src/java/org/apache/hadoop/hive/serde2/AbstractEncodingAwareSerDe.java 
PRE-CREATION 
  serde/src/java/org/apache/hadoop/hive/serde2/DelimitedJSONSerDe.java 179f9b5 
  serde/src/java/org/apache/hadoop/hive/serde2/SerDeUtils.java b7fb048 
  serde/src/java/org/apache/hadoop/hive/serde2/lazy/LazySimpleSerDe.java 
fb55c70 

Diff: https://reviews.apache.org/r/24377/diff/


Testing
-------


Thanks,

chengxiang li

Reply via email to