[
https://issues.apache.org/jira/browse/HIVE-12362?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15453014#comment-15453014
]
Lenni Kuff commented on HIVE-12362:
-----------------------------------
I don't have a test case available to confirm this, it was only done by looking
at the code so have not confirmed. Seems that there is extra working happening
for each column value in each row, so could have a possible performance impact.
> Hive's Parquet SerDe ignores 'serialization.null.format' property
> -----------------------------------------------------------------
>
> Key: HIVE-12362
> URL: https://issues.apache.org/jira/browse/HIVE-12362
> Project: Hive
> Issue Type: Bug
> Components: Hive
> Affects Versions: 1.1.0
> Reporter: Naveen Gangam
> Assignee: Naveen Gangam
> Attachments: HIVE-12362.2.patch, HIVE-12362.patch
>
>
> {code}
> create table src (a string);
> insert into table src values (NULL), (''), ('');
> 0: jdbc:hive2://localhost:10000/default> select * from src;
> +-----------+--+
> | src.a |
> +-----------+--+
> | NULL |
> | |
> | |
> +-----------+--+
> create table dest (a string) row format serde
> 'org.apache.hadoop.hive.ql.io.parquet.serde.ParquetHiveSerDe' stored as
> INPUTFORMAT 'org.apache.hadoop.hive.ql.io.parquet.MapredParquetInputFormat'
> OUTPUTFORMAT 'org.apache.hadoop.hive.ql.io.parquet.MapredParquetOutputFormat';
> alter table dest set SERDEPROPERTIES ('serialization.null.format' = '');
> alter table dest set TBLPROPERTIES ('serialization.null.format' = '');
> insert overwrite table dest select * from src;
> 0: jdbc:hive2://localhost:10000/default> select * from test11;
> +-----------+--+
> | test11.a |
> +-----------+--+
> | NULL |
> | |
> | |
> +-----------+--+
> {code}
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)