Cheng Lian created HIVE-14294:
---------------------------------
Summary: HiveSchemaConverter for Parquet doesn't translate TINYINT
and SMALLINT into proper Parquet types
Key: HIVE-14294
URL: https://issues.apache.org/jira/browse/HIVE-14294
Project: Hive
Issue Type: Bug
Affects Versions: 2.1.0, 1.2.1
Reporter: Cheng Lian
To reproduce this issue, run the following DDL:
{code:sql}
CREATE TABLE foo STORED AS PARQUET AS SELECT CAST(1 AS TINYINT);
{code}
And then check the schema of the written Parquet file:
{noformat}
$ parquet-schema $WAREHOUSE_PATH/foo/000000_0
message hive_schema {
  optional int32 _c0;
}
{noformat}
When translating Hive types into Parquet types, {{TINYINT}} and {{SMALLINT}}
should be translated into {{int32 (INT_8)}} and {{int32 (INT_16)}},
respectively. However, {{HiveSchemaConverter}} converts all of {{TINYINT}},
{{SMALLINT}}, and {{INT}} into a plain Parquet {{int32}} with no annotation.
This causes problems when other systems read Parquet files generated by Hive,
because the original integer width is lost and every such column appears to
be a 32-bit {{INT}}.
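
For illustration only, the expected mapping can be sketched as a small standalone Java helper. This is not the actual {{HiveSchemaConverter}} code and does not use the Parquet API. The class name {{ParquetIntMapping}} and the method {{toParquetField}} are hypothetical, and the assumption that every column is {{optional}} simply mirrors the schema output shown above.

```java
import java.util.Locale;

// Hypothetical helper, not part of Hive or Parquet: it only renders the
// Parquet field declaration this report expects for each Hive integer type.
final class ParquetIntMapping {
    private ParquetIntMapping() {}

    // Returns the expected Parquet schema line for a Hive column.
    // Assumption: the column is nullable, hence "optional".
    static String toParquetField(String hiveType, String columnName) {
        String annotation;
        switch (hiveType.toUpperCase(Locale.ROOT)) {
            case "TINYINT":
                annotation = " (INT_8)";   // 8-bit signed, stored as int32
                break;
            case "SMALLINT":
                annotation = " (INT_16)";  // 16-bit signed, stored as int32
                break;
            case "INT":
                annotation = "";           // plain int32, no annotation
                break;
            default:
                throw new IllegalArgumentException(
                    "Not a Hive integer type backed by int32: " + hiveType);
        }
        return "optional int32 " + columnName + annotation + ";";
    }
}
```

Under these assumptions, {{ParquetIntMapping.toParquetField("TINYINT", "_c0")}} yields {{optional int32 _c0 (INT_8);}}, which is the field declaration the reproduction above would be expected to show instead of the bare {{optional int32 _c0;}}.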