Sergio Peña created HIVE-8577:
---------------------------------
Summary: Cannot deserialize Avro schema with a map<string,string>
with null values
Key: HIVE-8577
URL: https://issues.apache.org/jira/browse/HIVE-8577
Project: Hive
Issue Type: Bug
Affects Versions: 0.14.0
Reporter: Sergio Peña
An avro table with a map<string,string> column that contains null values
cannot be deserialized when running the select statement.
Create the following table:
{noformat}
CREATE TABLE avro_table (avreau_col_1 map<string,string>)
ROW FORMAT SERDE 'org.apache.hadoop.hive.serde2.avro.AvroSerDe' STORED AS
INPUTFORMAT 'org.apache.hadoop.hive.ql.io.avro.AvroContainerInputFormat'
OUTPUTFORMAT 'org.apache.hadoop.hive.ql.io.avro.AvroContainerOutputFormat'
TBLPROPERTIES ('avro.schema.url'='file:///tmp/map_null_schema.avro');
{noformat}
Then load the avro data:
{noformat}
LOAD DATA LOCAL INPATH '/tmp/map_null_val.avro' OVERWRITE INTO TABLE avro_table;
{noformat}
And do the select (it fails):
{noformat}
SELECT * FROM avro_table;
Error: java.io.IOException: org.apache.avro.AvroRuntimeException: Not a map:
"null" (state=,code=0)
{noformat}
This is a regression bug (it works correctly on hive 0.13.1 version).
This is the output that hive 0.13.1 displays:
{noformat}
{"key3":"val3","key4":null}
{"key3":"val3","key4":null}
{"key1":null,"key2":"val2"}
{"key3":"val3","key4":null}
{"key3":"val3","key4":null}
{noformat}
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)