[ https://issues.apache.org/jira/browse/HIVE-8359?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14211431#comment-14211431 ]
Ryan Blue commented on HIVE-8359:
---------------------------------

Thanks, Brock. De-duplicating this and HIVE-6994 is on my list of to-do items for today. I really like the tests that Sergio has put together for this, but I still need to see what Mickael has done that isn't done by Sergio's patch. I think it would be good if we all looked over both and then discussed what is needed for both of them.

> Map containing null values are not correctly written in Parquet files
> ---------------------------------------------------------------------
>
>                 Key: HIVE-8359
>                 URL: https://issues.apache.org/jira/browse/HIVE-8359
>             Project: Hive
>          Issue Type: Bug
>          Components: File Formats
>    Affects Versions: 0.13.1
>            Reporter: Frédéric TERRAZZONI
>            Assignee: Sergio Peña
>         Attachments: HIVE-8359.1.patch, map_null_val.avro
>
>
> Tried to write a map<string,string> column in a Parquet file. The table should
> contain:
> {code}
> {"key3":"val3","key4":null}
> {"key3":"val3","key4":null}
> {"key1":null,"key2":"val2"}
> {"key3":"val3","key4":null}
> {"key3":"val3","key4":null}
> {code}
> ... and when you do a query like {code}SELECT * from mytable{code}
> We can see that the table is corrupted:
> {code}
> {"key3":"val3"}
> {"key4":"val3"}
> {"key3":"val2"}
> {"key4":"val3"}
> {"key1":"val3"}
> {code}
> I've not been able to read the Parquet file in our software afterwards, and
> consequently I suspect it to be corrupted.
> For those who are interested, I generated this Parquet table from an Avro
> file.

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)