[ https://issues.apache.org/jira/browse/HIVE-895?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12889558#action_12889558 ]
Alex Rovner commented on HIVE-895: ---------------------------------- Can some one please explain to me how would this serde work? Specifically how would it deserialize the data? >From what I understand Avro file has a header that defines the data that is >stored in the file. In order to deserialize the data you need to read the >header which is a challenge in Hive's Deserialize interface because the >initialize() method does not know anything about the input file. (Note: there >is a hack that can get you the file by getting the map.input hadoop >property.... this hack however is not good enough in hive because some one >might be using the CLI to query which will not trigger a map reduce job. Does anyone know a good solution to this issue? I am actually trying to implements a different file format but the idea of our format is similar to Avro: Each file has a header in which it contains a "schema" Thanks > Add SerDe for Avro serialized data > ---------------------------------- > > Key: HIVE-895 > URL: https://issues.apache.org/jira/browse/HIVE-895 > Project: Hadoop Hive > Issue Type: New Feature > Components: Serializers/Deserializers > Reporter: Jeff Hammerbacher > > As Avro continues to mature, having a SerDe to allow HiveQL queries over Avro > data seems like a solid win. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.