[ 
https://issues.apache.org/jira/browse/HADOOP-3787?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12628188#action_12628188
 ] 

Joydeep Sen Sarma commented on HADOOP-3787:
-------------------------------------------

@Tom - the other thing is that SequenceFiles are self-describing - so the 
createKey()/Value() methods are trivial. For flat binary files - the record 
that's serialized is not implicit in the file and has to come from 
configuration outside. 

In Hive we have configuration per inputpath (or path-prefix really) that 
indicates the same information that's embedded inside sequencefile header. i am 
not sure whether we want to have this kind of information as part of 
hadoop-core.

will open a separate jira for binary flat files (opening corresponding one for 
Hive as well since this is one of the first requests we got) 

> Add serialization for Thrift
> ----------------------------
>
>                 Key: HADOOP-3787
>                 URL: https://issues.apache.org/jira/browse/HADOOP-3787
>             Project: Hadoop Core
>          Issue Type: Wish
>          Components: examples, mapred
>            Reporter: Tom White
>         Attachments: hadoop-3787.patch, libthrift.jar
>
>
> Thrift (http://incubator.apache.org/thrift/) is cross-language serialization 
> and RPC framework. This issue is to write a ThriftSerialization to support 
> using Thrift types in MapReduce programs, including an example program. This 
> should probably go into contrib.
> (There is a prototype implementation in 
> https://issues.apache.org/jira/secure/attachment/12370464/hadoop-serializer-v2.tar.gz)

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to