[jira] Commented: (HADOOP-3566) Create an InputFormat for reading lines of text as Java Strings

Owen O'Malley (JIRA) Wed, 25 Jun 2008 14:53:37 -0700

    [ 
https://issues.apache.org/jira/browse/HADOOP-3566?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12608212#action_12608212
 ]


Owen O'Malley commented on HADOOP-3566:
---------------------------------------

Could we use

{code}
   K getKey() throws IOException;
   V getValue() throws IOException;
{code}

We should probably change the context objects for InputFormats this way too.
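
As a rough illustration of the accessor style suggested above, here is a minimal sketch. It is not the actual Hadoop API: the names PullRecordReader, StringLineReader and next() are hypothetical, and the offsets assume single-character '\n' line endings with one byte per character. The idea it shows is that the reader hands back its own key and value objects, instead of filling in objects supplied by the caller, so plain Long and String can be returned.

```java
// Hypothetical pull-style reader (not a Hadoop interface): the caller
// advances, then asks for the current key and value.
interface PullRecordReader<K, V> {
    boolean next() throws java.io.IOException; // hypothetical advance method
    K getKey() throws java.io.IOException;
    V getValue() throws java.io.IOException;
}

// Illustrative in-memory reader yielding (offset, line) as Long and String.
class StringLineReader implements PullRecordReader<Long, String> {
    private final String[] lines;
    private int index = -1;
    private long currentOffset = 0;
    private long nextOffset = 0;

    StringLineReader(String text) {
        // Trailing empty strings are dropped by split, so a final '\n' is fine.
        this.lines = text.split("\n");
    }

    @Override
    public boolean next() {
        if (index + 1 >= lines.length) {
            return false;
        }
        index++;
        currentOffset = nextOffset;
        // Assumption: each line ends in one '\n' and each char is one byte.
        nextOffset += lines[index].length() + 1;
        return true;
    }

    @Override
    public Long getKey() {
        return currentOffset;
    }

    @Override
    public String getValue() {
        return lines[index];
    }
}
```

Because String is immutable, a reader like this can only work if the framework accepts whatever object getValue() returns, which is the point of the proposed signatures.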

The whole Long,String signature for TextInputFormat has always kind of annoyed
me. No one ever uses those offsets, and they mean that a job built from
TextInputFormat, IdentityMapper, IdentityReducer, and TextOutputFormat doesn't
do the "sort" that people expect.
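
To make the "sort" point concrete, here is a small stand-alone sketch. It does not use Hadoop at all: IdentitySortDemo and its methods are hypothetical names, and a TreeMap stands in for the framework's sort-by-key for one input file and one reducer. With offsets as keys, an identity job orders records by offset, which is just the input order. Ordering by line text only happens if the line itself is the key.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

class IdentitySortDemo {
    // Keys are offsets, as with an (offset, line) input signature.
    // Sorting by key therefore reproduces the original line order.
    static List<String> sortByOffset(String[] lines) {
        Map<Long, String> byOffset = new TreeMap<>();
        long offset = 0;
        for (String line : lines) {
            byOffset.put(offset, line);
            offset += line.length() + 1; // assumes a one-byte '\n' terminator
        }
        return new ArrayList<>(byOffset.values());
    }

    // Keys are the lines themselves, which is the sort people expect.
    static List<String> sortByLine(String[] lines) {
        List<String> sorted = new ArrayList<>(Arrays.asList(lines));
        Collections.sort(sorted);
        return sorted;
    }
}
```

For the input lines "pear", "apple", "fig", sortByOffset leaves the order unchanged, while sortByLine puts "apple" first.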



> Create an InputFormat for reading lines of text as Java Strings
> ---------------------------------------------------------------
>
>                 Key: HADOOP-3566
>                 URL: https://issues.apache.org/jira/browse/HADOOP-3566
>             Project: Hadoop Core
>          Issue Type: Improvement
>          Components: mapred
>            Reporter: Tom White
>            Assignee: Tom White
>         Attachments: hadoop-3566.patch
>
>
> Such a StringInputFormat would be like TextInputFormat but with input types 
> of Long and String, rather than LongWritable and Text. This would allow users 
> to write MapReduce programs that used only Java native types (i.e. no 
> Writables).
> Writing such a format is currently not possible without changes to Hadoop,
> due to a limitation in the RecordReader interface explained here:
> https://issues.apache.org/jira/browse/HADOOP-3413?focusedCommentId=12597935#action_12597935

