[jira] Commented: (MAPREDUCE-1277) Streaming job should support other characterset in user's stderr log, not only utf8

ZhuGuanyin (JIRA) Thu, 10 Dec 2009 18:28:42 -0800

    [ 
https://issues.apache.org/jira/browse/MAPREDUCE-1277?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12789081#action_12789081
 ]


ZhuGuanyin commented on MAPREDUCE-1277:
---------------------------------------

I think the framework should not care which character set the input and the 
user logs use; the input or output may well contain more than one character set.

What Hadoop needs to do is feed raw data to the user's mapper or reducer, 
collect the raw stdout and stderr data, and save it on HDFS or on the 
TaskTracker's local disk.

Raw in, raw out, no matter what the character set is.
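
As a rough illustration of the raw-in, raw-out idea (this is not the attached 
patch, and it is not Hadoop code): the sketch below copies a stderr stream 
byte for byte and compares that with decoding it as UTF-8 first. The helper 
name copy_raw and the sample GBK log line are hypothetical and chosen only 
for the example.

```python
import io


def copy_raw(src, dst, chunk_size=4096):
    """Copy bytes from src to dst without decoding them.

    Hypothetical helper: src and dst are binary file-like objects.
    Returns the number of bytes copied.
    """
    total = 0
    while True:
        chunk = src.read(chunk_size)
        if not chunk:  # empty bytes object means end of stream
            break
        dst.write(chunk)
        total += len(chunk)
    return total


# A sample stderr line encoded in GBK; these bytes are not valid UTF-8.
gbk_line = "错误: 找不到文件\n".encode("gbk")

# Raw in, raw out: the stored bytes are identical to what the task wrote.
raw_sink = io.BytesIO()
copied = copy_raw(io.BytesIO(gbk_line), raw_sink)

# Forcing a UTF-8 decode replaces the undecodable bytes, so the original
# log line can no longer be recovered from what gets stored.
lossy = gbk_line.decode("utf-8", errors="replace").encode("utf-8")

print(raw_sink.getvalue() == gbk_line)
print(lossy == gbk_line)
```

With the raw copy, whoever reads the log later can pick the right decoder; 
once the bytes have gone through a UTF-8 decode, that choice is gone.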

> Streaming job should support other characterset in user's stderr log, not 
> only utf8
> -----------------------------------------------------------------------------------
>
>                 Key: MAPREDUCE-1277
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1277
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>          Components: contrib/streaming
>    Affects Versions: 0.21.0
>            Reporter: ZhuGuanyin
>            Assignee: ZhuGuanyin
>             Fix For: 0.21.0
>
>         Attachments: streaming-1277.patch
>
>
> The current streaming implementation only supports UTF-8 encoded user stderr 
> logs; it should be encoding-agnostic so that other character sets are supported.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
