[
https://issues.apache.org/jira/browse/MAPREDUCE-1277?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12789081#action_12789081
]
ZhuGuanyin commented on MAPREDUCE-1277:
---------------------------------------
I think the framework should not care what the characterset of the input and
user log, may be the input or output has more than one characterset.
what hadoop need to do is read raw data for user mapper or reducer, collect raw
stdout and stderr data and save them on hdfs or tasktracker local disk.
raw in, raw out, no matter what characterset it is.
> Streaming job should support other characterset in user's stderr log, not
> only utf8
> -----------------------------------------------------------------------------------
>
> Key: MAPREDUCE-1277
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-1277
> Project: Hadoop Map/Reduce
> Issue Type: Bug
> Components: contrib/streaming
> Affects Versions: 0.21.0
> Reporter: ZhuGuanyin
> Assignee: ZhuGuanyin
> Fix For: 0.21.0
>
> Attachments: streaming-1277.patch
>
>
> Current implementation in streaming only support utf8 encoded user stderr
> log, it should encode free to support other characterset.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.