streaming doesn't support jobclient.output.filter
-------------------------------------------------
Key: HADOOP-6144
URL: https://issues.apache.org/jira/browse/HADOOP-6144
Project: Hadoop Common
Issue Type: Bug
Affects Versions: 0.20.0
Environment: Linux
Reporter: Alok Singh
The streaming job client implementation, i.e.
contrib/streaming/src/java/org/apache/hadoop/streaming/StreamJob.java, is
significantly different from the core Hadoop
mapred/org/apache/hadoop/mapred/JobClient.java.
For example, unlike StreamJob.java, JobClient.java fetches the task logs when
jobclient.output.filter=ALL is specified.
With hod-logs going away in Hadoop 0.20 (due to the new scheduler), users have
no good way of getting logs programmatically.
We should have an intermediate adaptor class that implements Tool and is used
for submitting jobs via m/r, streaming, and pipes, so that no submission path
misses core functionality.
For example, GenericJobClient implements Tool, and then StreamJob extends
GenericJobClient and JobClient extends GenericJobClient.
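
A minimal sketch of the proposed hierarchy follows. It is only an illustration
of the idea, not the actual Hadoop code: the Tool interface here is a local
stand-in for org.apache.hadoop.util.Tool so that the example compiles without
Hadoop on the classpath, the filter is read from a plain key=value argument
rather than from a JobConf, and the submit() hook and the log-fetching message
are hypothetical. The filter values and the FAILED default are assumed to
match JobClient.TaskStatusFilter.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

public class GenericJobClientSketch {

    /** Local stand-in for org.apache.hadoop.util.Tool (assumption, see above). */
    interface Tool {
        int run(String[] args) throws Exception;
    }

    /** Filter values assumed to mirror JobClient.TaskStatusFilter. */
    enum TaskStatusFilter { NONE, KILLED, FAILED, SUCCEEDED, ALL }

    /** Hypothetical shared base: holds behaviour every submission path needs. */
    abstract static class GenericJobClient implements Tool {
        static final String FILTER_KEY = "jobclient.output.filter";

        /** Collected client output, kept in memory so it can be inspected. */
        protected final List<String> output = new ArrayList<>();

        /** Parses a filter value; falls back to FAILED when none is given. */
        static TaskStatusFilter parseFilter(String value) {
            if (value == null || value.isEmpty()) {
                return TaskStatusFilter.FAILED;
            }
            return TaskStatusFilter.valueOf(value.toUpperCase(Locale.ROOT));
        }

        /** Subclasses only describe how their kind of job is submitted. */
        protected abstract String submit(String[] args);

        @Override
        public final int run(String[] args) {
            String filterValue = null;
            for (String arg : args) {
                if (arg.startsWith(FILTER_KEY + "=")) {
                    filterValue = arg.substring(FILTER_KEY.length() + 1);
                }
            }
            TaskStatusFilter filter = parseFilter(filterValue);

            String jobId = submit(args);
            output.add("submitted " + jobId);

            // The filter is honoured here, once, for every subclass.
            if (filter != TaskStatusFilter.NONE) {
                output.add("fetching task logs for " + jobId
                        + " with filter " + filter);
            }
            return 0;
        }
    }

    /** Streaming submission path; inherits the filter handling. */
    static class StreamJob extends GenericJobClient {
        @Override
        protected String submit(String[] args) {
            return "streaming-job";
        }
    }

    /** Plain m/r submission path; inherits the same filter handling. */
    static class JobClient extends GenericJobClient {
        @Override
        protected String submit(String[] args) {
            return "mapred-job";
        }
    }

    public static void main(String[] args) throws Exception {
        GenericJobClient client = new StreamJob();
        client.run(new String[] {"jobclient.output.filter=ALL"});
        for (String line : client.output) {
            System.out.println(line);
        }
    }
}
```

Because run() is final in the base class, a new submission path (pipes, for
instance) would get the jobclient.output.filter handling without having to
reimplement it.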
Alok