If #link is missing from uri format of -cacheArchive then streaming does not
throw error.
-----------------------------------------------------------------------------------------
Key: HADOOP-2879
URL: https://issues.apache.org/jira/browse/HADOOP-2879
Project: Hadoop Core
Issue Type: Bug
Components: contrib/streaming
Reporter: Karam Singh
Priority: Minor
Ran the Hadoop streaming command as follows:
bin/hadoop jar contrib/streaming/hadoop-*-streaming.jar -input in -output out
-mapper "xargs cat" -reducer "bin/cat" -cacheArchive hdfs://h:p/pathofJarFile
Streaming submits the job to the JobTracker and the map fails.
For a similar command with -cacheFile:
bin/hadoop jar contrib/streaming/hadoop-*-streaming.jar -input in -output out
-mapper "xargs cat" -reducer "bin/cat" -cacheFile hdfs://h:p/pathofFile
the following error is reported back:
[
You need to specify the uris as hdfs://host:port/#linkname,Please specify a
different link name for all of your caching URIs
]
Streaming should check for the presence of #link after the URI of -cacheArchive
and should throw a proper error if it is missing.
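
A minimal sketch of the kind of check requested here, assuming the link name is
carried in the URI fragment (the part after '#'). The class and method names
(CacheUriCheck, hasLinkName, requireLinkName) and the host/port in the example
are hypothetical and are not taken from the streaming code; only java.net.URI
from the JDK is used.

```java
import java.net.URI;
import java.net.URISyntaxException;

// Hypothetical helper, not part of Hadoop: illustrates validating that a
// cache URI carries a "#linkname" fragment before the job is submitted.
final class CacheUriCheck {

    // Returns true only if the string parses as a URI and has a non-empty fragment.
    static boolean hasLinkName(String uriString) {
        try {
            String fragment = new URI(uriString).getFragment();
            return fragment != null && !fragment.isEmpty();
        } catch (URISyntaxException e) {
            return false; // an unparsable URI cannot supply a link name
        }
    }

    // Fails fast with a descriptive message when the link name is missing.
    static void requireLinkName(String option, String uriString) {
        if (!hasLinkName(uriString)) {
            throw new IllegalArgumentException(
                option + ": missing #linkname in URI '" + uriString
                + "'; expected the form hdfs://host:port/path#linkname");
        }
    }

    public static void main(String[] args) {
        // Assumed example values; 8020 is only an illustrative port.
        requireLinkName("-cacheArchive", "hdfs://host:8020/path/archive.jar#mylink");
        try {
            requireLinkName("-cacheArchive", "hdfs://host:8020/path/archive.jar");
        } catch (IllegalArgumentException e) {
            System.out.println(e.getMessage());
        }
    }
}
```

Applying the same check to both -cacheFile and -cacheArchive at submission time
would make the two options fail in the same way, instead of letting the
-cacheArchive case reach the JobTracker and fail in the map.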