[
https://issues.apache.org/jira/browse/MAPREDUCE-2845?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13115434#comment-13115434
]
Christoph Schmitz commented on MAPREDUCE-2845:
----------------------------------------------
My original intent was to have the system automatically do something like
mapred.submit.replication = min(10, number_of_nodes), so that job submission
would not require setting mapred.submit.replication manually to avoid the
under-replication warnings (on small clusters).
I see your point, though, that that would cause depencencies between HDFS and
Map/Reduce.
> Default replication level mapred.submit.replication=10 causes warnings on
> small clusters
> ----------------------------------------------------------------------------------------
>
> Key: MAPREDUCE-2845
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2845
> Project: Hadoop Map/Reduce
> Issue Type: Bug
> Components: client, distributed-cache
> Affects Versions: 0.20.1
> Environment: Cloudera CDH 2 (hadoop-0.20 0.20.1+169.127-1~lenny-cdh2)
> Reporter: Christoph Schmitz
> Priority: Minor
>
> By default, the replication level for job jars, libjars and the distributed
> cache in general is mapred.submit.replication=10. This yields
> under-replication warnings for these files on small clusters (less than 10
> data nodes) when using fsck ("hadoop fsck") on their HDFS.
> Example on an 8-node cluster:
> {quote}
> /tmp/hadoop/mapred/system/job_201105191458_1857/job.jar: Under replicated
> blk_-6996370258385460742_366223. Target Replicas is 10 but found 8 replica(s).
> {quote}
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators:
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira