[ 
https://issues.apache.org/jira/browse/MAPREDUCE-2845?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13096751#comment-13096751
 ] 

Harsh J commented on MAPREDUCE-2845:
------------------------------------

The property 'mapred.submit.replication' is configurable for that reason; or is 
this a request to lower that value?

Indeed, this would also hamper decommissioning efforts when an admin doesn't 
know about it.

I think making it dynamic makes sense, but adds RPC overheads possibly, per 
submission.

> Default replication level mapred.submit.replication=10 causes warnings on 
> small clusters
> ----------------------------------------------------------------------------------------
>
>                 Key: MAPREDUCE-2845
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-2845
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>          Components: client, distributed-cache
>    Affects Versions: 0.20.1
>         Environment: Cloudera CDH 2 (hadoop-0.20 0.20.1+169.127-1~lenny-cdh2)
>            Reporter: Christoph Schmitz
>            Priority: Minor
>
> By default, the replication level for job jars, libjars and the distributed 
> cache in general is mapred.submit.replication=10. This yields 
> under-replication warnings for these files on small clusters (less than 10 
> data nodes) when using fsck ("hadoop fsck") on their HDFS.
> Example on an 8-node cluster:
> {quote}
> /tmp/hadoop/mapred/system/job_201105191458_1857/job.jar:  Under replicated 
> blk_-6996370258385460742_366223. Target Replicas is 10 but found 8 replica(s).
> {quote}

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to