[ 
https://issues.apache.org/jira/browse/PIG-856?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12721577#action_12721577
 ] 

Olga Natkovich commented on PIG-856:
------------------------------------

If a job fails, the store connected to this job will fail as well. Pig has no 
retries beyond what hadoop provides. That's why no replication seems a little 
risky but I want to see what the perf difference is and whether it is worth the 
risk.

> PERFORMANCE: reduce number of replicas
> --------------------------------------
>
>                 Key: PIG-856
>                 URL: https://issues.apache.org/jira/browse/PIG-856
>             Project: Pig
>          Issue Type: Improvement
>    Affects Versions: 0.3.0
>            Reporter: Olga Natkovich
>
> Currently Pig uses the default number of replicas between MR jobs. Currently, 
> the number is 3. Given the temp nature of the data, we should never need more 
> than 2 and should explicitely set it to improve performance and to be nicer 
> to the name node.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to