Cheolsoo Park created PIG-4074:
----------------------------------
Summary: mapreduce.client.submit.file.replication is not honored
in cached files
Key: PIG-4074
URL: https://issues.apache.org/jira/browse/PIG-4074
Project: Pig
Issue Type: Bug
Components: impl
Reporter: Cheolsoo Park
Assignee: Cheolsoo Park
Fix For: 0.14.0
Pig ships files to hdfs in several cases (e.g. replicated join, streaming
cached files, etc). But {{mapreduce.client.submit.file.replication}} (or
{{mapred.submit.replication}} for Hadoop 1.x) is not honored, and this has
performance impact since many tasks read the same hdfs blocks in a large
cluster.
--
This message was sent by Atlassian JIRA
(v6.2#6252)