Cheolsoo Park created PIG-4074: ---------------------------------- Summary: mapreduce.client.submit.file.replication is not honored in cached files Key: PIG-4074 URL: https://issues.apache.org/jira/browse/PIG-4074 Project: Pig Issue Type: Bug Components: impl Reporter: Cheolsoo Park Assignee: Cheolsoo Park Fix For: 0.14.0
Pig ships files to hdfs in several cases (e.g. replicated join, streaming cached files, etc). But {{mapreduce.client.submit.file.replication}} (or {{mapred.submit.replication}} for Hadoop 1.x) is not honored, and this has performance impact since many tasks read the same hdfs blocks in a large cluster. -- This message was sent by Atlassian JIRA (v6.2#6252)