[
https://issues.apache.org/jira/browse/HIVE-6572?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sushanth Sowmyan updated HIVE-6572:
-----------------------------------
Description:
HadoopShims has a method to fetch config parameters by name so that they return
the appropriate config param name for the appropriate hadoop version. We need
to be consistent about using these versions.
For eg:. mapred.min.split.size is deprecated with hadoop 2.x, and is instead
called mapreduce.input.fileinputformat.split.minsize .
Also, there is a bug in Hadoop23Shims, Hadoop20SShims and Hadoop20Shims that
defines MAPREDMINSPLITSIZEPERNODE as mapred.min.split.size.per.rack and
MAPREDMINSPLITSIZEPERRACK as mapred.min.split.size.per.node. This is wrong and
confusing.
was:
HadoopShims has a method to fetch config parameters by name so that they return
the appropriate config param name for the appropriate hadoop version. We need
to be consistent about using these versions.
For eg:. mapred.min.split.size is deprecated with hadoop 2.x, and is instead
called mapreduce.input.fileinputformat.split.minsize .
Also, there is a bug in Hadoop20SShims and Hadoop20Shims that defines
MAPREDMINSPLITSIZEPERNODE as mapred.min.split.size.per.rack and
MAPREDMINSPLITSIZEPERRACK as mapred.min.split.size.per.node. This is wrong and
confusing.
> Use shimmed version of hadoop conf names for mapred.{min,max}.split.size{.*}
> ----------------------------------------------------------------------------
>
> Key: HIVE-6572
> URL: https://issues.apache.org/jira/browse/HIVE-6572
> Project: Hive
> Issue Type: Bug
> Affects Versions: 0.13.0, 0.14.0
> Reporter: Sushanth Sowmyan
> Assignee: Sushanth Sowmyan
> Attachments: HIVE-6572.patch
>
>
> HadoopShims has a method to fetch config parameters by name so that they
> return the appropriate config param name for the appropriate hadoop version.
> We need to be consistent about using these versions.
> For eg:. mapred.min.split.size is deprecated with hadoop 2.x, and is instead
> called mapreduce.input.fileinputformat.split.minsize .
> Also, there is a bug in Hadoop23Shims, Hadoop20SShims and Hadoop20Shims that
> defines MAPREDMINSPLITSIZEPERNODE as mapred.min.split.size.per.rack and
> MAPREDMINSPLITSIZEPERRACK as mapred.min.split.size.per.node. This is wrong
> and confusing.
--
This message was sent by Atlassian JIRA
(v6.2#6252)