[ 
https://issues.apache.org/jira/browse/HBASE-3578?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13092950#comment-13092950
 ] 

Jan Lukavsky commented on HBASE-3578:
-------------------------------------

Hi,

I think solution to this issue causes problems when job wants to change hbase 
specific options. Eg.

{noformat}
Configuration conf = HBaseConfiguration.create();

// change keyvalue size
conf.setInt("hbase.client.keyvalue.maxsize", 20971520);

Job job = new Job(conf, ...);

TableMapReduceUtil.initTableMapperJob(...);

// the job doesn't have the option changed, uses it from hbase-site or 
hbase-default
job.submit();

{noformat}

Although in this case it could be fixed by moving the set() after 
initTableMapperJob(), in case where user want's to change some option using 
GenericOptionsParser and -D this is impossible, making this cool feature 
useless.

In the 0.20.x era this code behaved as expected. The solution of this problem 
should be that we don't overwrite the options, but just read them if they are 
missing. I attached patch I think will fix this.




> TableInputFormat does not setup the configuration for HBase mapreduce jobs 
> correctly
> ------------------------------------------------------------------------------------
>
>                 Key: HBASE-3578
>                 URL: https://issues.apache.org/jira/browse/HBASE-3578
>             Project: HBase
>          Issue Type: Bug
>          Components: mapreduce
>    Affects Versions: 0.90.0, 0.90.1
>            Reporter: Dan Harvey
>            Assignee: Dan Harvey
>             Fix For: 0.92.0
>
>         Attachments: HBASE-3578.patch, HBASE-3578.patch, 
> mapreduce_configuration.patch
>
>
> In 0.20.x and earlier TableMapReduceUtil (and other Input/OutputFormat 
> classes) used to setup the HTable with a HBaseConfiguration object, now that 
> has been deprecated in #HBASE-2036 they are constructed with Hadoop 
> configuration objects which do not contain the configuration xml file 
> resources required to setup HBase. I think it is currently expected this is 
> done when constructing the job but as this needs to be done for every HBase 
> mapreduce job it would be cleaner if the TableMapReduceUtil class did this 
> whilst setting up the TableInput/OutputFormat classes. 

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to