[jira] [Commented] (HBASE-11558) Caching set on Scan object gets lost when using TableMapReduceUtil in 0.95+

Hadoop QA (JIRA) Mon, 28 Jul 2014 15:12:12 -0700

    [ 
https://issues.apache.org/jira/browse/HBASE-11558?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14076992#comment-14076992
 ]


Hadoop QA commented on HBASE-11558:
-----------------------------------

{color:red}-1 overall{color}.  Here are the results of testing the latest 
attachment 
  
http://issues.apache.org/jira/secure/attachment/12658110/HBASE_11558-0.98_v2.patch
  against trunk revision .
  ATTACHMENT ID: 12658110

    {color:green}+1 @author{color}.  The patch does not contain any @author 
tags.

    {color:green}+1 tests included{color}.  The patch appears to include 3 new 
or modified tests.

    {color:red}-1 patch{color}.  The patch command could not apply the patch.

Console output: 
https://builds.apache.org/job/PreCommit-HBASE-Build/10209//console

This message is automatically generated.

> Caching set on Scan object gets lost when using TableMapReduceUtil in 0.95+
> ---------------------------------------------------------------------------
>
>                 Key: HBASE-11558
>                 URL: https://issues.apache.org/jira/browse/HBASE-11558
>             Project: HBase
>          Issue Type: Bug
>          Components: mapreduce, Scanners
>            Reporter: Ishan Chhabra
>            Assignee: Ishan Chhabra
>             Fix For: 0.99.0, 0.96.3, 0.98.5, 2.0.0
>
>         Attachments: HBASE_11558-0.96.patch, HBASE_11558-0.96_v2.patch, 
> HBASE_11558-0.98.patch, HBASE_11558-0.98_v2.patch, HBASE_11558.patch, 
> HBASE_11558_v2.patch
>
>
> 0.94 and before, if one sets caching on the Scan object in the Job by calling 
> scan.setCaching(int) and passes it to TableMapReduceUtil, it is correctly 
> read and used by the mappers during a mapreduce job. This is because 
> Scan.write respects and serializes caching, which is used internally by 
> TableMapReduceUtil to serialize and transfer the scan object to the mappers.
> 0.95+, after the move to protobuf, ProtobufUtil.toScan does not respect 
> caching anymore as ClientProtos.Scan does not have the field caching. Caching 
> is passed via the ScanRequest object to the server and so is not needed in 
> the Scan object. However, this breaks application code that relies on the 
> earlier behavior. This will lead to sudden degradation in Scan performance 
> 0.96+ for users relying on the old behavior.
> There are 2 options here:
> 1. Add caching to Scan object, adding an extra int to the payload for the 
> Scan object which is really not needed in the general case.
> 2. Document and preach that TableMapReduceUtil.setScannerCaching must be 
> called by the client.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Commented] (HBASE-11558) Caching set on Scan object gets lost when using TableMapReduceUtil in 0.95+

Reply via email to