[
https://issues.apache.org/jira/browse/HBASE-10546?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13903567#comment-13903567
]
Lars Hofhansl commented on HBASE-10546:
---------------------------------------
Unless there are objections I'll commit to trunk version of the patch to all
branches. We can always go back later and remove spurious calls to initialize.
Right now I'd prefer to keep them ([~vmariyala]).
> Two scanner objects are open for each hbase map task but only one scanner
> object is closed
> ------------------------------------------------------------------------------------------
>
> Key: HBASE-10546
> URL: https://issues.apache.org/jira/browse/HBASE-10546
> Project: HBase
> Issue Type: Bug
> Reporter: Vasu Mariyala
> Fix For: 0.96.2, 0.98.1, 0.99.0, 0.94.17
>
> Attachments: 0.94-HBASE-10546.patch, trunk-HBASE-10546.patch
>
>
> Map reduce framework calls createRecordReader of the
> TableInputFormat/MultiTableInputFormat to get the record reader instance. In
> this method, we are initializing the TableRecordReaderImpl (restart method).
> This initializes the scanner object. After this, map reduce framework calls
> initialize on the RecordReader. In our case, this calls restart of the
> TableRecordReaderImpl again. Here, it doesn't close the first scanner. At the
> end of the task, only the second scanner object is closed. Because of this,
> the smallest read point of HRegion is affected.
> We don't need to initialize the RecordReader in the createRecordReader method
> and we need to close the scanner object in the restart method. (incase if the
> restart method is called because of exceptions in the nextKeyValue method)
--
This message was sent by Atlassian JIRA
(v6.1.5#6160)