[
https://issues.apache.org/jira/browse/HBASE-11484?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14056612#comment-14056612
]
Matteo Bertozzi commented on HBASE-11484:
-----------------------------------------
-1 on this approach.
You can just change ClientSideRegionScanner to take the region manifest and
add the files to the HRegion object instead of using HRegion.openHRegion(). It
requires a couple of changes in HRegion.
Anyway, what is the real motivation? Restore doesn't copy the data, so it is
not a space problem. And unless you restore a snapshot every 5 seconds, the
number of calls to the NameNode (NN) shouldn't be a problem.
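For context, the "in sync" check the issue asks for boils down to something
like the sketch below. It is only an illustration under stated assumptions:
the RestoreDirReuse class and the ".restored-snapshot" marker file are made up
here and are not HBase APIs, and it uses the local filesystem via java.nio
rather than HDFS.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;

/** Hypothetical helper (not part of HBase): decides whether a restore dir can be reused. */
final class RestoreDirReuse {
    // Assumed marker file that records which snapshot the directory was restored from.
    private static final String MARKER = ".restored-snapshot";

    private RestoreDirReuse() {}

    /** Records that restoreDir now holds a restore of the given snapshot. */
    static void markRestored(Path restoreDir, String snapshotName) throws IOException {
        Files.createDirectories(restoreDir);
        Files.write(restoreDir.resolve(MARKER),
                    snapshotName.getBytes(StandardCharsets.UTF_8));
    }

    /** Returns true only if the marker file exists and names the same snapshot. */
    static boolean isInSync(Path restoreDir, String snapshotName) throws IOException {
        Path marker = restoreDir.resolve(MARKER);
        if (!Files.isRegularFile(marker)) {
            return false; // never restored here, or the marker was removed
        }
        String recorded = new String(Files.readAllBytes(marker), StandardCharsets.UTF_8);
        return recorded.equals(snapshotName);
    }
}
```

A caller would restore and call markRestored() only when isInSync() returns
false. Note that comparing names alone would not notice a snapshot that was
deleted and re-created under the same name, so a real check would need more
than this.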
> Provide a way in TableSnapshotInputFormat, not to restore the regions to a
> path for running MR every time, rather reuse a already restored path
> -----------------------------------------------------------------------------------------------------------------------------------------------
>
> Key: HBASE-11484
> URL: https://issues.apache.org/jira/browse/HBASE-11484
> Project: HBase
> Issue Type: New Feature
> Components: mapreduce
> Reporter: deepankar
> Priority: Minor
>
> We are trying to back a Hive table with MapReduce over snapshots, and we
> don't want to restore the snapshot to a restoreDir every time we execute a
> query. It would be nice to have a boolean in
> *TableSnapshotInputFormat.setInput*, also exposed through
> *TableMapReduceUtil.initTableSnapshotMapperJob*. With this boolean set, it
> would check whether the snapshot and the restore dir are in sync, rather
> than restoring again.
> Does this idea look OK to you, or do you have any other suggestions? I will
> put up a patch for this if the idea is OK.