[ 
https://issues.apache.org/jira/browse/HBASE-11484?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14056612#comment-14056612
 ] 

Matteo Bertozzi commented on HBASE-11484:
-----------------------------------------

-1 on this approach.

you can just change the ClientSideRegionScanner to take the region manifest and 
add the files to the HRegion object instead of using HRegion.openHRegion(). It 
requires a couple of changes in HRegion

anyway, what is the real motivation? restore doesn't copy the data, so it is 
not a space problem. and unless you restore a snapshot every 5 sec the number 
of calls to the NN shouldn't be the problem.

> Provide a way in TableSnapshotInputFormat, not to restore the regions to a 
> path for running MR every time, rather reuse a already restored path
> -----------------------------------------------------------------------------------------------------------------------------------------------
>
>                 Key: HBASE-11484
>                 URL: https://issues.apache.org/jira/browse/HBASE-11484
>             Project: HBase
>          Issue Type: New Feature
>          Components: mapreduce
>            Reporter: deepankar
>            Priority: Minor
>
> We are trying to back a Hive Table by the Map Reduce over snapshots  and we 
> don't want to restore the snapshot to a restoreDir every time we want to 
> execute a query. It would be nice if there is boolean in the function 
> * TableSnapshotInputFormat.setInput * and exposed outside in the
> * TableMapReduceUtil.initTableSnapshotMapperJob *, with this boolean
> it will check whether the snapshot and the restore dir are in sync, rather 
> than restoring again. 
> Is this Idea looks Ok to you guys or you have any other suggestions, I will 
> put up a patch for this if this idea is ok for guys



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to