[
https://issues.apache.org/jira/browse/HBASE-11484?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14056655#comment-14056655
]
Matteo Bertozzi commented on HBASE-11484:
-----------------------------------------
the manifest is just a list of file names. The restore step is required only
because the openRegion() is doing a fs.listStatus() to get the file list. if
instead of that you pass the manifest or in case of 94 the list of files you
can avoid to restore step.
> Provide a way in TableSnapshotInputFormat, not to restore the regions to a
> path for running MR every time, rather reuse a already restored path
> -----------------------------------------------------------------------------------------------------------------------------------------------
>
> Key: HBASE-11484
> URL: https://issues.apache.org/jira/browse/HBASE-11484
> Project: HBase
> Issue Type: New Feature
> Components: mapreduce
> Reporter: deepankar
> Priority: Minor
>
> We are trying to back a Hive Table by the Map Reduce over snapshots and we
> don't want to restore the snapshot to a restoreDir every time we want to
> execute a query. It would be nice if there is boolean in the function
> *TableSnapshotInputFormat.setInput* and exposed outside in the
> *TableMapReduceUtil.initTableSnapshotMapperJob*, with this boolean
> it will check whether the snapshot and the restore dir are in sync, rather
> than restoring again.
> Is this Idea looks Ok to you guys or you have any other suggestions, I will
> put up a patch for this if this idea is ok for guys
--
This message was sent by Atlassian JIRA
(v6.2#6252)