In the latest release, no. However, I've filed a ticket for this feature at
https://issues.apache.org/jira/browse/HBASE-13356 and uploaded a patch for
review.
The patch provides a MultiTableSnapshotInputFormat which can run a list
of scans over multiple snapshots. Jobs can be initialized using:
    public static void initMultiTableSnapshotMapperJob(
        Map<String, Collection<Scan>> snapshotScans,
        Class<? extends TableMapper> mapper,
        Class<?> outputKeyClass,
        Class<?> outputValueClass,
        Job job,
        boolean addDependencyJars,
        Path tmpRestoreDir) throws IOException
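For illustration, here is a minimal sketch of a driver that calls this method. It assumes the method is exposed on TableMapReduceUtil, like the other init...Job helpers. The snapshot names ("snapshot_a", "snapshot_b"), the column family "cf", the restore path, and the class names are placeholders, not part of the patch. The mapper only counts rows and passes them through, and the output is discarded, so a real merge job would still need to write to the target table.

```java
import java.io.IOException;
import java.util.ArrayList;
import java.util.Collection;
import java.util.HashMap;
import java.util.Map;

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.hbase.HBaseConfiguration;
import org.apache.hadoop.hbase.client.Result;
import org.apache.hadoop.hbase.client.Scan;
import org.apache.hadoop.hbase.io.ImmutableBytesWritable;
import org.apache.hadoop.hbase.mapreduce.TableMapReduceUtil;
import org.apache.hadoop.hbase.mapreduce.TableMapper;
import org.apache.hadoop.hbase.util.Bytes;
import org.apache.hadoop.mapreduce.Job;
import org.apache.hadoop.mapreduce.lib.output.NullOutputFormat;

public class MultiSnapshotJobSketch {

  /** Pass-through mapper: counts each row and emits it unchanged. */
  public static class RowMapper extends TableMapper<ImmutableBytesWritable, Result> {
    @Override
    protected void map(ImmutableBytesWritable key, Result value, Context context)
        throws IOException, InterruptedException {
      context.getCounter("sketch", "rows").increment(1);
      context.write(key, value);
    }
  }

  /** Builds the snapshot-name -> scans map; names and family are placeholders. */
  static Map<String, Collection<Scan>> buildSnapshotScans() {
    Map<String, Collection<Scan>> snapshotScans = new HashMap<>();
    for (String snapshot : new String[] {"snapshot_a", "snapshot_b"}) {
      Scan scan = new Scan();
      scan.addFamily(Bytes.toBytes("cf"));
      Collection<Scan> scans = new ArrayList<>();
      scans.add(scan); // several scans per snapshot are allowed
      snapshotScans.put(snapshot, scans);
    }
    return snapshotScans;
  }

  public static void main(String[] args) throws Exception {
    Configuration conf = HBaseConfiguration.create();
    Job job = Job.getInstance(conf, "multi-snapshot-sketch");
    job.setJarByClass(MultiSnapshotJobSketch.class);

    // Assumption: the new method lives on TableMapReduceUtil.
    TableMapReduceUtil.initMultiTableSnapshotMapperJob(
        buildSnapshotScans(), RowMapper.class,
        ImmutableBytesWritable.class, Result.class,
        job, true, new Path("/tmp/snapshot_restore")); // placeholder scratch dir

    // Map-only job; rows written by the mapper are discarded in this sketch.
    job.setNumReduceTasks(0);
    job.setOutputFormatClass(NullOutputFormat.class);
    System.exit(job.waitForCompletion(true) ? 0 : 1);
  }
}
```

Each map key is a snapshot name, and each value is the collection of scans to run against that snapshot.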
Hope this helps!
Andrew
On 5/22/15 2:35 AM, Shi, Shaofeng wrote:
Hello,
We have a scenario that needs to merge multiple HBase tables into one table
periodically. To get better performance and minimize the impact on the HBase
server, we are evaluating TableSnapshotInputFormat
(http://www.slideshare.net/enissoz/mapreduce-over-snapshots). But from the API
we see that it only allows one snapshot as input. Is it possible to change it
to allow multiple snapshots?
Thanks in advance for any advice,
Shaofeng Shi
Apache Kylin