[
https://issues.apache.org/jira/browse/HBASE-13356?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14512866#comment-14512866
]
Ted Yu commented on HBASE-13356:
--------------------------------
MultiTableSnapshotInputFormat.java and MultiTableSnapshotInputFormatImpl.java
need Apache license. Add annotation for audience.
There're several long lines - please limit line width to 100 characters.
{code}
125 * Sets up the job for reading from one or more multiple table
snapshots, with one or more scan per snapshot.
{code}
Should 'one or more multiple table snapshots' be 'one or more table snapshots' ?
nit: 'one or more scan' -> 'one or more scans'
{code}
26 public class MultiTableSnapshotInputFormatImpl {
27
28 private static final Log LOG =
LogFactory.getLog(MultiTableSnapshotInputFormat.class);
{code}
Classname for LOG doesn't match the real classname.
{code}
85 for (TableSnapshotInputFormatImpl.InputSplit split : splits) {
86 rtn.add(split);
87 }
{code}
Can you use
https://docs.oracle.com/javase/7/docs/api/java/util/List.html#addAll(java.util.Collection)
?
{code}
177 private Map<String, Path>
generateSnapshotToRestoreDir(Collection<String> snapshots, Path baseRestoreDir)
{
{code}
Name the method generateSnapshotToRestoreDirMapping().
> HBase should provide an InputFormat supporting multiple scans in mapreduce
> jobs over snapshots
> ----------------------------------------------------------------------------------------------
>
> Key: HBASE-13356
> URL: https://issues.apache.org/jira/browse/HBASE-13356
> Project: HBase
> Issue Type: New Feature
> Components: mapreduce
> Reporter: Andrew Mains
> Assignee: Andrew Mains
> Priority: Minor
> Attachments: HBASE-13356.patch
>
>
> Currently, HBase supports the pushing of multiple scans to mapreduce jobs
> over live tables (via MultiTableInputFormat) but only supports a single scan
> for mapreduce jobs over table snapshots. It would be handy to support
> multiple scans over snapshots as well, probably through another input format
> (MultiTableSnapshotInputFormat?). To mimic the functionality present in
> MultiTableInputFormat, the new input format would likely have to take in the
> names of all snapshots used in addition to the scans.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)