[
https://issues.apache.org/jira/browse/HBASE-10642?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Lars Hofhansl updated HBASE-10642:
----------------------------------
Attachment: 10642-0.94-v4.txt
V4 has the block distribution logic, and scan metrics, but keeps changes to
existing HBase classes to a minimum.
[~enis], could have a look, especially at the block distribution stuff? (The
existing 0.94 picked up the distribution from the table, not the snapshot, I am
not sure the HFileLinks influence this and whether even the trunk patch does
the right thing).
Also, in the trunk version I notice that we update the counters after each
record, is that by design? Seems CPU heavy.
> Add M/R over snapshots to 0.94
> ------------------------------
>
> Key: HBASE-10642
> URL: https://issues.apache.org/jira/browse/HBASE-10642
> Project: HBase
> Issue Type: Bug
> Reporter: Lars Hofhansl
> Fix For: 0.94.18
>
> Attachments: 10642-0.94-v2.txt, 10642-0.94-v3.txt, 10642-0.94-v4.txt,
> 10642-0.94.txt, SnapshotInputFormat.java
>
>
> I think we want drive towards all (or most) M/R over HBase to be against
> snapshots and HDFS directly.
> Adopting a simple input format (even if just as a sample) as part of HBase
> will allow us to direct users this way.
--
This message was sent by Atlassian JIRA
(v6.2#6252)