[ https://issues.apache.org/jira/browse/HBASE-5604?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13242584#comment-13242584 ]
Lars Hofhansl commented on HBASE-5604:
--------------------------------------

Ok... I'll do a separate tool called WALPlayer.

The rationale for Import would be that, indeed, right now it can import HLog files. I added HFile output to Import in HBASE-5440.

HDFS would probably not work, as these files would be copied around for archiving.

Changing the names of the WALs is interesting. I'd be worried about side effects on other tools.
Maybe it's not even an issue. A reader of the HLog can tell after looking at the first log entries whether it can ignore the rest of the file.

> HLog replay tool that generates HFiles for use by LoadIncrementalHFiles.
> ------------------------------------------------------------------------
>
>                 Key: HBASE-5604
>                 URL: https://issues.apache.org/jira/browse/HBASE-5604
>             Project: HBase
>          Issue Type: New Feature
>            Reporter: Lars Hofhansl
>
> Just an idea I had. Might be useful for restoring a backup using the HLogs.
> This could be an M/R job (with a mapper per HLog file).
> The tool would get a time range and a (set of) table(s). We'd pick the right
> HLogs based on time before the M/R job is started and then have a mapper per
> HLog file.
> The mapper would then go through the HLog, filter out all WALEdits that don't
> fit into the time range or don't belong to any of the tables, and then use
> HFileOutputFormat to generate HFiles.
> We would need to indicate the splits we want, probably from a live table.
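
The filtering step described for the mapper (drop every edit that falls outside the time range or belongs to a table that was not requested) can be sketched roughly as follows. This is only an illustration in plain Java, not the WALPlayer implementation: `WalEntry`, `WalFilterSketch`, `accept` and `filter` are hypothetical names, no HBase classes are used, and the half-open interval [minTime, maxTime) is an assumption.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

// Hypothetical stand-in for one log entry; this is NOT the HBase HLog/WALEdit API.
final class WalEntry {
    final String table;    // table the edit belongs to
    final long writeTime;  // write time in milliseconds
    final String payload;  // placeholder for the actual edit

    WalEntry(String table, long writeTime, String payload) {
        this.table = table;
        this.writeTime = writeTime;
        this.payload = payload;
    }
}

final class WalFilterSketch {
    // Keep an entry only if its table was requested and
    // minTime <= writeTime < maxTime (half-open range, an assumption).
    static boolean accept(WalEntry e, Set<String> tables, long minTime, long maxTime) {
        return tables.contains(e.table)
                && e.writeTime >= minTime
                && e.writeTime < maxTime;
    }

    // What a per-file mapper would do conceptually: scan the log in order
    // and pass on only the accepted entries.
    static List<WalEntry> filter(List<WalEntry> log, Set<String> tables,
                                 long minTime, long maxTime) {
        List<WalEntry> kept = new ArrayList<>();
        for (WalEntry e : log) {
            if (accept(e, tables, minTime, maxTime)) {
                kept.add(e);
            }
        }
        return kept;
    }

    public static void main(String[] args) {
        List<WalEntry> log = Arrays.asList(
                new WalEntry("t1", 50L, "a"),    // before the range
                new WalEntry("t1", 150L, "b"),   // inside the range, wanted table
                new WalEntry("t2", 150L, "c"),   // inside the range, other table
                new WalEntry("t1", 200L, "d"));  // at the exclusive upper bound
        Set<String> tables = new HashSet<>(Arrays.asList("t1"));

        List<WalEntry> kept = filter(log, tables, 100L, 200L);
        System.out.println(kept.size() + " of " + log.size() + " entries kept");
    }
}
```

In a real job the accepted edits would then be handed to HFileOutputFormat rather than collected in a list; that part is omitted here because it depends on the HBase version and on the splits taken from the target table.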