[
https://issues.apache.org/jira/browse/HBASE-7880?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Matteo Bertozzi updated HBASE-7880:
-----------------------------------
Attachment: HBASE-7880-v0.patch
Attached a quick and dirty patch to rewrite and recover an HFile. We may also
add MapReduce support, like the compaction tool has, so that the user can
specify a source directory and recover all the files in it, with the recovery
work distributed across the cluster.
The HFile Reader/Scanner have a strong dependency on the trailer and the index,
which makes it difficult to reuse code to scan just the blocks. Maybe we can
refactor the code a bit to isolate the parts (like reading key/values) that
don't really need the trailer dependency.
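To illustrate the "scan just the blocks" idea, here is a minimal sketch of
locating candidate data blocks without the trailer or the index. It is not the
code in the attached patch. It assumes that each unencoded data block starts
with the 8-byte magic "DATABLK*", as in the HFile v2 layout; the class and
method names (DataBlockScanner, findDataBlockOffsets) are hypothetical.

```java
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;

/** Hypothetical helper; not part of HBase or of the attached patch. */
final class DataBlockScanner {
    // Assumption: unencoded data blocks begin with this 8-byte magic (HFile v2).
    static final byte[] DATA_MAGIC = "DATABLK*".getBytes(StandardCharsets.US_ASCII);

    /** Returns every offset in buf at which the data block magic occurs. */
    static List<Integer> findDataBlockOffsets(byte[] buf) {
        List<Integer> offsets = new ArrayList<>();
        // Stop once fewer than DATA_MAGIC.length bytes remain.
        for (int i = 0; i + DATA_MAGIC.length <= buf.length; i++) {
            if (matchesAt(buf, i)) {
                offsets.add(i);
            }
        }
        return offsets;
    }

    /** True if the magic is found in buf starting at pos. */
    private static boolean matchesAt(byte[] buf, int pos) {
        for (int j = 0; j < DATA_MAGIC.length; j++) {
            if (buf[pos + j] != DATA_MAGIC[j]) {
                return false;
            }
        }
        return true;
    }
}
```

The offsets are only candidates: the same 8 bytes can also appear inside block
contents, so a real tool would still have to validate the header fields that
follow the magic and decode each block with the compression/encoding
information supplied by the user.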
> HFile Recovery/Rewrite Tool
> ---------------------------
>
> Key: HBASE-7880
> URL: https://issues.apache.org/jira/browse/HBASE-7880
> Project: HBase
> Issue Type: New Feature
> Components: HFile
> Affects Versions: 0.96.0
> Reporter: Matteo Bertozzi
> Assignee: Matteo Bertozzi
> Priority: Minor
> Attachments: HBASE-7880-v0.patch
>
>
> Sometimes it is useful to have a tool to migrate files from a new version to
> an old version (e.g. convert a new XYZ encoded/compressed file to an old
> "uncompressed" format).
> It would also be useful to be able to recover an hfile from a corrupted
> state (e.g. trailer missing/broken, ...).
> The "user" can provide the information about the file (compression & co) and
> try to recover as much as possible from the file by reading data blocks.