[ 
https://issues.apache.org/jira/browse/HBASE-7880?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Matteo Bertozzi updated HBASE-7880:
-----------------------------------

    Attachment: HBASE-7880-v0.patch

Attached a quick-and-dirty patch to rewrite and recover an hfile. We could also
add MapReduce support, as the compaction tool has, so that the user can specify
a source directory and recover all the files in it, with the per-file recovery
work distributed across tasks.

The HFile Reader/Scanner have a strong dependency on the trailer and the index,
which makes it difficult to reuse their code to scan just the blocks. Maybe we
can refactor the code a bit to isolate the parts (like reading key/values) that
don't really need the trailer dependency.
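
To illustrate the kind of trailer-independent key/value reading meant here,
below is a rough sketch. It does not use the HBase API and is not what the
attached patch does. It assumes an already uncompressed, unencoded block body
made of big-endian [int keyLen][int valueLen][key][value] records, and it
ignores block headers, checksums and any per-cell extras. The names
BlockRecoverySketch, Entry and salvage are hypothetical.

```java
import java.nio.ByteBuffer;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;

public class BlockRecoverySketch {

    /** One recovered entry; a hypothetical holder, not an HBase class. */
    public static final class Entry {
        public final byte[] key;
        public final byte[] value;

        Entry(byte[] key, byte[] value) {
            this.key = key;
            this.value = value;
        }
    }

    /**
     * Reads length-prefixed key/value records from a block body without
     * consulting any trailer or index. Stops at the first record whose
     * lengths are implausible and returns everything read up to that point.
     */
    public static List<Entry> salvage(ByteBuffer block) {
        List<Entry> recovered = new ArrayList<>();
        ByteBuffer buf = block.duplicate(); // leave the caller's position untouched
        while (buf.remaining() >= 8) {
            int keyLen = buf.getInt();
            int valueLen = buf.getInt();
            // Assumed sanity check: lengths must be non-negative and fit in the block.
            if (keyLen <= 0 || valueLen < 0
                    || (long) keyLen + valueLen > buf.remaining()) {
                break;
            }
            byte[] key = new byte[keyLen];
            byte[] value = new byte[valueLen];
            buf.get(key);
            buf.get(value);
            recovered.add(new Entry(key, value));
        }
        return recovered;
    }

    private static void put(ByteBuffer buf, String key, String value) {
        byte[] k = key.getBytes(StandardCharsets.UTF_8);
        byte[] v = value.getBytes(StandardCharsets.UTF_8);
        buf.putInt(k.length).putInt(v.length).put(k).put(v);
    }

    public static void main(String[] args) {
        ByteBuffer block = ByteBuffer.allocate(64);
        put(block, "row1", "a");
        put(block, "row2", "b");
        block.putInt(1000).putInt(5); // simulated damaged record: lengths exceed the block
        block.flip();
        for (Entry e : salvage(block)) {
            System.out.println(new String(e.key, StandardCharsets.UTF_8)
                    + " => " + new String(e.value, StandardCharsets.UTF_8));
        }
    }
}
```

In this sketch the two intact records are returned and the damaged one is
dropped, which matches the "recover as much as possible" goal. A real tool
would still need the user-supplied compression and encoding settings to get
from the raw file bytes to a block body like this one.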
                
> HFile Recovery/Rewrite Tool
> ---------------------------
>
>                 Key: HBASE-7880
>                 URL: https://issues.apache.org/jira/browse/HBASE-7880
>             Project: HBase
>          Issue Type: New Feature
>          Components: HFile
>    Affects Versions: 0.96.0
>            Reporter: Matteo Bertozzi
>            Assignee: Matteo Bertozzi
>            Priority: Minor
>         Attachments: HBASE-7880-v0.patch
>
>
> Sometimes it is useful to have a tool to migrate files from a new version to
> an old version (e.g. convert a new XYZ encoded/compressed file to an old
> "uncompressed" format).
> It would also be useful to be able to recover an hfile from a corrupted
> state (e.g. trailer missing/broken, ...).
> The "user" can provide the information about the file (compression & co) and
> try to recover as much as possible from the file by reading data blocks.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
