[ 
https://issues.apache.org/jira/browse/HBASE-2055?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andrew Purtell updated HBASE-2055:
----------------------------------

    Attachment: HBASE-2055-v2.patch

v2 patch passes all tests. Also, in this version we write the schema as a file 
header and use it to initialize the reader. 

In case anyone is curious, we are not using Avro's bundled file I/O package 
because the file format puts schema and metadata into a trailer so seems not 
suitable as a log which may be truncated as part of "normal" operation. 

> Serialize WAL as Avro records
> -----------------------------
>
>                 Key: HBASE-2055
>                 URL: https://issues.apache.org/jira/browse/HBASE-2055
>             Project: Hadoop HBase
>          Issue Type: Improvement
>            Reporter: Andrew Purtell
>            Priority: Minor
>         Attachments: HBASE-2055-v2.patch, HBASE-2055.patch, 
> jackson-core-asl-1.0.1.jar, jackson-mapper-asl-1.0.1.jar, paranamer-1.5.jar, 
> TEST-org.apache.hadoop.hbase.regionserver.wal.TestHLog.txt.gz, 
> TEST-org.apache.hadoop.hbase.regionserver.wal.TestLogRolling.txt.gz, 
> TEST-org.apache.hadoop.hbase.TestFullLogReconstruction.txt.gz, test-site.patch
>
>
> There was some advocacy of using Avro for serialization of HBase WAL records 
> up on hbase-...@. Idea is Hadoop core is getting away from Writables and Avro 
> is the blessed replacement. 
> I think we have this criteria for its use:
> 1) Performance of writing Avro records is no worse than that for writing 
> Writables into a SequenceFile.
> 2) Space consumed by Avro serialization is no worse than that of Writables
> 3) File format is amenable to appends (cannot require valid trailers, etc.)
> I'll put up a patch so we can try it out. 

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to