This is really cool news -- it's always really interesting to see benchmark studies and the trade-offs we make while choosing different formats. Thanks for sharing!
I'd love to see links to some curated articles and papers on the website! I created AVRO-3308 if you don't object :D Ryan On Fri, Jan 14, 2022 at 10:49 AM Martin Grigorov <mgrigo...@apache.org> wrote: > > Hi Juan, > > Thank you for sharing your work with us! > > It comes right in time for me! > I am working on the interop tests for the new Rust module and it seems there > is some problem to read the .avro files generated by Java. I may need to dive > in the binary diffs. > > Regards, > Martin > > On Thu, Jan 13, 2022 at 11:14 PM Juan Cruz Viotti <j...@jviotti.com> wrote: >> >> Hey there! >> >> As part of my MSc dissertation at University of Oxford, I wrote and >> published two papers covering the characteristics of various binary >> serialization formats, including Apache Avro and performing a >> space-efficiency benchmark, respectively. >> >> Sharing them here in case anybody finds them interesting! The first >> paper explains how Apache Avro works including an annotated hexadecimal >> example and the second compares Apache Avro to various other popular >> serialization formats. >> >> - A Survey of JSON-compatible Binary Serialization Specifications: >> https://arxiv.org/abs/2201.02089 >> - A Benchmark of JSON-compatible Binary Serialization Specifications: >> https://arxiv.org/abs/2201.03051 >> >> The benchmark study has proved Apache Avro to be one of the most >> space-efficient formats considered. >> >> All the best! >> >> -- >> Juan Cruz Viotti >> Technical Lead @ Postman.com >> https://www.jviotti.com