avro RAM usage

marius Wed, 12 Aug 2015 07:16:45 -0700

Hey,

i am currently doing some performance tests for my BSc thesis and iwondered how exactly the parsing of avro files when reading them works.From my understanding the data is read block by block from the file(rather than datum by datum) and then the datums are deserialized. Isthis correct (this would mean that the memory usage of avro is dependingon the block size rather than the datum size of each datum) or does itdepend on the used implementation?

My second question is if there is a way to read the file datum by datum.I want to create an index which stores the byte offsets of the avro fileso i can use e.g. seek() to go to that position and deserialize thefollowing datum. Is this even possible or can i only start at positionswith sync marker?


Greetings and thanks

Marius

avro RAM usage

Reply via email to