As I understand it, Avro container files contain synchronization markers
every so often to support splitting the file.  See:
https://cwiki.apache.org/AVRO/faq.html#FAQ-Whatisthepurposeofthesyncmarkerintheobjectfileformat%3F

(1) Why isn't the synchronization marker the same for every container
file?  (i.e. what is the point of generating it randomly every time)

(2) Is it possible, at least in theory, for naturally occurring data to
contain bytes that match the sync marker? If so, would this break
synchronization?

Thanks,
Josh

Reply via email to