Avro Container Files are always splittable[1]. They're the way you will commonly interact with Avro serialized data.
Data serialized as Avro's binary encoding is not splittable by itself, because the encoding includes no markers[2]. This may be the source of the disconnect you're finding in online docs. [1]: http://avro.apache.org/docs/1.7.7/spec.html#Object+Container+Files [2]: http://avro.apache.org/docs/1.7.7/spec.html#Data+Serialization On Thu, Jun 25, 2015 at 12:54 AM, Ankur Jain <[email protected]> wrote: > Hello, > > > > I am reading various forms and docs, somewhere it is mentioned that avro > is splittable and somewhere non-splittable. > > So which one is right?? > > > > Regards, > > Ankur > > > Information transmitted by this e-mail is proprietary to YASH > Technologies and/ or its Customers and is intended for use only by the > individual or entity to which it is addressed, and may contain information > that is privileged, confidential or exempt from disclosure under applicable > law. If you are not the intended recipient or it appears that this mail has > been forwarded to you without proper authority, you are notified that any > use or dissemination of this information in any manner is strictly > prohibited. In such cases, please notify us immediately at [email protected] > and delete this mail from your records. > -- Sean
