Hi,

I need to read multiple AvroParquet files (each written with a different
avro schema) in a single MR job.

The AvroParquetInputFormat has only a static method setAvroReadSchema() for
setting the reader schema.

I tried creating a Union Avro schema (union of the two individual avro
schemas) and setting that as the ReadSchema but turns out that in
AvroParquetInputFormat the top level item in the Avro schema has to be a
Record and not a union.

How can I achieve my usecase? Any suggestions/pointers most appreciated.

Thanks & Regards

MK

Reply via email to