Thanks for the quick answer.
It is totally OK and reasonable to take one file in a directory and
assume all the others have the same schema.
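
To illustrate what "take the schema from one file" involves: an Avro
container file starts with the magic bytes "Obj" plus 0x01, followed by a
metadata map whose "avro.schema" entry holds the writer schema as JSON.
Below is a minimal JDK-only sketch that reads that entry from a stream.
The class AvroHeaderSchema and its method names are hypothetical, made up
for this example; in practice Avro's own DataFileReader exposes the same
information through getSchema(), and this is not how Crunch does it.

```java
import java.io.DataInputStream;
import java.io.EOFException;
import java.io.FileInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;
import java.util.Arrays;

/** Hypothetical helper: extracts the writer schema (JSON) from an Avro container file header. */
final class AvroHeaderSchema {
    // Every Avro object container file begins with these four bytes.
    private static final byte[] MAGIC = {'O', 'b', 'j', 1};

    /** Decodes one Avro long (variable-length, zig-zag encoded). */
    static long readLong(InputStream in) throws IOException {
        long raw = 0;
        int shift = 0;
        while (true) {
            int b = in.read();
            if (b < 0) throw new EOFException("truncated Avro header");
            raw |= (long) (b & 0x7F) << shift;
            if ((b & 0x80) == 0) break;          // high bit clear: last byte
            shift += 7;
            if (shift > 63) throw new IOException("varint too long");
        }
        return (raw >>> 1) ^ -(raw & 1);         // undo zig-zag
    }

    /** Reads a length-prefixed byte sequence (Avro "bytes" and "string" share this layout). */
    static byte[] readBytes(DataInputStream in) throws IOException {
        long len = readLong(in);
        if (len < 0 || len > Integer.MAX_VALUE) throw new IOException("bad length " + len);
        byte[] buf = new byte[(int) len];
        in.readFully(buf);
        return buf;
    }

    /** Returns the JSON text stored under "avro.schema" in the file header. */
    static String readSchemaJson(InputStream raw) throws IOException {
        DataInputStream in = new DataInputStream(raw);
        byte[] magic = new byte[MAGIC.length];
        in.readFully(magic);
        if (!Arrays.equals(magic, MAGIC)) throw new IOException("not an Avro container file");

        String schema = null;
        // The metadata is an Avro map<string, bytes>: blocks of entries, ended by a zero count.
        for (long count = readLong(in); count != 0; count = readLong(in)) {
            if (count < 0) {                     // negative count: block byte size follows
                count = -count;
                readLong(in);                    // size is not needed here, skip it
            }
            for (long i = 0; i < count; i++) {
                String key = new String(readBytes(in), StandardCharsets.UTF_8);
                byte[] value = readBytes(in);
                if (key.equals("avro.schema")) schema = new String(value, StandardCharsets.UTF_8);
            }
        }
        if (schema == null) throw new IOException("header has no avro.schema entry");
        return schema;
    }

    public static void main(String[] args) throws IOException {
        if (args.length != 1) {
            System.err.println("usage: AvroHeaderSchema <file.avro>");
            return;
        }
        try (InputStream in = new FileInputStream(args[0])) {
            System.out.println(readSchemaJson(in));
        }
    }
}
```

Running it against any one file in the input directory prints that file's
schema JSON. Assuming the other files share it, that JSON could then be
parsed into a Schema and used as the schema that would otherwise have to
be provided by hand.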
On 2014-01-27 18:27, Josh Wills wrote:
No, I haven't written a way to do that yet, and I feel bad about it--
a Clouderan asked me for just such a feature a couple of weeks ago and
it slipped my mind. I don't think it's hard to do, just a little
tedious and will require refreshing my memory of the Avro APIs.
There's also the potential issue that multiple Avro files in the same
input directory can have different schemas, so the one we would end up
reading might be somewhat arbitrary (e.g., based on the timestamp of
the files in the directory, or some such thing)-- is that ok?
On Mon, Jan 27, 2014 at 9:12 AM, Magnus Runesson wrote:
In (s)crunch, can I read an Avro file into GenericRecords without
providing the schema? I want Crunch to get the schema from the
Avro file it reads. How do I do that?
/Magnus