If you want to do that outside of MR you can use https://github.com/apache/incubator-parquet-mr/blob/master/parquet-avro/src/main/java/parquet/avro/AvroParquetReader.java
On Wed, Aug 27, 2014 at 11:09 AM, pratik khadloya <[email protected]> wrote: > Hello, > > Is there a reader which takes in a list of columns or a avro/thrift > sub-schema and spits back a 2D array of rows? > > If not can anyone please give me an idea on how to go about coding that up. > From the "DumpCommand" example in the parquet-tools code, i see that to > read multiple columns, it goes into a loop one by one and spits out one > column at a time. Is that the best way to do it? > > Also refer: > > http://stackoverflow.com/questions/25334466/parquet-read-particular-columns-into-memory > > > Thanks, > Pratik >
