If you want to do that outside of MapReduce (MR), you can use AvroParquetReader:
https://github.com/apache/incubator-parquet-mr/blob/master/parquet-avro/src/main/java/parquet/avro/AvroParquetReader.java
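
A rough sketch of how the "2D array of rows" part could look once you have a reader. This is not code from parquet-mr: ColumnRows, RecordReader and FieldGetter are hypothetical names standing in for the real classes so that the loop can be run without Parquet on the classpath. It assumes the reader hands back one record per read() call and null at the end of the file, which is how AvroParquetReader behaves as far as I know. With the real reader the records would be Avro GenericRecords, and field lookup would be record.get(columnName). To read only some columns from disk, I believe you can set a sub-schema with AvroReadSupport.setRequestedProjection(conf, schema) before opening the reader, but please check that against the version you are using.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashMap;
import java.util.Iterator;
import java.util.List;
import java.util.Map;

final class ColumnRows {

    /** Hypothetical stand-in for a record reader: returns the next record, or null at end of input. */
    interface RecordReader<T> {
        T read();
    }

    /** Hypothetical stand-in for looking up a field by name on a record. */
    interface FieldGetter<T> {
        Object get(T record, String column);
    }

    /** Drains the reader and returns one Object[] per record, with values in the order of `columns`. */
    static <T> Object[][] readRows(RecordReader<T> reader, FieldGetter<T> getter, List<String> columns) {
        List<Object[]> rows = new ArrayList<>();
        T record;
        while ((record = reader.read()) != null) {
            Object[] row = new Object[columns.size()];
            for (int i = 0; i < columns.size(); i++) {
                row[i] = getter.get(record, columns.get(i));
            }
            rows.add(row);
        }
        return rows.toArray(new Object[0][]);
    }

    public static void main(String[] args) {
        // In-memory records used here in place of a Parquet file.
        List<Map<String, Object>> data = new ArrayList<>();
        Map<String, Object> first = new HashMap<>();
        first.put("id", 1);
        first.put("name", "a");
        Map<String, Object> second = new HashMap<>();
        second.put("id", 2);
        second.put("name", "b");
        data.add(first);
        data.add(second);

        Iterator<Map<String, Object>> it = data.iterator();
        RecordReader<Map<String, Object>> reader = () -> it.hasNext() ? it.next() : null;
        FieldGetter<Map<String, Object>> getter = (record, column) -> record.get(column);

        Object[][] rows = readRows(reader, getter, Arrays.asList("name", "id"));
        System.out.println(Arrays.deepToString(rows)); // prints [[a, 1], [b, 2]]
    }
}
```

Note that this keeps every row in memory, so for a large file you would probably want to process records as they are read instead of collecting them all.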




On Wed, Aug 27, 2014 at 11:09 AM, pratik khadloya <[email protected]>
wrote:

> Hello,
>
> Is there a reader which takes in a list of columns or an Avro/Thrift
> sub-schema and spits back a 2D array of rows?
>
> If not, can anyone please give me an idea of how to go about coding that up?
> From the "DumpCommand" example in the parquet-tools code, I see that to
> read multiple columns, it goes into a loop one by one and spits out one
> column at a time. Is that the best way to do it?
>
> Also see:
>
> http://stackoverflow.com/questions/25334466/parquet-read-particular-columns-into-memory
>
>
> Thanks,
> Pratik
>
