How would I go about reading a Parquet file locally in Java code? Say I have a 
java.io.InputStream - what's the simplest way to get the structured data out of 
it, one record at a time?

Same question for writing data to a file locally.

I'm looking for something that preferably doesn't involve Hadoop - just a 
simple reader/writer for the Parquet format.

Feel free to point me at any existing examples or discussions, but everything 
I've seen has been tightly coupled to Hadoop, Pig, etc.

This is the closest I could find for reading, but it still depends on Hadoop: 
https://github.com/apache/incubator-parquet-mr/blob/a0c77b6a442e2c4a355a4b145898bed976f23bb4/parquet-tools/src/main/java/parquet/tools/command/CatCommand.java#L56

Thanks,
Matt


