Hi everyone,

At the last Parquet sync-up, I mentioned that I've been working on a new Parquet CLI tool (based on Cloudera's Kite CLI). I haven't had a chance to move the build to Maven or get the licensing taken care of for an Apache submission, but it is clean enough that people can start looking at it. I've posted it here:

https://github.com/rdblue/parquet-cli

The build uses Gradle and the jar is run with the hadoop command, like the current tools. It is based on parquet-avro and can convert between Avro, Parquet, CSV, and JSON. It has been a great tool for trying different settings and for inspecting Parquet file metadata/dictionaries more easily.

Please have a look. I'm interested to know if anyone would like this added to the Parquet project.

Thanks!

rb

--
Ryan Blue
Software Engineer
Netflix
