@wesm the tricky part is that parquet-mr uses `hadoop_lz4` but marks the compression metadata as `lz4`. So arrow cannot detect the codec is actually `hadoop_lz4`.
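To make the mismatch concrete, here is a rough sketch of how a reader might guess at the framing when the metadata only says `lz4`. It assumes, as far as I understand Hadoop's LZ4 codec, that each block is prefixed with a 4-byte big-endian uncompressed size and a 4-byte big-endian compressed size, followed by a raw LZ4 block. The function name `looks_like_hadoop_lz4` is hypothetical and not part of arrow or parquet-mr; it only inspects the headers and does not decompress anything.

```python
import struct


def looks_like_hadoop_lz4(buf: bytes) -> bool:
    """Heuristically check whether buf is framed like Hadoop LZ4 output.

    Assumed layout (an assumption, not taken from the parquet spec):
    one or more blocks, each made of
      4 bytes  big-endian uncompressed size
      4 bytes  big-endian compressed size
      N bytes  raw LZ4 block, where N is the compressed size
    """
    n = len(buf)
    if n < 8:
        return False  # too short to hold even one header
    pos = 0
    while pos < n:
        if n - pos < 8:
            return False  # trailing bytes cannot form a header
        _uncompressed_size, compressed_size = struct.unpack_from(">II", buf, pos)
        pos += 8
        # The declared block must be non-empty and fit in the remaining bytes.
        if compressed_size == 0 or compressed_size > n - pos:
            return False
        pos += compressed_size
    return True
```

A check like this can give false positives, since a raw LZ4 block may happen to begin with bytes that parse as a plausible header. A reader would probably have to try the Hadoop framing first and fall back to a raw LZ4 block if decompression fails.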
We could do the following instead:

1. Change parquet-mr's lz4 codec to use the `lz4/lz4-java` implementation, just as parquet-mr uses `org.xerial.snappy` instead of `hadoop_snappy`.
2. And/or try to add an lz4-frame codec to both parquet-mr and arrow.

[ Full content available at: https://github.com/apache/arrow/pull/2479 ]
