Unless someone has implemented the corresponding changes in Parquet-MR it will not be compatible with hadoop (I haven't been paying close attention but I don't recall seeing a PR adding support for parquet-mr).
On Tue, Jun 29, 2021 at 5:01 PM Weston Pace <[email protected]> wrote: > Right now I believe July is the target. More info here: > > https://lists.apache.org/thread.html/r69825df037e16a0c10add07cdd6c8b3bb0010bbc18ba56bfdc1c5df8%40%3Cdev.arrow.apache.org%3E > > On Tue, Jun 29, 2021 at 1:51 PM Yuan, Hangjian <[email protected]> wrote: > > > > Hi Apache Arrow community, > > > > > > > > Thank you for the effort of making such a great open-source framework. :) > > > > > > > > I found Parquet C++ lib in Apache Arrow 5.0 will provide LZ4_RAW option > for compressing parquet files, which should be compatible with Hadoop. Can > I get a rough release date for version 5 or LZ4_RAW option? > > > > > > > > Thanks, > > > > Hangjian >
