Yes, thats the flag we tried and ensured its getting read and propagated.

Regards
Manik Singla
+91-9996008893
+91-9665639677

"Life doesn't consist in holding good cards but playing those you hold
well."


On Mon, Oct 21, 2019 at 12:51 PM Driesprong, Fokko <fo...@driesprong.frl>
wrote:

> Thanks Manik,
>
> Did you try setting the Hadoop io.compression.codec.zstd.level config?
>
> Cheers, Fokko
>
> Op za 19 okt. 2019 om 12:24 schreef Manik Singla <smanik...@gmail.com>:
>
> > Hi Fokko and Martin
> >
> > We are using parquet-hadoop which support compressions from
> parquet-format.
> > In our case, we were getting same compression even after changing
> > compression level of zstd.
> > We confirmed that  set level is being passed by ZStandardCompressor in
> init
> > which is native call .
> >
> > To confirm the issue, we tried same by injecting own implementation of
> zstd
> > and that seem to work fine.
> > We will have a look how its working for spark and not for us.
> >
> > Regards
> > Manik Singla
> > +91-9996008893
> > +91-9665639677
> >
> > "Life doesn't consist in holding good cards but playing those you hold
> > well."
> >
> >
> > On Fri, Oct 18, 2019 at 5:20 PM Driesprong, Fokko <fo...@driesprong.frl>
> > wrote:
> >
> > > Hi Falak,
> > >
> > > I was able to set the compression level in Spark using
> > > spark.io.compression.zstd.level.
> > >
> > > Cheers, Fokko
> > >
> > > Op do 17 okt. 2019 om 20:53 schreef Radev, Martin <martin.ra...@tum.de
> >:
> > >
> > > > Hi Falak,
> > > >
> > > >
> > > > I was one of the people who recently exposed this to Arrow but this
> is
> > > not
> > > > part of the Parquet specification.
> > > >
> > > > In particular, any implementation for writing parquet files can
> decide
> > > > whether to expose this or select a reasonable value internally.
> > > >
> > > >
> > > > If you're using Arrow, you would have to read the documentation of
> the
> > > > specified compressor. Arrow doesn't do checks for whether specified
> > > > compression level is within the range of what's supported by the
> codec.
> > > For
> > > > ZSTD, the range should be [1, 22].
> > > >
> > > > Let me know if you're using Arrow and I can check locally that there
> > > isn't
> > > > by any chance a bug with propagating the value. At the moment there
> are
> > > > only smoke tests that nothing crashes.
> > > >
> > > >
> > > > Regards,
> > > >
> > > > Martin
> > > > ------------------------------
> > > > *From:* Falak Kansal <fa...@sumologic.com>
> > > > *Sent:* Thursday, October 17, 2019 4:43:54 PM
> > > > *To:* Driesprong, Fokko
> > > > *Cc:* dev@parquet.apache.org
> > > > *Subject:* Re: custom CompressionCodec support
> > > >
> > > > Hi Fokko,
> > > >
> > > > Thanks for replying, yes sure.
> > > > The problem we are facing is that with parquet zstd we are not able
> to
> > > > control the compression level, we tried setting different compression
> > > > levels but it doesn't make any difference in the size. We tested/have
> > > made
> > > > sure that we are getting the same compression level in
> > > > *ZStandardCompressor
> > > > *as we are setting in the configuration file. Are we missing
> something?
> > > How
> > > > can we set a different compression level of zstd? Help would be
> > > > appreciated.
> > > >
> > > > Thanks
> > > > Falak
> > > >
> > > > On Thu, Oct 17, 2019 at 7:47 PM Driesprong, Fokko
> <fo...@driesprong.frl
> > >
> > > > wrote:
> > > >
> > > > > Hi Manik,
> > > > >
> > > > > The supported compression codecs that ship with Parquet are tested
> > and
> > > > > validated in the CI pipeline. Sometimes there are issues with
> > > > compressors,
> > > > > therefore they are not easily pluggable. Feel free to open up a PR
> to
> > > the
> > > > > project if you believe if there are compressors missing, then we
> can
> > > > have a
> > > > > discussion.
> > > > >
> > > > > It is part of the Thrift definition:
> > > > >
> > > >
> > >
> >
> https://github.com/apache/parquet-format/blob/37bdba0a18cff18da706a0d353c65e726c8edca6/src/main/thrift/parquet.thrift#L470-L478
> > > > >
> > > > > Hope this clarifies the design decision.
> > > > >
> > > > > Cheers, Fokko
> > > > >
> > > > > Op di 15 okt. 2019 om 11:52 schreef Manik Singla <
> > smanik...@gmail.com
> > > >:
> > > > >
> > > > >> Hi
> > > > >>
> > > > >> Current java code is not open to use custom compressor.
> > > > >> I believe mostly read/write is done by same team/company.  In that
> > > case,
> > > > >> it
> > > > >> would be beneficial to add this support that user can plug new
> > > > compressor
> > > > >> easily instead of doing local changes which will be prone to uses
> > > across
> > > > >> version upgrades.
> > > > >>
> > > > >> Do you guys think it will be worth to add
> > > > >>
> > > > >> Regards
> > > > >> Manik Singla
> > > > >> +91-9996008893
> > > > >> +91-9665639677
> > > > >>
> > > > >> "Life doesn't consist in holding good cards but playing those you
> > hold
> > > > >> well."
> > > > >>
> > > > >
> > > >
> > >
> >
>

Reply via email to