This is about the binary de/encoding, I think the variant is a separate
discussion.

There is a Python implementation here
<https://github.com/apache/spark/blob/master/python/pyspark/sql/variant_utils.py>,
and there seems to be interest from the Arrow/Rust
<https://github.com/apache/arrow-rs/issues/6736> community as well.

Kind regards,
Fokko

Op ma 27 jan 2025 om 18:32 schreef Russell Spitzer <
russell.spit...@gmail.com>:

> The issue we brought up in the discussion was that it's independent of the
> Shredding spec which we are also trying to get in. The
> python implementation is only for the serialization of a non-shredded
> Variant correct?  I
> think we have our bases covered though with some other language
> implementations in the works though
> so I'm not sure it's a problem.
>
> On Mon, Jan 27, 2025 at 11:29 AM Daniel Weeks <dwe...@apache.org> wrote:
>
> > Yeah, sorry this thread got separated from the original email (for me).
> >
> > The python implementation currently in Spark is the one I was referring
> > to.  I believe there is openness to moving that to a separate repository
> if
> > necessary, but I would also think that if the goal is to prove that there
> > is no language specific requirement in the spec, the location of the
> > implementation should not matter (e.g. if a rust implementation in Apache
> > Arrow would satisfy the requirement, why wouldn't a python implementation
> > in Apache Spark)?
> >
> > -Dan
> >
> >
> > On Sun, Jan 26, 2025 at 4:49 PM Julien Le Dem <jul...@apache.org> wrote:
> >
> > > Dan,
> > > This came from you.
> > > Is it the one?
> > > Thank you
> > > Julien
> > >
> > > On Sat, Jan 25, 2025 at 11:20 AM Andrew Lamb <andrewlam...@gmail.com>
> > > wrote:
> > >
> > >> Perhaps is the one[1] referred to by Curt[2]
> > >>
> > >> [1]:
> > >>
> > >>
> >
> https://github.com/apache/spark/blob/master/python/pyspark/sql/variant_utils.py
> > >> [2]:
> > >>
> https://github.com/apache/arrow-rs/issues/6736#issuecomment-2614029393
> > >>
> > >>
> > >>
> > >> On Wed, Jan 22, 2025 at 12:12 PM Micah Kornfield <
> emkornfi...@gmail.com
> > >
> > >> wrote:
> > >>
> > >> > What python implementation are we talking about here?
> > >> >
> > >> > On Wednesday, January 22, 2025, Andrew Lamb <andrewlam...@gmail.com
> >
> > >> > wrote:
> > >> >
> > >> > > I also unfortunately will not be able to make it today.
> > >> > >
> > >> > > As a brief update, I am working on finding someone to help with
> the
> > >> Rust
> > >> > > implementation of variant. Moving forward with Java and Python
> seems
> > >> > > reasonable to me, though I would truly love to get a Rust
> > >> implementation
> > >> > to
> > >> > > ensure there is no potential gotcha's for a native implementation
> > >> > >
> > >> > > Thanks,
> > >> > > Andrew
> > >> > >
> > >> > > On Wed, Jan 22, 2025 at 11:41 AM Daniel Weeks <dwe...@apache.org>
> > >> wrote:
> > >> > >
> > >> > > > Hey Julien,
> > >> > > >
> > >> > > > I'm not going to be able to attend today's meeting, but just
> > wanted
> > >> to
> > >> > > > follow up on a few of the items from the last meeting.
> > >> > > >
> > >> > > > With respect to the reference implementations for Variant, we
> had
> > >> > > > discussed the possibility of Rust or C++, but those both have
> > >> > significant
> > >> > > > work.  The Java and native Python implementations are much
> closer
> > >> and
> > >> > > > should cover the concerns for verification of the spec.  I still
> > >> think
> > >> > > > there will be work on the Rust side, but I don't think there's a
> > C++
> > >> > > > implementation that would be in a state to open source.
> > >> > > >
> > >> > > > For the shredding spec, Micah, Ryan, Russel and I met and are
> > >> closing
> > >> > in
> > >> > > on
> > >> > > > wording that everyone is happy with, so I expect that will close
> > out
> > >> > > > shortly.
> > >> > > >
> > >> > > > -Dan
> > >> > > >
> > >> > > > On Wed, Jan 22, 2025 at 7:41 AM Julien Le Dem <
> jul...@apache.org>
> > >> > wrote:
> > >> > > >
> > >> > > > > The next Parquet sync is today Jan 22nd at 9:30am PT - 12:30pm
> > ET
> > >> -
> > >> > > > 6:30pm
> > >> > > > > CET
> > >> > > > > (in about 2hs)
> > >> > > > > To join the invite:
> > >> > > > > https://calendar.app.google/xXGgYU6evBArpzdZ9
> > >> > > > > Please contact me to be added to the recurring invite.
> > >> > > > > Everybody is welcome, bring your topic or just listen in.
> > >> > > > > Best
> > >> > > > > Julien
> > >> > > > >
> > >> > > >
> > >> > >
> > >> >
> > >>
> > >
> >
>

Reply via email to