On Fri, Aug 14, 2020 at 11:56 PM Micah Kornfield <emkornfi...@gmail.com> wrote:
>
> >
> > Regarding Plasma, you're right we should have started this conversation
> > earlier! The way it's being developed in Ray currently isn't useful as a
> > standalone project. We realized that tighter integration with Ray's object
> > lifetime tracking could be important, and removing IPCs and making it a
> > separate thread in the same process as our scheduler could make a big
> > difference for performance. Some of these optimizations wouldn't be easy
> > without a tight integration, so there are some trade-offs here.
>
> So I guess the question is whether it is worth continuing to try to
> maintain a sepearate version of plasma within the Arrow repo?
>

What isn't clear is whether the Plasma that's in Ray is usable in a
C++ library context (e.g. what we currently ship as libplasma-dev e.g.
on Ubuntu/Debian). That seems still useful, but if the project isn't
being actively maintained / developed (which, given the series of
stale PRs over the last year or two, it doesn't seem to be) it's
unclear whether we want to keep shipping it.

> On Tue, Jul 21, 2020 at 9:28 AM Robert Nishihara <robertnishih...@gmail.com>
> wrote:
>
> > Hi all,
> >
> > Regarding Plasma, you're right we should have started this conversation
> > earlier! The way it's being developed in Ray currently isn't useful as a
> > standalone project. We realized that tighter integration with Ray's object
> > lifetime tracking could be important, and removing IPCs and making it a
> > separate thread in the same process as our scheduler could make a big
> > difference for performance. Some of these optimizations wouldn't be easy
> > without a tight integration, so there are some trade-offs here.
> >
> > Regarding the Python serialization format, I agree with Antoine that it
> > should be deprecated. We began developing it before pickle 5, but now that
> > pickle 5 has taken off, it makes less sense (it's useful in its own right,
> > but at the end of the day, we were interested in it as a way to serialize
> > arbitrary Python objects).
> >
> > -Robert
> >
> > On Sun, Jul 12, 2020 at 5:26 PM Wes McKinney <wesmck...@gmail.com> wrote:
> >
> > > I'll add deprecation warnings to the pyarrow.serialize functions in
> > > question, it will be pretty simple.
> > >
> > > On Sun, Jul 12, 2020, 6:34 PM Neal Richardson <
> > neal.p.richard...@gmail.com
> > > >
> > > wrote:
> > >
> > > > This seems like something to investigate after the 1.0 release.
> > > >
> > > > Neal
> > > >
> > > > On Sun, Jul 12, 2020 at 11:53 AM Antoine Pitrou <anto...@python.org>
> > > > wrote:
> > > >
> > > > >
> > > > > I'd certainly like to deprecate our custom Python serialization
> > format,
> > > > > and using pickle protocol 5 instead is a very good idea.
> > > > >
> > > > > We can probably keep it in 1.0 while raising a FutureWarning.
> > > > >
> > > > > Regards
> > > > >
> > > > > Antoine.
> > > > >
> > > > >
> > > > > Le 12/07/2020 à 19:22, Wes McKinney a écrit :
> > > > > > It appears that the Ray developers have decided to fork Plasma and
> > > > > > decouple from the Arrow codebase:
> > > > > >
> > > > > > https://github.com/ray-project/ray/pull/9154
> > > > > >
> > > > > > This is a disappointing development to occur without any discussion
> > > on
> > > > > > this mailing list but given the lack of development activity on
> > > Plasma
> > > > > > I would like to see how others in the community would like to
> > > proceed.
> > > > > >
> > > > > > It appears additionally that the Union-based serialization format
> > > > > > implemented by arrow/python/serialize.h and the
> > pyarrow/serialize.py
> > > > > > has been dropped in favor of pickle5. If there is not value in
> > > > > > maintaining this code then it would probably be preferable for us
> > to
> > > > > > remove this from the codebase.
> > > > > >
> > > > > > Thanks,
> > > > > > Wes
> > > > > >
> > > > >
> > > >
> > >
> >

Reply via email to