On Thu, Jun 19, 2014 at 3:56 PM, Josh Rosen <rosenvi...@gmail.com> wrote:
> Thanks for helping with the Dill integration; I had some early first 
> attempts, but had to set them aside when I got busy with some other work.
>
> Just to bring everyone up to speed regarding context:
> There are some objects that PySpark’s `cloudpickle` library doesn’t serialize 
> properly, such as operator.getattr 
> (https://issues.apache.org/jira/browse/SPARK-791) or NamedTuples 
> (https://issues.apache.org/jira/browse/SPARK-1687).
> My early attempt at replacing CloudPickle with Dill ran into problems because 
> of slight differences in how Dill pickles functions defined in doctests 
> versus functions defined elsewhere.  I opened a bug report for this with the 
> Dill developers (https://github.com/uqfoundation/dill/issues/18), who 
> subsequently fixed the bug (https://github.com/uqfoundation/dill/pull/29).
> It looks like there’s already a couple of Dill issues with examples of the 
> “Can’t pickle _ it’s not found as _” bug 
> (https://github.com/uqfoundation/dill/search?q=%22not+found+as%22&type=Issues).
>   If you can find a small test case that reproduces this bug, I’d consider 
> opening a new Dill issue.

Thanks for the context, Josh.

I've gone ahead and created a new test case and just opened a new issue;

https://github.com/uqfoundation/dill/issues/49

Reply via email to