Sounds great, thanks.
A number of people are looking at using Arrow for faster serialization
for PySpark, so now that the Java and C++ libraries are compatible
(cf: integration tests) we can make this a reality.
On Thu, Dec 15, 2016 at 4:54 PM, Julien Le Dem wrote:
> I'm
I'm happy to reach out to Matei.
Reynold is on this list and the Arrow PMC as well.
Wes, I can start with an email and CC you.
On Thu, Dec 15, 2016 at 11:03 AM, Mark Hamstra
wrote:
> I already made sure that Matei is aware of this thread. He seemed
> interested in
I already made sure that Matei is aware of this thread. He seemed
interested in talking with key Arrow developers.
On Thu, Dec 15, 2016 at 10:49 AM, Julian Hyde wrote:
> I think someone should reach out to Matei and Shoumik, and see if they
> would like to collaborate. Wes,
The PySpark community is aware of arrow, but certainly more reaching out to
the Spark SQL devs could really be beneficial to get us all on the same
page :)
On Thu, Dec 15, 2016 at 10:49 AM Julian Hyde wrote:
> I think someone should reach out to Matei and Shoumik, and see if
I think someone should reach out to Matei and Shoumik, and see if they would
like to collaborate. Wes, would you like to do that?
Also, reach out to the Spark community. Are they aware of Arrow? Are they
planning to use it, or are they developing an alternative?
Julian
> On Dec 13, 2016, at
Uwe L. Korn created ARROW-426:
-
Summary: Python: Conversion from pyarrow.Array to a Python list
Key: ARROW-426
URL: https://issues.apache.org/jira/browse/ARROW-426
Project: Apache Arrow
Issue