Thanks for your interest.

I think the main challenge is if we have to call Python functions per
record, it can be pretty expensive to serialize/deserialize across
boundaries of the Python process and JVM process.  I don't know if there is
a good way to solve this problem yet.





On Fri, Aug 1, 2014 at 11:06 AM, Rajiv Abraham <rajiv.abra...@gmail.com>
wrote:

> Hi,
> I just saw Ankur's GraphX presentation and it looks very exciting! I would
> like to contribute to a Python version of GraphX. I checked out JIRA and
> Github but I did not find much info.
>
> - Are there limitations currently to port GraphX in Python? (e.g. Maybe the
> Python Spark RDD API is incomplete or not refactored for GraphX as compared
> to the Scala version)
> - If I had to start, could  I take inspiration from the Scala version and
> try to emulate it in Python?
> - Otherwise any suggestions of  starter tasks regarding GraphX in Python
> would be appreciated
>
>
>
> --
> Take care,
> Rajiv
>

Reply via email to