Thanks for your interest. I think the main challenge is that if we have to call Python functions per record, serializing and deserializing data across the boundary between the Python process and the JVM process can be pretty expensive. I don't know of a good way to solve this problem yet.
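To make the cost concrete, here is a rough sketch in plain Python, not Spark code. It assumes `pickle` as a stand-in for whatever serialization happens at the Python/JVM boundary, and the function names and the batch size of 1024 are made up for illustration. It only counts how many serialize/deserialize round trips are needed per record versus per batch.

```python
import pickle


def roundtrip_per_record(records):
    """Serialize and deserialize every record on its own (one crossing each)."""
    out = []
    crossings = 0
    for record in records:
        payload = pickle.dumps(record)      # stand-in for sending to the other process
        out.append(pickle.loads(payload))   # stand-in for receiving it back
        crossings += 1
    return out, crossings


def roundtrip_batched(records, batch_size=1024):
    """Serialize and deserialize records in chunks (one crossing per chunk)."""
    out = []
    crossings = 0
    for start in range(0, len(records), batch_size):
        payload = pickle.dumps(records[start:start + batch_size])
        out.extend(pickle.loads(payload))
        crossings += 1
    return out, crossings


# Hypothetical edge list: (source id, destination id, weight).
edges = [(i, i + 1, 1.0) for i in range(10000)]

per_record, n_single = roundtrip_per_record(edges)
batched, n_batched = roundtrip_batched(edges)

print("per-record crossings:", n_single)
print("batched crossings:", n_batched)
```

Batching reduces the number of crossings, but a graph API that invokes a user-supplied Python function for every vertex or edge would still pay the serialization cost for all of the data, which is the part I don't have a good answer for.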
On Fri, Aug 1, 2014 at 11:06 AM, Rajiv Abraham <rajiv.abra...@gmail.com> wrote:
> Hi,
> I just saw Ankur's GraphX presentation and it looks very exciting! I would
> like to contribute to a Python version of GraphX. I checked out JIRA and
> Github but I did not find much info.
>
> - Are there limitations currently to port GraphX in Python? (e.g. Maybe the
> Python Spark RDD API is incomplete or not refactored for GraphX as compared
> to the Scala version)
> - If I had to start, could I take inspiration from the Scala version and
> try to emulate it in Python?
> - Otherwise any suggestions of starter tasks regarding GraphX in Python
> would be appreciated
>
> --
> Take care,
> Rajiv