Github user josephlijia commented on the pull request:
https://github.com/apache/spark/pull/1297#issuecomment-159197234
We have implemented a faster approach using zipPartitions, although the final
results are wrapped in an RDD. When data volumes are huge, it is much faster
than the current implementation. Could you please tell me how I can contribute
this to IndexedRDD? Thank you very much. I look forward to your reply.