Github user FRosner commented on the pull request:

    https://github.com/apache/spark/pull/9222#issuecomment-150567323
  
    @felixcheung that is strange. When I ran some performance checks today, the 
run time did not appear to be quadratic. Could it be because the map 
transformation in the original code is lazy and only executed when you actually 
access the data?
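
    To illustrate the kind of effect I mean, here is a minimal sketch. It uses Python's built-in lazy `map` as a stand-in, not the code from this pull request, and `slow_square` and `timed` are hypothetical helpers made up for the example. With a lazy transformation, building the mapped collection costs almost nothing, and the real work only shows up in the timing once the elements are accessed.

```python
import time


def slow_square(x):
    # Hypothetical per-element work; the sleep stands in for an expensive conversion.
    time.sleep(0.01)
    return x * x


def timed(fn):
    # Run fn() and return its result together with the elapsed wall-clock seconds.
    start = time.perf_counter()
    result = fn()
    return result, time.perf_counter() - start


data = list(range(20))

# Building the lazy map does not call slow_square at all.
lazy, build_time = timed(lambda: map(slow_square, data))

# The per-element cost is only paid here, when the values are materialized.
values, access_time = timed(lambda: list(lazy))

print(f"build: {build_time:.4f}s, access: {access_time:.4f}s")
```

    A benchmark that stops after the first step would measure only the cheap construction and miss the actual cost.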
    
    I can tell you that the original implementation did not succeed on our file 
while mine did. I will investigate a bit more. Maybe someone else has an 
opinion?


