Github user cloud-fan commented on the pull request:

    https://github.com/apache/spark/pull/12061#issuecomment-206350054
  
    The benchmark result for master is:
    ```
    back-to-back filter:                Best/Avg Time(ms)    Rate(M/s)   Per Row(ns)   Relative
    -------------------------------------------------------------------------------------------
    Dataset                                   706 /  791         14.2          70.6       1.0X
    DataFrame                                  58 /   68        173.2           5.8      12.2X
    RDD                                       204 /  217         48.9          20.4       3.5X
    ```
    The original version is actually a little faster (and much faster than the
    back-to-back map case, which takes about 2500 ms on average). I'm still
    investigating why.
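
As a reading aid for the table, here is a minimal sketch of how its columns appear to relate to one another. It is an interpretation, not something stated in the comment. It assumes the rate and relative columns are derived from the best time, with Dataset as the 1.0X baseline. The row count is inferred from the Dataset row (best time divided by per-row time) and is not given in the comment. The helper names `rate_m_per_s` and `relative` are made up for illustration.

```python
# Best times (ms) and per-row costs (ns), copied from the table in the comment.
best_ms = {"Dataset": 706, "DataFrame": 58, "RDD": 204}
per_row_ns = {"Dataset": 70.6, "DataFrame": 5.8, "RDD": 20.4}

# Assumption: the row count is not stated in the comment; it is inferred here
# as (best time in ns) / (per-row time in ns) for the Dataset row.
rows = round(best_ms["Dataset"] * 1e6 / per_row_ns["Dataset"])


def rate_m_per_s(ms):
    """Millions of rows processed per second for a given best time in ms."""
    return rows / (ms / 1000) / 1e6


def relative(ms):
    """Speed-up relative to the Dataset case (assumed to be the baseline)."""
    return best_ms["Dataset"] / ms


for name, ms in best_ms.items():
    print(f"{name:<10} {rate_m_per_s(ms):6.1f} M/s  {relative(ms):5.1f}X")
```

The recomputed rates land close to the reported ones but not exactly on them, presumably because the table prints best times rounded to whole milliseconds.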


