Github user tanwanirahul commented on the pull request:

    https://github.com/apache/spark/pull/11219#issuecomment-184815144
  
    @JoshRosen Sure. If someone confirms on this, I will add that.
    
    On if we need it: The pipeline apis work on the dataframes. If we don't 
have this interface, we are required to convert dataframes back to RDD and make 
it work. 
    
    The other question I had was: Current implementation assumes the first 
column to be prediction and the second one the label/observation. Is this 
reasonable assumption to make? Shouldn't we rather expect the prediction and 
label/observation column names to be provided, the way other ml algorithms 
expect?



---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to