Bago Amirbekian created SPARK-23562:
---------------------------------------

             Summary: RFormula handleInvalid should handle invalid values in 
non-string columns.
                 Key: SPARK-23562
                 URL: https://issues.apache.org/jira/browse/SPARK-23562
             Project: Spark
          Issue Type: Improvement
          Components: ML
    Affects Versions: 2.3.0
            Reporter: Bago Amirbekian


Currently when handleInvalid is set to 'keep' or 'skip' this only applies to 
String fields. Numeric fields that are null will either cause the transformer 
to fail or might be null in the resulting label column.

I'm not sure what the semantics of keep might be for numeric columns with null 
values, but we should be able to at least support skip for these types.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to