Joseph K. Bradley created SPARK-7461:
----------------------------------------

             Summary: Remove spark.ml Model, and have all Transformers have 
parent
                 Key: SPARK-7461
                 URL: https://issues.apache.org/jira/browse/SPARK-7461
             Project: Spark
          Issue Type: Improvement
          Components: ML
            Reporter: Joseph K. Bradley


A recent PR [https://github.com/apache/spark/pull/5980] brought up an issue 
with the Model abstraction: There are transformers which could be Transformers 
(created by a user) or Models (created by an Estimator).  This is the first 
instance, but there will be more such transformers in the future.

Some possible fixes are:
* Create 2 separate classes, 1 extending Transformer and 1 extending Model.  
These would be essentially the same, and they could share code (or have 1 wrap 
the other).  This would bloat the API.
* Just use Model, with a possibly null parent class.  There is precedence 
(meta-algorithms like RandomForest producing weak hypothesis Models with no 
parent).
* Change Transformer to have a parent which may be null.
** *--> Unless there is strong disagreement, I think we should go with this 
last option.*



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to