[
https://issues.apache.org/jira/browse/SPARK-14760?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15250887#comment-15250887
]
Joseph K. Bradley commented on SPARK-14760:
-------------------------------------------
transformSchema is there for schema validation (see [SPARK-14608]). It'd be
worth discussing whether an individual transformer needs to invoke schema
validation before fitting or transforming. I'd say that, in general, it is a
judgement call depending on whether transformSchema is expensive/cheap, throws
a better error than fitting without a check, etc.
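
To make the trade-off concrete, here is a minimal sketch of the pattern under
discussion. It is not Spark code: ToyScaler, transform_schema, and the
dict-based "schema" are hypothetical stand-ins chosen so the example runs
without any dependencies. It assumes the check is cheap, which is the case
where calling it at the start of transform seems worthwhile.

```python
from dataclasses import dataclass


@dataclass
class ToyScaler:
    """Hypothetical transformer; a schema here is a dict of column name -> type name."""

    input_col: str
    output_col: str
    factor: float = 2.0

    def transform_schema(self, schema):
        """Validate the input schema and return the schema of the output."""
        if self.input_col not in schema:
            raise ValueError(f"Input column '{self.input_col}' does not exist.")
        if schema[self.input_col] != "double":
            raise ValueError(
                f"Input column '{self.input_col}' must be double, "
                f"got {schema[self.input_col]}."
            )
        if self.output_col in schema:
            raise ValueError(f"Output column '{self.output_col}' already exists.")
        out = dict(schema)
        out[self.output_col] = "double"
        return out

    def transform(self, rows, schema):
        # Cheap check up front: a bad column name fails here with a clear
        # message instead of a KeyError inside the row loop below.
        self.transform_schema(schema)
        return [
            {**row, self.output_col: row[self.input_col] * self.factor}
            for row in rows
        ]
```

With this shape, ToyScaler("missing", "y").transform(rows, schema) raises a
ValueError that names the missing column. If the check were expensive, or the
downstream failure were already clear enough, skipping the call would be an
equally defensible choice.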
> Feature transformers should always invoke transformSchema in transform or fit
> -----------------------------------------------------------------------------
>
> Key: SPARK-14760
> URL: https://issues.apache.org/jira/browse/SPARK-14760
> Project: Spark
> Issue Type: Improvement
> Components: ML
> Reporter: yuhao yang
> Priority: Minor
>
> Since one of the primary functions of transformSchema is to conduct parameter
> validation, transformers should always invoke transformSchema in transform
> and fit.