[
https://issues.apache.org/jira/browse/SPARK-12746?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15097404#comment-15097404
]
Joseph K. Bradley commented on SPARK-12746:
-------------------------------------------
I may take a bit to think about this. I think the deeper question is whether
and how we should support nullability in ML, and there may need to be a little
design discussion around that. I'll try to get back soon.
Btw, please don't set the shepherd field. Committers use it to indicate that
they have the time and intent to merge a PR for a particular release.
> ArrayType(_, true) should also accept ArrayType(_, false)
> ---------------------------------------------------------
>
> Key: SPARK-12746
> URL: https://issues.apache.org/jira/browse/SPARK-12746
> Project: Spark
> Issue Type: Bug
> Components: ML, SQL
> Affects Versions: 1.6.0
> Reporter: Earthson Lu
>
> I see CountVectorizer has schema check for ArrayType which has
> ArrayType(StringType, true).
> ArrayType(String, false) is just a special case of ArrayType(String, true),
> but it will not pass this type check.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]