[
https://issues.apache.org/jira/browse/SPARK-21199?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16062923#comment-16062923
]
Nick Pentreath commented on SPARK-21199:
----------------------------------------
Can you expand on how the null vectors land up in the dataset? It doesn't seem
a common scenario to me.
> Its not possible to impute Vector types
> ---------------------------------------
>
> Key: SPARK-21199
> URL: https://issues.apache.org/jira/browse/SPARK-21199
> Project: Spark
> Issue Type: Bug
> Components: Spark Core
> Affects Versions: 2.0.0, 2.1.1
> Reporter: Franklyn Dsouza
>
> There are cases where nulls end up in vector columns in dataframes. Currently
> there is no way to fill in these nulls because its not possible to create a
> literal vector column expression using lit().
> Also the entire pyspark ml api will fail when they encounter nulls so this
> makes it hard to work with the data.
> I think that either vector support should be added to the imputer or vectors
> should be supported in column expressions so they can be used in a coalesce.
> [~mlnick]
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]