[ 
https://issues.apache.org/jira/browse/SPARK-16074?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15356655#comment-15356655
 ] 

Max Moroz commented on SPARK-16074:
-----------------------------------

The component for this and another 3 issues is mis-spelt MLilb instead of 
MLLib. This causes more issues to end up in this category because of automatic 
completion/suggestion. Can someone who has edit rights on issues authored by 
others, correct these four issues to get rid of the mis-spelt component name?

> Expose VectorUDT/MatrixUDT in a public API
> ------------------------------------------
>
>                 Key: SPARK-16074
>                 URL: https://issues.apache.org/jira/browse/SPARK-16074
>             Project: Spark
>          Issue Type: New Feature
>          Components: MLilb
>    Affects Versions: 2.0.0
>            Reporter: Xiangrui Meng
>            Assignee: Xiangrui Meng
>            Priority: Critical
>             Fix For: 2.0.0
>
>
> Both VectorUDT and MatrixUDT are private APIs, because UserDefinedType itself 
> is private in Spark. However, in order to let developers implement their own 
> transformers and estimators, we should expose both types in a public API to 
> simply the implementation of transformSchema, transform, etc. Otherwise, they 
> need to get the data types using reflection.
> Note that this doesn't mean to expose VectorUDT/MatrixUDT classes. We can 
> just have a method or a static value that returns VectorUDT/MatrixUDT 
> instance with DataType as the return type. There are two ways to implement 
> this:
> 1. following DataTypes.java in SQL, so Java users doesn't need the extra "()".
> 2. Define DataTypes in Scala.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to