Marco Gaido created SPARK-23568:
-----------------------------------

             Summary: Silhouette should get number of features from metadata if 
available
                 Key: SPARK-23568
                 URL: https://issues.apache.org/jira/browse/SPARK-23568
             Project: Spark
          Issue Type: Improvement
          Components: ML
    Affects Versions: 2.4.0
            Reporter: Marco Gaido


In Silhouette computation we need to know the number of features. This is done 
taking the first row and checking the size of the features vector. Despite it 
works fine, if the number of attributes is present in the metadata of the 
column, we can avoid the additional job which is generated by using `first`.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to