Xiangrui Meng created SPARK-7809:
------------------------------------

             Summary: MultivariateOnlineSummarizer should allow users to 
configure what to compute
                 Key: SPARK-7809
                 URL: https://issues.apache.org/jira/browse/SPARK-7809
             Project: Spark
          Issue Type: Improvement
          Components: MLlib
    Affects Versions: 1.4.0
            Reporter: Xiangrui Meng


Currently, MultivariateOnlineSummarizer computes every summary statistic it can
provide, which is fine and convenient for a small number of features. If the
feature dimension is large, this becomes expensive. We should add setters that
allow users to configure what to compute.

{code}
val summarizer = new MultivariateOnlineSummarizer()
  .withMean(false)
  .withMax(false)
{code}
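
To illustrate the idea without depending on Spark, here is a minimal standalone
sketch of how such flags could skip unwanted work. The class name
ConfigurableSummarizer, its constructor parameters, and the Option-returning
accessors are assumptions made for illustration only; this is not the existing
MultivariateOnlineSummarizer API and not necessarily how the change would be
implemented.

```scala
// Hypothetical sketch: a tiny online summarizer that only maintains the
// statistics the caller asked for. Not Spark's MultivariateOnlineSummarizer.
final class ConfigurableSummarizer(
    computeMean: Boolean = true,
    computeMax: Boolean = true) {

  private var n = 0L
  private var dim = -1
  private var sum: Array[Double] = null       // allocated only if computeMean
  private var maxValues: Array[Double] = null // allocated only if computeMax

  // Setters return a fresh, reconfigured instance; only allowed before any data is added.
  def withMean(flag: Boolean): ConfigurableSummarizer = {
    require(n == 0L, "configure before adding samples")
    new ConfigurableSummarizer(flag, computeMax)
  }

  def withMax(flag: Boolean): ConfigurableSummarizer = {
    require(n == 0L, "configure before adding samples")
    new ConfigurableSummarizer(computeMean, flag)
  }

  // Fold one sample into the running statistics, skipping disabled ones.
  def add(sample: Array[Double]): this.type = {
    if (n == 0L) {
      dim = sample.length
      if (computeMean) sum = new Array[Double](dim)
      if (computeMax) maxValues = Array.fill(dim)(Double.NegativeInfinity)
    }
    require(sample.length == dim, s"expected dimension $dim, got ${sample.length}")
    var i = 0
    while (i < dim) {
      if (computeMean) sum(i) += sample(i)
      if (computeMax && sample(i) > maxValues(i)) maxValues(i) = sample(i)
      i += 1
    }
    n += 1
    this
  }

  def count: Long = n

  // None means the statistic was disabled or no samples have been added yet.
  def mean: Option[Array[Double]] =
    if (computeMean && n > 0) Some(sum.map(_ / n)) else None

  def max: Option[Array[Double]] =
    if (computeMax && n > 0) Some(maxValues.clone()) else None
}
```

With max disabled, the per-sample loop never touches the max buffer and the
buffer is never allocated, which is where the savings would come from when the
feature dimension is large.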



