Xiangrui Meng created SPARK-7809:
------------------------------------
Summary: MultivariateOnlineSummarizer should allow users to
configure what to compute
Key: SPARK-7809
URL: https://issues.apache.org/jira/browse/SPARK-7809
Project: Spark
Issue Type: Improvement
Components: MLlib
Affects Versions: 1.4.0
Reporter: Xiangrui Meng
Currently, MultivariateOnlineSummarizer computes every summary statistic it can
provide, which is fine and convenient for a small number of features. If the
feature dimension is large, this becomes expensive. So we should add setters to
allow users to configure what to compute.
{code}
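import org.apache.spark.mllib.stat.MultivariateOnlineSummarizer

// Proposed setters (not available in 1.4.0): skip statistics that are not needed.
val summarizer = new MultivariateOnlineSummarizer()
  .withMean(false)
  .withMax(false)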
{code}
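To illustrate how such setters could skip work, here is a minimal standalone sketch. It is not Spark's implementation: ConfigurableSummarizer, its fields, and the rule that setters must be called before the first sample are all assumptions made for illustration, and only mean and max are covered.
{code}
// Hypothetical sketch, independent of Spark; names and behavior are assumptions.
final class ConfigurableSummarizer {
  private var computeMean = true
  private var computeMax = true
  private var count = 0L
  private var dim = 0
  private var sums: Array[Double] = Array.empty
  private var maxes: Array[Double] = Array.empty

  // Setters return this so calls can be chained; they are only valid before any data is added.
  def withMean(flag: Boolean): this.type = {
    require(count == 0L, "configure before adding samples")
    computeMean = flag
    this
  }

  def withMax(flag: Boolean): this.type = {
    require(count == 0L, "configure before adding samples")
    computeMax = flag
    this
  }

  def add(sample: Array[Double]): this.type = {
    if (count == 0L) {
      dim = sample.length
      // Allocate buffers only for the statistics that were requested.
      if (computeMean) sums = new Array[Double](dim)
      if (computeMax) maxes = Array.fill(dim)(Double.NegativeInfinity)
    }
    require(sample.length == dim, s"expected dimension $dim, got ${sample.length}")
    var i = 0
    while (i < dim) {
      if (computeMean) sums(i) += sample(i)
      if (computeMax && sample(i) > maxes(i)) maxes(i) = sample(i)
      i += 1
    }
    count += 1
    this
  }

  // A disabled (or still empty) statistic is reported as None.
  def mean: Option[Array[Double]] =
    if (computeMean && count > 0L) Some(sums.map(_ / count)) else None

  def max: Option[Array[Double]] =
    if (computeMax && count > 0L) Some(maxes.clone()) else None
}

object SummarizerDemo {
  def main(args: Array[String]): Unit = {
    val s = new ConfigurableSummarizer().withMax(false)
    s.add(Array(1.0, 2.0)).add(Array(3.0, 6.0))
    println(s.mean.map(_.mkString(", "))) // mean is computed
    println(s.max.isDefined)              // max was disabled
  }
}
{code}
With a disabled statistic, neither the buffer allocation nor the per-element update happens, which is where the savings for high-dimensional features would come from.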