[jira] [Commented] (STATISTICS-71) Implementation of Univariate Statistics

Gilles Sadowski (Jira) Sun, 02 Jul 2023 02:19:04 -0700


    [ 
https://issues.apache.org/jira/browse/STATISTICS-71?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17739367#comment-17739367
 ]


Gilles Sadowski commented on STATISTICS-71:
-------------------------------------------

bq. [...] for a simple statistic (min, max, sum) then computing the count is a 
waste of resources.

I imagine than an increment by one is a marginal cost wrt the mechanics of 
streams.
Even it is not necessary for computing the value, the count may be an 
information needed by many users.

bq. [...] you only require one [count].

Indeed, recomputing the same values several times would indicate a design 
problem.
What about {{Count}} being a {{DoubleStorelessStatistics}}, on which other(s) 
could depend?
When using a "shared" {{Count}} instance (e.g. in a {{SummaryStatistics}}), 
we'll have to ensure that all individual statistics can only be updated 
together.

> Implementation of Univariate Statistics
> ---------------------------------------
>
>                 Key: STATISTICS-71
>                 URL: https://issues.apache.org/jira/browse/STATISTICS-71
>             Project: Commons Statistics
>          Issue Type: Task
>          Components: descriptive
>            Reporter: Anirudh Joshi
>            Priority: Minor
>              Labels: gsoc, gsoc2023
>
> Jira ticket to track the implementation of the Univariate statistics required 
> for the updated SummaryStatistics API. 
> The implementation would be "storeless". It should be used for calculating 
> statistics that can be computed in one pass through the data without storing 
> the sample values.
> Currently I have the definition of API as (this might evolve as I continue 
> working)
> {code:java}
> public interface DoubleStorelessUnivariateStatistic extends DoubleSupplier {
>     DoubleStorelessUnivariateStatistic add(double v);
>     long getCount();
>     void combine(StorelessUnivariateStatistic other);
> } {code}
>  



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

[jira] [Commented] (STATISTICS-71) Implementation of Univariate Statistics

Reply via email to