On 10/13/14 1:04 PM, venkatesha murthy wrote: > Adding a bit more on this: > a) The DescriptiveStatisticalSummary actually handles the rest of the > functions such as addValue, getPercentile etc. > b) I have added addValue() as it is important to see either storeless or > store variants as interfaces. > c) A case in point being (for b); i was actually trying out a lockfull and > a lockfree based variants for descriptive statistical summary and it was > very concise/consistent with an interface to use that has all common > functions across all variants. > d) well lock based or lock free variants are not a part of this patch as > iam still working through > > However i feel the getPercentile can definitely add value. Please let me > know if i could turn in all the relevant methods of > DescriptiveStorelessStatistics into statistical summary (such as kurtosis, > skewness etc..) and then we could just use SummaryStatistics.
I am not sure I understand what you are proposing. Currently, we have two statistical "aggregates" for descriptive univariate stats: SummaryStatistics - aggregates "storeless" statistics over a stream of data that is not stored in memory DescriptiveStatistics - provides an extended set of statistics, some of which require that the full set of data be stored in memory The subject of this thread was a proposal to add quartiles to SummaryStatistics, as the new(ish) PSquarePercentile allows those statistics to be computed without storing the data. Phil > > On Tue, Oct 14, 2014 at 1:15 AM, venkatesha murthy < > venkateshamurth...@gmail.com> wrote: > >> Hi Phil, >> >> Though i did not add to StatisticalSummary i was actually working on a >> DescriptiveStatisticalSummary for all the Storeless variants inclusive of >> PSquarePercentile. Would it help if you can actually implement >> SummaryStatisitcs with an extended interface such as >> DescriptiveStatisticalSummary ? below. >> >> That said i actually wanted to discuss the new storelessvariant of >> descriptive statistics. >> a) DescriptiveStatisticalSummary - an extended interface for >> StatisticalSummary (adds a Generic type that can cater for store full and >> storeless) >> b) DescriptiveStorelessStatistics - Storeless variant of >> DescriptiveStatisitcs >> c) SynchronizedDescriptiveStorelessStatistics - a synchronized wrapper. >> >> Test case classes added to the same. >> >> Please let me know on this i could also accomodate the changes to summary >> stats based on this change here. >> Also please let me know if this could be raised as a jira ticket to pursue. >> >> Thanks >> Murthy >> >> On Sat, Oct 11, 2014 at 1:10 AM, Phil Steitz <phil.ste...@gmail.com> >> wrote: >> >>> Now that we have a "storeless" percentile estimator, we can add >>> quartile computation to SummaryStatistics. Any objections to my >>> adding this? I could optionally add a boolean constructor argument >>> to avoid the overhead of maintaining these stats. Or more >>> generally, add a bitfield encoding the exact set of stats the user >>> wants to maintain. If there are no objections to the addition, I >>> will open a JIRA. >>> >>> Phil >>> >>> >>> --------------------------------------------------------------------- >>> To unsubscribe, e-mail: dev-unsubscr...@commons.apache.org >>> For additional commands, e-mail: dev-h...@commons.apache.org >>> >>> --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscr...@commons.apache.org For additional commands, e-mail: dev-h...@commons.apache.org