Re: [Jprogramming] faster empirical cumulative distribution function

[email protected] Thu, 14 Aug 2014 01:12:04 -0700

Fair enough.  
I was mainly interested whether you found an improvement 
for your needs with the verbs I suggested, especially ecdfsorted.



Since you wish to merge the probabilities with your data rows, if your 
input data is integer as in the example (unlikely?),  it might be worth 
keeping the frequencies as integer by not normalising.  It's easy 
enough to interpret "all rows (with probability) > 0.95" as "all rows 
with count > 0.95 * N"
eg
    ((#((+/\)))@:(#/.~)) 2 2 2 4 4 6 6 8

3 3 
3 5 5 7 7 8

NB apologies for silly line-wraps in my previous post and 
any in this one! 

On 13/08/2014 23:02, Joe Bogner wrote:
> On Wed, Aug 
13, 2014 at 5:56 PM, [email protected]
> <mike_liz.day@tiscali.
co.uk> wrote:
>
>> Isn't it more useful to produce a keyed table of 
cumulative
>> frequencies?
>> eg:
>>    (~.,.(((%{:)@:(+/\)))@:(#/.~))2 
2 2 4 4 6 6 8
>
> Thank you. The keyed table is useful, but ultimately 
I need to merge
> into the rows of the source table. I'll be running 
queries like -- get
> all rows > 0.95 or < 0.05 in which case I will 
want the source table
> with the frequencies as a new column
> 
----------------------------------------------------------------------

> For information about J forums see http://www.jsoftware.com/forums.htm
>

----------------------------------------------------------------------
For information about J forums see http://www.jsoftware.com/forums.htm

Re: [Jprogramming] faster empirical cumulative distribution function

Reply via email to