[ 
https://issues.apache.org/jira/browse/MATH-437?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12975771#action_12975771
 ] 

Mikkel Meyer Andersen commented on MATH-437:
--------------------------------------------

In the past months, I've communicated with both Richard Simard and George 
Marsaglia regarding small disagreement between theory in Marsaglia's article 
and the actual implementation; namely the fact that 0 <= h < 1, but in the code 
0 < h <= 1. I wrote to Marsaglia regarding this, and his answer was: 
{quote}
The Kolmogorov distribution comes from a piecewise polynomial in h with knots 
at 1/2n, 2/2n,...,(2n-1)/2n,  with each segment assumed to start with h=0. 
Although I emphasized that 0<= h <1 in the article,  I overlooked the need for 
ensuring that in the C code, and apparently so did my colleagues. Sorry about 
that.
{quote}
This means that his code has to be changed slightly to ensure that 0 <= h < 1. 
Simard argues that this shouldn't mean anything because KS distribution is 
continuous, but if one wants to correct it, one should
{quote}
Instead of taking the floor(n*d + 1) and making this correction for h = 1, take 
the ceiling (n*d).
{quote}

I would prefer using ceiling (n*d) instead of the originally (wrongly) proposed 
floor(n*d + 1), despite arguments of continuity. So my plan is to do this (I 
still have my implementation which seem to work quite okay). The only problem 
is that R seems to use Marsaglia's code, and I don't have access to e.g. 
Mathematica which should implement several algorithms, so I might run into 
problems when I have to perform tests.

> Kolmogorov Smirnov Distribution
> -------------------------------
>
>                 Key: MATH-437
>                 URL: https://issues.apache.org/jira/browse/MATH-437
>             Project: Commons Math
>          Issue Type: New Feature
>            Reporter: Mikkel Meyer Andersen
>            Assignee: Mikkel Meyer Andersen
>            Priority: Minor
>             Fix For: 3.0
>
>   Original Estimate: 0.25h
>  Remaining Estimate: 0.25h
>
> Kolmogorov-Smirnov test (see [1]) is used to test if one sample against a 
> known probability density functions or if two samples are from the same 
> distribution. To evaluate the test statistic, the Kolmogorov-Smirnov 
> distribution is used. Quite good asymptotics exist for the one-sided test, 
> but it's more difficult for the two-sided test.
> [1]: http://en.wikipedia.org/wiki/Kolmogorov%E2%80%93Smirnov_test

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to