Hi,

I am performing Kolmogorov-Smirnov (K-S) tests and have a question
about how best to deal with duplicates. The dataset I am working with
has a lot of duplicates: after their removal its size is almost halved.
Which leads to my question:

When deleting duplicates, does it matter which ones I delete?

I have run the K-S test twice, once deleting all duplicates after the
first and once deleting all before the last. The two approaches give
different D-values (as I expected). However, in the three cases I have
tested, this has had no significant impact on the resulting p-values.
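
For concreteness, the comparison described above can be sketched as
follows. This is only an illustration under assumptions the post does
not state: that duplicates are records sharing a key but carrying
different values, and that the test is a two-sample K-S test against a
second sample. The names dedupe, ks_statistic, records and reference
are hypothetical, and the data are made up purely to show the mechanics.

```python
from bisect import bisect_right

def dedupe(records, keep="first"):
    """Keep one (key, value) record per key: the first or the last seen."""
    chosen = {}
    for key, value in records:
        if keep == "last" or key not in chosen:
            chosen[key] = value
    return list(chosen.values())

def ks_statistic(a, b):
    """Two-sample K-S statistic D: largest gap between the empirical CDFs."""
    a, b = sorted(a), sorted(b)
    d = 0.0
    # The gap between two step functions can only change at a data point,
    # so it is enough to evaluate both empirical CDFs at every observation.
    for x in a + b:
        fa = bisect_right(a, x) / len(a)
        fb = bisect_right(b, x) / len(b)
        d = max(d, abs(fa - fb))
    return d

# Made-up illustrative data (assumption): keys "a" and "b" are duplicated.
records = [("a", 1.0), ("b", 2.0), ("a", 3.0), ("c", 4.0), ("b", 5.0)]
reference = [1.5, 2.5, 3.5, 4.5]

d_first = ks_statistic(dedupe(records, keep="first"), reference)
d_last = ks_statistic(dedupe(records, keep="last"), reference)
print("D keeping first:", d_first)
print("D keeping last: ", d_last)
```

The sketch computes only D. Turning D into a p-value needs the K-S
distribution for the two sample sizes, which is left to a statistics
package.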

Any advice would be greatly appreciated.

Thanks
Michael
=================================================================
Instructions for joining and leaving this list, remarks about the
problem of INAPPROPRIATE MESSAGES, and archives are available at:
.                  http://jse.stat.ncsu.edu/                    .
=================================================================
