Hi, I am performing Kolmogorov-Smirnov (K-S) testing and have a query about how best to deal with duplicates. The dataset I am working with has a lot of duplicates - after there removal is size is almost half. Which leads to my question,
When deleting duplicates does it matter which ones i delete ? I have done the K-S test deleting all after the first duplicate and all before the last duplicate bothing approaches giving different D-values (as I expected). However in the three cases I have tested this has had no significant impact upon the resultant p-values. Any advice would be greatly appreciated Thanks Michael . . ================================================================= Instructions for joining and leaving this list, remarks about the problem of INAPPROPRIATE MESSAGES, and archives are available at: . http://jse.stat.ncsu.edu/ . =================================================================
