> Reference? What do you mean by "removing > duplicates" and what is this K-S test about?
I am testing goodness-of-fit for continuous distributions. I have been taught to delete any duplicate data points for example if we have 10,33,44,44,44,44,55,56 I would calculate my sample and theorectical probabilites then delete all but ONE of the 44's (the duplicate data points) and the calculate the differences and then find then take the absolute values of the differences and my D-value is the largest of those. The question is which ONE do leave behind? Its a choice that has an impact on the resulting D-value. The dataset I am working almost halves in size when I remove the duplicates, I have a lot of measurments that are the same. I am testing the goodness-of-fit of the data to see if it's distrobution is Gamma (or one of its special cases). I hope the description of what I'm doing makes sense. What method do you use to test goodness-of-fit for coninuous distributions? Thanks in advance Michael . . ================================================================= Instructions for joining and leaving this list, remarks about the problem of INAPPROPRIATE MESSAGES, and archives are available at: . http://jse.stat.ncsu.edu/ . =================================================================
