On Wed, Dec 11, 2013 at 12:08 PM, Greg Stark <st...@mit.edu> wrote:
> The only thing I can think
> of is maybe the most common elements are being selected preferentially
> from the early part of the sample which is removing a substantial part
> of the lower end of the range. But even removing 100 from the
> beginning shouldn't be enough to push the median above 550.

Just to follow up here: I think what's going on is that not only are the most_common_vals being taken preferentially from the beginning of the sample, but their frequencies are also being massively overestimated. All values have a true frequency of about .001, but the head of the MCV list has a frequency as high as .10 in some of my tests.

--
greg
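
For a sense of scale, here is a rough sketch of what the top sample frequencies would look like if the sample were drawn without bias from a column whose values each occur with frequency about .001. This is not the ANALYZE code; the function name, the 1000 distinct values, and the 30000-row sample size are all assumptions made for illustration.

```python
import random
from collections import Counter


def top_sample_frequencies(n_distinct=1000, sample_size=30000, top_k=5, seed=0):
    """Return the top_k (value, sample frequency) pairs from an unbiased sample.

    Hypothetical setup: n_distinct equally likely values, so each value has a
    true frequency of 1 / n_distinct (.001 with the defaults). sample_size is
    an assumed figure, not taken from the thread.
    """
    rng = random.Random(seed)
    # Draw each sampled row independently and uniformly over the values.
    sample = [rng.randrange(n_distinct) for _ in range(sample_size)]
    counts = Counter(sample)
    # most_common() sorts by count, highest first.
    return [(value, count / sample_size) for value, count in counts.most_common(top_k)]


if __name__ == "__main__":
    for value, freq in top_sample_frequencies():
        print(value, round(freq, 5))
```

Under these assumptions the head of the list should stay within a small multiple of .001, so a head frequency of .10 would be about two orders of magnitude above what sampling noise alone can explain.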