On Wed, Dec 11, 2013 at 12:08 PM, Greg Stark <[email protected]> wrote: > The only thing I can think > of is maybe the most common elements are being selected preferentially > from the early part of the sample which is removing a substantial part > of the lower end of the range. But even removing 100 from the > beginning shouldn't be enough to push the median above 550.
Just to follow up here. I think what's going is that not only are the most_common_vals being preferentially taken from the beginning of the sample but also their frequency is being massively overestimated. All values have a frequency of about .001 but the head of the MCV has a frequency as high as .10 in some of my tests. -- greg -- Sent via pgsql-hackers mailing list ([email protected]) To make changes to your subscription: http://www.postgresql.org/mailpref/pgsql-hackers
