On Tue, Apr 5, 2011 at 8:41 AM, Alexander Korotkov <aekorot...@gmail.com> wrote:
> For example, here is distribution of q-grams count in 120 Mb of dblp paper
> titles (pretty large dataset).
>
>   q   count
>   2   7218
>   3   115107
>   4   589428
>   5   1648453
>   6   3336685
>
> Number of 5-grams is about 15x larger than number of 3-grams. But most part
> of index space will be occupied by links to the rows (about 120 million
> links), while size of q-grams itself will be almost ignorable in comparison
> with it.

I am probably being stupid here, but doesn't the number of links to rows
grow proportionately to the number of n-grams?

--
Robert Haas
EnterpriseDB: http://www.enterprisedb.com
The Enterprise PostgreSQL Company
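
One way to look at the question above is to count both quantities on a toy inverted index. The sketch below is only an illustration under stated assumptions: it is not how pg_trgm or GIN is implemented, the names `qgrams` and `index_stats` and the three sample titles are made up for this example, q-grams are plain character substrings with no padding, and a row is linked at most once per distinct q-gram it contains.

```python
def qgrams(text, q):
    """Return the set of distinct q-grams (substrings of length q) in text.

    Assumption: no padding and no case folding, unlike a real trigram index.
    """
    return {text[i:i + q] for i in range(len(text) - q + 1)}


def index_stats(rows, q):
    """Build a toy inverted index; return (distinct q-grams, total row links)."""
    postings = {}  # q-gram -> set of row ids that contain it
    for row_id, text in enumerate(rows):
        for gram in qgrams(text, q):
            postings.setdefault(gram, set()).add(row_id)
    links = sum(len(ids) for ids in postings.values())
    return len(postings), links


# Hypothetical sample data, standing in for the dblp titles.
titles = [
    "fast similarity search",
    "similarity joins in databases",
    "indexing text for search",
]

for q in (2, 3, 4, 5):
    keys, links = index_stats(titles, q)
    print(f"q={q}: distinct q-grams={keys}, links to rows={links}")
```

Under these assumptions a row of length n contributes at most n - q + 1 links, so the total number of links is bounded by the total text length whatever q is, while the number of distinct q-grams is free to grow with q. Whether the same holds for the real index depends on how it stores its posting lists.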