Re: [postgis-users] Slow construction of GiST index, but better with smaller # of big rows

Bo Guo Sun, 13 Jan 2019 09:02:03 -0800

Wenbo,

The law of physics is in play here. I think your approach is creativeand valid. I wish Postgis offered grid-base spatial index which groupedgeometry BBOXs in grids with user defined levels /sizes. A couple ofother thoughts:

1. You may also look in to table partitioning to physically breakup thelarge table.

2. In addition, depend on how your data is used and whether or not thedata is static, vector-tiling/cacheing the geometry on disk (out-side ofdatabase) may help.


Bo


On 1/12/19 9:28 AM, Wenbo Tao wrote:

Hello,
I was trying to build a GiST index on a geometry column in a tablewith 1 billion rows. It took an entire week to finish.
Then I reduced the number of rows by grouping closer objects intoone clump (using some clustering algorithm), and then compressed theclump as one row (the geometry column becomes the bounding box of allobjects in that clump). The construction then went way faster -- downto 12 hours. I did this because the query I need to answer is findingall objects whose bbox intersects with a given rectangle. I can nowquery all clumps whose bbox intersects with the rectangle.
So essentially, the index construction is slow for too many rows,but much faster for a smaller # of bigger rows. Any intuition why thisis the case would be greatly appreciated!
Thank you,
Wenbo Tao

_______________________________________________
postgis-users mailing list
postgis-users@lists.osgeo.org
https://lists.osgeo.org/mailman/listinfo/postgis-users

_______________________________________________
postgis-users mailing list
postgis-users@lists.osgeo.org
https://lists.osgeo.org/mailman/listinfo/postgis-users

Re: [postgis-users] Slow construction of GiST index, but better with smaller # of big rows

Reply via email to