Now that the main GiST index build patch has been committed, there are a few further improvements that could make it much faster still:

Better management of the buffer pages on disk. At the moment, the temporary file is used as a heap of pages belonging to all the buffers, in random order. I think we could speed up the algorithm considerably by reading and writing the buffer pages sequentially. For example, when an internal page is split and all the tuples in its buffer are relocated, that would be a great chance to write the pages of the new buffers out in sequential order, instead of writing them back to the pages freed up by the original buffer, which can be scattered all around the temp file. I wonder if we could use a separate file for each buffer? Or at least a separate file for each buffer larger than, say, 100 MB. A sketch of the sequential-write idea follows.
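To make that concrete, here's a rough sketch of appending relocated pages at the end of the temp file, so that each buffer's pages stay physically contiguous. All the types and helpers below are made-up stand-ins, not the committed gistbuildbuffers code:

#include <stdio.h>
#include <stdlib.h>

#define BLCKSZ 8192

typedef unsigned int BlockNumber;

/* Stand-in for a node buffer's on-disk page list. */
typedef struct NodeBuffer
{
    BlockNumber *pages;
    int          npages;
} NodeBuffer;

/* Stand-in for the build's temporary file. */
typedef struct TempFile
{
    FILE        *fp;
    BlockNumber  nextBlock;   /* first never-written block at end of file */
} TempFile;

/* Append one page at the end of the temp file and return its block number.
 * Because we always extend the file, the pages written for one buffer
 * during a relocation end up physically contiguous, so emptying that
 * buffer later is a single sequential read. */
static BlockNumber
append_page(TempFile *tf, const char *page)
{
    BlockNumber blk = tf->nextBlock++;

    fseek(tf->fp, (long) blk * BLCKSZ, SEEK_SET);
    fwrite(page, BLCKSZ, 1, tf->fp);
    return blk;
}

/* On an internal page split, route each relocated page to its new buffer,
 * appending sequentially instead of recycling the scattered blocks freed
 * from the original buffer.  (The freed blocks would go on a free list
 * for later, less performance-critical reuse.) */
static void
relocate_page(TempFile *tf, NodeBuffer *dst, const char *page)
{
    dst->pages = realloc(dst->pages,
                         (dst->npages + 1) * sizeof(BlockNumber));
    dst->pages[dst->npages++] = append_page(tf, page);
}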

Better management of in-memory buffer pages. When we start emptying a buffer, we currently flush all the buffer pages in memory to the temporary file, to make room for new buffer pages. But that's a waste of time if some of the pages we had in memory belonged to the buffer we're about to empty next, or to the buffers we empty tuples into. Also, if all the tuples go to just one or two lower-level buffers while a buffer is being emptied, it would be beneficial to keep more than one in-memory page for those buffers. A sketch of the eviction step follows.
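Roughly, the eviction step could look like this; the types and helpers are again hypothetical stand-ins for the real bookkeeping:

#include <stdbool.h>
#include <stddef.h>

typedef struct NodeBuffer NodeBuffer;

/* Stand-in for a buffer page held in memory. */
typedef struct MemPage
{
    NodeBuffer     *owner;    /* node buffer this page belongs to */
    bool            dirty;
    struct MemPage *next;
} MemPage;

/* Hypothetical helpers: write a page out to the temp file, and test
 * whether 'src' pushes its tuples down into 'dst' during the current
 * emptying pass. */
extern void flush_page(MemPage *page);
extern bool empties_into(NodeBuffer *src, NodeBuffer *dst);

/* Before emptying 'target', evict only pages belonging to unrelated
 * buffers.  Pages of 'target' itself, and of the lower-level buffers it
 * is about to push tuples into, will be touched momentarily anyway, so
 * flushing them now would only force an immediate re-read. */
static void
make_room_for_emptying(MemPage *inMemoryPages, NodeBuffer *target)
{
    MemPage *p;

    for (p = inMemoryPages; p != NULL; p = p->next)
    {
        if (p->owner == target || empties_into(target, p->owner))
            continue;             /* keep resident */
        if (p->dirty)
            flush_page(p);
    }
}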

Buffering leaf pages. I already posted about this on a separate thread: http://archives.postgresql.org/message-id/4e5350db.3060...@enterprisedb.com


Also, at the moment there is one issue with the algorithm that we have glossed over so far: for each buffer, we keep some information in memory, in the hash table and in the auxiliary lists. That means that the amount of memory needed for the build scales with the size of the index. If you're dealing with very large indexes, hopefully you also have a lot of RAM in your system, so I don't think this is a problem in practice. Still, it would be nice to do something about it. A straightforward idea would be to swap some of that information to disk. Another idea, simpler to implement, would be to completely destroy a buffer, freeing all the memory it uses, when it becomes completely empty. Then, if you're about to run out of memory (as defined by maintenance_work_mem), you can empty some low-level buffers to disk, destroy them, and reclaim their memory that way. A sketch follows.
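Something along these lines, with hypothetical names standing in for the build-wide bookkeeping:

#include <stdlib.h>

/* Stand-in for a node buffer's in-memory bookkeeping. */
typedef struct NodeBuffer
{
    void *hashKey;            /* key under which the buffer is hashed */
    void *pages;              /* per-buffer page list / metadata */
} NodeBuffer;

/* Hypothetical helpers over the build-wide state. */
extern void hash_remove(void *hashKey);
extern long memory_used(void);
extern long maintenance_work_mem_bytes(void);
extern NodeBuffer *pick_low_level_buffer(void);
extern void empty_buffer(NodeBuffer *buf);

/* Once a buffer is completely empty, drop all bookkeeping for it instead
 * of carrying the entry around for the rest of the build. */
static void
destroy_empty_buffer(NodeBuffer *buf)
{
    hash_remove(buf->hashKey);
    free(buf->pages);
    free(buf);
}

/* If we're about to exceed maintenance_work_mem, empty some low-level
 * buffers to disk and destroy them to reclaim their in-memory metadata. */
static void
reclaim_memory_if_needed(void)
{
    while (memory_used() > maintenance_work_mem_bytes())
    {
        NodeBuffer *victim = pick_low_level_buffer();

        if (victim == NULL)
            break;            /* nothing left to reclaim */
        empty_buffer(victim);
        destroy_empty_buffer(victim);
    }
}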

--
  Heikki Linnakangas
  EnterpriseDB   http://www.enterprisedb.com
