On 06/06/15 04:07, deavid wrote:
There are several use cases where an index would be useful, but adding it would slow down inserts and updates too much. For example, when we have 10 million rows in a table with frequent updates, we need several indexes to speed up selects, but they slow down updates a lot, especially when there are 10 or more of them. Another case is indexes for text search, which are used only for user searches and aren't that important, so we want to have them, but we don't want the overhead they add to every write on the table. I know there are approaches that already solve some of these problems in some ways (table partitioning, partial indexes, etc.), but I don't feel they are the solution to every problem of this kind.
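For instance (table and column names made up), a partial index and an expression index for text search are the kind of partial mitigations I mean:

    -- A partial index only covers the rows the queries actually need,
    -- so updates to other rows don't have to maintain it.
    CREATE INDEX orders_open_customer_idx
        ON orders (customer_id)
        WHERE status = 'open';

    -- A GIN expression index keeps text search usable without changing the
    -- table layout, but every insert and non-HOT update still has to maintain it.
    CREATE INDEX documents_body_fts_idx
        ON documents
        USING gin (to_tsvector('english', body));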

Some people have already asked for "delayed write" indexes, but the idea gets discarded because the index could get out of sync and omit results, which is unacceptable. I think that could be fixed in several ways, so we could have a fast and reliable index (though maybe not as fast on selects).

Since I don't know every internal of Postgres, it seems simpler to share the idea here and ask which things can or cannot be done.

Let's imagine there is a new type of index called "weird_btree", where we trade off simplicity for speed. In almost every mode we rely on VACUUM to put the index back into an optimal state.

Mode 1: on "aminsert", mark the index as INVALID. So if you modify the table, you need to run REINDEX / CREATE INDEX CONCURRENTLY before the index can be used by a SELECT again. This is almost the same as CREATE INDEX CONCURRENTLY; the main difference is that you don't have to remember to drop the index before writing. (I don't see much benefit here.)
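With today's tools, recovering from that state would look roughly like this (index and table names invented; a plain REINDEX would take a stronger lock):

    -- Rebuild the stale index without blocking writers:
    CREATE INDEX CONCURRENTLY orders_customer_idx_new ON orders (customer_id);
    DROP INDEX CONCURRENTLY orders_customer_idx;   -- the old, invalid index
    ALTER INDEX orders_customer_idx_new RENAME TO orders_customer_idx;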

Mode 2: on "aminsert", put the new entry in a plain, unordered list instead of the btree. Appending to a list should be faster than inserting into a big btree, and we'll know later which entries are still missing from the index.

Mode 2.a: on index scan (amrescan, amgettuple), pretend the list comes after the btree and return its entries as well, out of order. We'd have to tell Postgres that the index isn't sorted, so it has to recheck every row.

Mode 2.b: mark the index invalid instead. On the next VACUUM, sort the list and insert it into the btree in a bulk operation; if that succeeds, mark the index valid again.
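As far as I know, GIN's fast-update pending list already behaves a lot like modes 2 and 2.b: new entries are appended to an unordered pending list, scans have to read that list too, and VACUUM merges it into the main structure. For example (names made up):

    CREATE INDEX documents_body_fts_idx
        ON documents
        USING gin (to_tsvector('english', body))
        WITH (fastupdate = on);   -- defer the real index work to a pending list

    -- The pending list gets folded into the main GIN structure here:
    VACUUM documents;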

Mode 3: on "aminsert", put the new entry into a second btree, leaving the first one untouched. Because the second btree is new it will be small, so writes should be faster. On an index scan, read tuples from both at the same time (like a merge sort). On VACUUM, merge the second btree into the first. In this mode the index output stays sorted and no recheck is needed.
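The closest user-level analogue of mode 3 I can think of is a small, separately indexed "delta" table that is merged with the big one at read time and folded back in later; just a sketch with invented names:

    CREATE TABLE measurements       (id bigint, ts timestamptz, val numeric);
    CREATE TABLE measurements_delta (LIKE measurements);
    CREATE INDEX ON measurements       (ts);   -- big, stable btree
    CREATE INDEX ON measurements_delta (ts);   -- small btree, cheap to write

    -- Reads look at both, like scanning the two btrees at the same time:
    SELECT * FROM measurements       WHERE ts >= now() - interval '1 day'
    UNION ALL
    SELECT * FROM measurements_delta WHERE ts >= now() - interval '1 day';

    -- Periodic merge step, the equivalent of the merge on VACUUM above:
    WITH moved AS (DELETE FROM measurements_delta RETURNING *)
    INSERT INTO measurements SELECT * FROM moved;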

Does anyone think this would be an interesting feature for PostgreSQL?
Did I miss something?

PS: Maybe it's also possible to take advantage of clustering and have indexes whose entries are ranges of TIDs; but I'm not sure whether that is too exotic, or whether it would make a difference.
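The BRIN (block range) indexes being added in 9.5 seem to go in that direction, if I understand them correctly: each entry summarises a whole range of blocks instead of pointing at individual tuples, so the index stays tiny and cheap to maintain, at the cost of rechecking the rows inside each matching range. For example:

    -- Works best when created_at is correlated with the physical row order:
    CREATE INDEX events_created_brin ON events USING brin (created_at);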

Sincerely,
David.
How about a hybrid indexing system with two parts:

(1) The existing index structure, which is checked first and is mostly optimised for read speed. If there are only a few inserts/updates and the system is not heavily loaded, it gets modified immediately. The thresholds for "too busy" and "few enough changes" could be configurable.

(2) An overflow index optimised for writing, possibly in memory and not backed by permanent storage. A crash would require a complete index rebuild, but only when there were entries in it (or at least more than some configurable threshold, to allow for cases where some missing index entries are acceptable).

So when the index is needed for a query, part 1 is checked first, and then part 2 if necessary.

Have a background process that moves entries from part 2 to part 1 when the system is less busy.
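A crude user-level sketch of that split, using an unlogged table as the "fast to write, lost on a crash" overflow (all names invented; it assumes documents(doc_id, body) already carries the read-optimised GIN index):

    -- Unlogged: not WAL-logged, so writes are cheap, but the contents are
    -- truncated after a crash and would have to be repopulated.
    CREATE UNLOGGED TABLE search_overflow (doc_id bigint, body text);
    CREATE INDEX search_overflow_fts
        ON search_overflow
        USING gin (to_tsvector('english', body));

    -- Queries have to consult both the main table (part 1) and the overflow (part 2):
    SELECT doc_id FROM documents
     WHERE to_tsvector('english', body) @@ plainto_tsquery('english', 'example')
    UNION
    SELECT doc_id FROM search_overflow
     WHERE to_tsvector('english', body) @@ plainto_tsquery('english', 'example');

    -- A background job would move rows from search_overflow into documents
    -- during quiet periods, so that the main index picks them up.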


Cheers,
Gavin




