Hi, On 2015-04-19 22:51:53 +0200, Tomas Vondra wrote: > The reason why I'm asking about this is the multivariate statistics patch - > while optimizing the planning overhead, I realized that considerable amount > of time is spent decompressing the statistics (serialized as bytea), and > using an algorithm with better decompression performance (lz4 comes to mind) > would help a lot. The statistics may be a few tens/hundreds kB, and in the > planner every millisecond counts.
I've a hard time believing that a different compression algorithm is going to be the solution for that. Decompressing, quite possibly repeatedly, several hundreds of kb during planning isn't going to be fun. Even with a good, fast, compression algorithm. But independent of this, it'd be good to make progress on it. > Would a differentiated approach work? That is, either adding an initdb > option allowing the user to choose an alternative compression algorithm (and > thus let him consider the possible patent issues), or using different > algorithms for different pieces of data (e.g. keep pglz for the user data, > and lz4 for statistics). > > The first option is quite trivial to implement - I already have an > experimental patch implementing that (attached, but a bit dirty). The second > option is probably more difficult (we'd have to teach tuple toaster about > multiple compression algorithms and pass that information somehow). Also, > I'm not sure it'd make the patent concerns go away ... Note that I had implemented the "second option" somewhere down the lane in the [2] you refer to. > [2] > http://www.postgresql.org/message-id/20130614230142.gc19...@awork2.anarazel.de Greetings, Andres Freund -- Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org) To make changes to your subscription: http://www.postgresql.org/mailpref/pgsql-hackers