On Thu, Apr 11, 2019 at 06:20:47PM +0300, Heikki Linnakangas wrote:
> On 11/04/2019 17:54, Tom Lane wrote:
>> Ashwin Agrawal <aagra...@pivotal.io> writes:
>>> Thank you for trying it out. Yes, I noticed that for certain patterns
>>> pg_lzcompress() actually requires a much larger output buffer. For
>>> example, one 86-byte source required a 2296-byte output buffer. The
>>> current zedstore code doesn't handle this case and errors out. LZ4
>>> works fine for the same patterns, so I would highly recommend using
>>> LZ4 only; it's very fast as well.

>> You realize of course that *every* compression method has some inputs
>> that it makes bigger.  If your code assumes that compression always
>> produces a smaller string, that's a bug in your code, not the
>> compression algorithm.

> Of course. The code is not making that assumption, although clearly
> there is a bug there somewhere because it throws that error. It's
> early days...

> In practice it's easy to weasel out of that by storing the data
> uncompressed if compression would make it longer. Then you need an
> extra flag somewhere to indicate whether it's compressed or not. It
> doesn't break the theoretical limit, because the actual stored length
> is then original length + 1 bit, but it's usually not hard to find a
> place for one extra bit.
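
FWIW, the fallback you describe looks roughly like this with the pglz
API (src/common/pg_lzcompress.c); the Item struct and the
ITEM_COMPRESSED bit below are made up for illustration, not the actual
zedstore layout:

    #include "postgres.h"
    #include "common/pg_lzcompress.h"

    #define ITEM_COMPRESSED  0x01   /* hypothetical flag bit */

    typedef struct Item
    {
        uint16  flags;
        int32   len;                /* length of data[] as stored */
        char    data[FLEXIBLE_ARRAY_MEMBER];
    } Item;

    static Item *
    item_form(const char *src, int32 srclen)
    {
        /* pglz wants a scratch buffer of PGLZ_MAX_OUTPUT(srclen) bytes */
        char   *scratch = palloc(PGLZ_MAX_OUTPUT(srclen));
        int32   clen = pglz_compress(src, srclen, scratch,
                                     PGLZ_strategy_always);
        Item   *item;

        if (clen >= 0 && clen < srclen)
        {
            /* compression saved space: store compressed, set the bit */
            item = palloc(offsetof(Item, data) + clen);
            item->flags = ITEM_COMPRESSED;
            item->len = clen;
            memcpy(item->data, scratch, clen);
        }
        else
        {
            /* incompressible input: store it verbatim, bit clear */
            item = palloc(offsetof(Item, data) + srclen);
            item->flags = 0;
            item->len = srclen;
            memcpy(item->data, src, srclen);
        }

        pfree(scratch);
        return item;
    }

So the stored size only ever grows by the item header, never by the
compressor, which is the one extra bit (well, flag) you mention.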


Don't we already have that flag, though? I see ZSCompressedBtreeItem has
t_flags, and there's ZSBT_COMPRESSED, but maybe it's more complicated.
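
If so, then presumably the read side just branches on that bit,
something like this (t_flags and ZSBT_COMPRESSED are from the patch,
the payload/size field names are my guesses, and I'm assuming the
five-argument pglz_decompress() on current master):

    if ((item->t_flags & ZSBT_COMPRESSED) == 0)
    {
        /* stored verbatim, compression would have expanded it */
        memcpy(dst, item->t_payload, item->t_size);
    }
    else
    {
        /* stored compressed, expand back to the original size */
        if (pglz_decompress(item->t_payload, item->t_size, dst,
                            item->t_uncompressedsize, true) < 0)
            elog(ERROR, "compressed zedstore item is corrupt");
    }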


--
Tomas Vondra                  http://www.2ndQuadrant.com
PostgreSQL Development, 24x7 Support, Remote DBA, Training & Services
