Re: LZMA compression

Matt Campbell Mon, 12 Feb 2007 13:35:40 -0800

Alex Pelts wrote:

I did not know that single stream was maintained. How do new clients join the session? Is dictionary replicated when new client joins the session?

In the ZRLE encoding, a zlib stream is maintained for each connected client. The same would be done for LZMA or any other compression method.
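
A minimal sketch of the per-client stream idea, using zlib as ZRLE does. The ClientStream class and the random stand-in pixel data are hypothetical, not taken from any existing server; the point is only that each client gets its own long-lived compressor and that chunks must be decoded in the order they were produced.

```python
import random
import zlib


class ClientStream:
    """One long-lived zlib stream per connected client (hypothetical helper)."""

    def __init__(self):
        self._compressor = zlib.compressobj()

    def encode_rectangle(self, pixels: bytes) -> bytes:
        # Z_SYNC_FLUSH emits everything compressed so far without ending the
        # stream, so the dictionary carries over to the next rectangle.
        return self._compressor.compress(pixels) + self._compressor.flush(zlib.Z_SYNC_FLUSH)


# Deterministic, poorly compressible stand-in for pixel data.
rng = random.Random(0)
rect = bytes(rng.getrandbits(8) for _ in range(2000))

server_side = ClientStream()
first = server_side.encode_rectangle(rect)
second = server_side.encode_rectangle(rect)  # same content again

# The viewer keeps one matching decompressor and feeds it chunks in order.
viewer_side = zlib.decompressobj()
decoded_first = viewer_side.decompress(first)
decoded_second = viewer_side.decompress(second)
```

Here the second chunk comes out much smaller than the first, because the stream still remembers the first rectangle. A client that joins later would simply get a fresh ClientStream of its own, so no dictionary has to be replicated.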

Having given the matter some more thought, I'd like to propose a new encoding, tentatively called LZMA Cells.


The LZMA Cells encoding:

This encoding divides the rectangle into variable-height rows, each of which consists of variable-width sub-rectangles called cells. For each row, the sum of the widths of all cells equals the width of the whole rectangle. Likewise, the sum of the heights of all rows equals the height of the whole rectangle. The number of rows and the number of cells in each row are therefore not explicitly specified in the encoding.

Each cell is represented by raw pixels, using the CPIXEL type as defined by the ZRLE encoding.

This encoding uses LZMA for compression. A single LZMA stream is maintained for the duration of the RFB session, so all rectangles must be encoded and decoded strictly in order.

On the wire, the encoding consists of a four-byte length field followed by the specified number of bytes of LZMA-compressed data:


        U32 length;
        U8 lzmaData[length];
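
A minimal sketch of this framing, treating the compressed payload as opaque bytes. It assumes the length field is big-endian, like other multi-byte integers in RFB; the proposal itself does not state the byte order. The names frame_payload and read_frame are hypothetical.

```python
import struct


def frame_payload(lzma_data: bytes) -> bytes:
    # U32 length (assumed big-endian), then that many bytes of compressed data.
    return struct.pack(">I", len(lzma_data)) + lzma_data


def read_frame(buffer: bytes, offset: int = 0):
    """Return (payload, next_offset), or None if the frame is incomplete."""
    header_end = offset + 4
    if len(buffer) < header_end:
        return None  # length field not fully received yet
    (length,) = struct.unpack_from(">I", buffer, offset)
    payload_end = header_end + length
    if len(buffer) < payload_end:
        return None  # payload not fully received yet
    return buffer[header_end:payload_end], payload_end
```

Returning None for a short buffer lets a reader wait for more bytes from the socket before trying again.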

The uncompressed representation of the rectangle consists of an arbitrary number of rows. Each row begins with a field specifying its height in pixels:


        U16 rowHeight;

This field is followed by an arbitrary number of cells. Each cell consists of a field specifying its width in pixels, followed by the raw pixel data:


        U16 cellWidth;
        CPIXEL data[cellWidth * rowHeight];
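
A minimal sketch of how the uncompressed layout could be written and parsed. Since row and cell counts are implicit, the decoder reads cells until their widths add up to the rectangle width, and rows until their heights add up to the rectangle height. It assumes a 3-byte CPIXEL, big-endian U16 fields, and hypothetical function names; none of these are fixed by the proposal.

```python
import struct

CPIXEL_SIZE = 3  # assumption: 3-byte CPIXEL, as for 32-bit true colour


def encode_rows(rows):
    """rows: list of (row_height, [(cell_width, pixel_bytes), ...])."""
    out = bytearray()
    for row_height, cells in rows:
        out += struct.pack(">H", row_height)
        for cell_width, pixels in cells:
            if len(pixels) != cell_width * row_height * CPIXEL_SIZE:
                raise ValueError("cell data does not match its dimensions")
            out += struct.pack(">H", cell_width) + pixels
    return bytes(out)


def decode_rows(data, rect_width, rect_height):
    rows, offset, height_left = [], 0, rect_height
    while height_left > 0:  # rows end when the heights sum to rect_height
        (row_height,) = struct.unpack_from(">H", data, offset)
        offset += 2
        if row_height == 0 or row_height > height_left:
            raise ValueError("bad row height")
        cells, width_left = [], rect_width
        while width_left > 0:  # cells end when the widths sum to rect_width
            (cell_width,) = struct.unpack_from(">H", data, offset)
            offset += 2
            if cell_width == 0 or cell_width > width_left:
                raise ValueError("bad cell width")
            size = cell_width * row_height * CPIXEL_SIZE
            pixels = data[offset:offset + size]
            if len(pixels) != size:
                raise ValueError("truncated cell data")
            cells.append((cell_width, pixels))
            offset += size
            width_left -= cell_width
        rows.append((row_height, cells))
        height_left -= row_height
    return rows
```

Rejecting zero-sized or oversized rows and cells keeps a malformed stream from looping forever or overrunning the rectangle.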

Rationale:

This encoding emphasizes simplicity and flexibility, leaving most optimization to the compressor. It does not support palettes or run-length encoding; a good compression algorithm with a reasonably large dictionary renders these complications unnecessary. However, it does use the CPIXEL (compressed pixel) type, because in the common 32-bit true-color pixel format, 8 bits for each pixel are never used.

This encoding attempts to facilitate the division of a large rectangle into smaller sub-rectangles which contain patterns that the compressor can recognize and efficiently compress. Because the optimal sub-rectangle size is subject to change, possibly even within a given rectangle, and is not necessarily known yet, the encoding allows a rectangle to be divided into arbitrarily many sub-rectangles of varying sizes. Current servers may simply use tiles of fixed size, as in the Hextile and ZRLE encodings, but requiring this would be short-sighted. For example, a sophisticated server may be able to optimize the encoding of rendered text when each glyph has a distinct bounding box (e.g. most common character sets when italics are not used). If each cell corresponds to a glyph, the compressor may be able to compress the rendered text more efficiently than otherwise, especially when characters or even sequences of characters recur frequently. In the simpler case of fixed-size tiles, the compressor should be able to minimize the overhead of repeating the tile width and height.
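
A minimal sketch of the simple fixed-size case: how a server might express fixed tiles as rows and cells. The function name is hypothetical, and the default tile size of 64 is only an assumption borrowed from ZRLE's 64x64 tiles.

```python
def fixed_tile_layout(rect_width, rect_height, tile=64):
    """Return [(row_height, [cell_width, ...]), ...] for fixed-size tiles."""
    # Every row uses the same cell widths; the last cell may be narrower.
    cell_widths = [min(tile, rect_width - x) for x in range(0, rect_width, tile)]
    # The last row may be shorter than the tile size.
    return [
        (min(tile, rect_height - y), list(cell_widths))
        for y in range(0, rect_height, tile)
    ]
```

For a 150x70 rectangle this yields two rows, of heights 64 and 6, each with cells of widths 64, 64 and 22. A glyph-aware server would produce the same kind of structure, just with widths and heights taken from the glyph bounding boxes instead.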


Thoughts?

Matt
