Re: LZMA inclusion

Phillip Lougher Sun, 07 Dec 2008 16:24:24 -0800

Jörn Engel wrote:

On Sat, 6 December 2008 23:56:50 +0200, Lasse Collin wrote:
Since you are improving the crypto API, maybe it would be a good idea to add a flag to tell the decoder that the whole output buffer will be kept available to the multi-call decoder.
I'm not convinced this is the right direction.  One of the constraints
of kernel programming is that large contiguous allocations are hard to
come by.  The mm subsystem makes no guarantees that you will be able to
allocate 1MiB of contiguous memory.  On a 32bit system with highmem, it may even
become hard to get 1MiB from vmalloc.

This is an important issue. On the last Squashfs submission attempt, its use of vmalloc to allocate up to 1MiB contiguous blocks for decompression was brought up. Any LZMA implementation which requires 1MiB vmalloced input and output buffers will probably face similar problems.


So another approach would be to ignore the one-shot debate and
concentrate on taking a pagevec instead of a buffer (as in a void *
pointer).  That would certainly be useful for other compressed
filesystems and without checking the code (I forgot where the squashfs
git tree was) I claim it should be useful for squashfs as well.

Squashfs doesn't use one-shot decoding with zlib, for performance and memory reasons. Input data is split across buffer_heads (4 KiB or less per buffer_head), and calling zlib repeatedly for each separate buffer_head eliminates the memcpy that would otherwise be needed to gather the data into a larger input buffer, eliminates the memory overhead of that buffer, and ensures only the first buffer_head needs to be waited on (for arrival off disk) before decompression starts.
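
To make the multi-call pattern concrete, here is a minimal userspace sketch. It is not zlib and not Squashfs code: the toy run-length format, struct rle_stream, rle_decode() and decode_blocks() are all hypothetical names invented for illustration. The point is only that the decoder keeps its state between calls, so each input block can be handed over in place, without first being copied into one large buffer, even when a symbol straddles two blocks.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical decoder state for a toy run-length format in which the
 * stream is a sequence of (count, value) byte pairs. */
struct rle_stream {
    const unsigned char *next_in;   /* current input block */
    size_t avail_in;                /* bytes left in that block */
    unsigned char *next_out;        /* where to write next */
    size_t avail_out;               /* space left in the output */
    unsigned int count;             /* count byte of a half-read pair */
    unsigned int pending;           /* copies of `value` still to emit */
    unsigned char value;
    int have_count;                 /* count seen, value not yet seen */
};

/* Consume the current input block. Returns when the block is used up
 * or the output is full; all state survives until the next call. */
static void rle_decode(struct rle_stream *s)
{
    for (;;) {
        unsigned char b;

        /* Emit whatever is left of the current run. */
        while (s->pending > 0 && s->avail_out > 0) {
            *s->next_out++ = s->value;
            s->avail_out--;
            s->pending--;
        }
        if (s->pending > 0 || s->avail_in == 0)
            return;

        b = *s->next_in++;
        s->avail_in--;
        if (!s->have_count) {
            s->count = b;
            s->have_count = 1;
        } else {
            s->value = b;
            s->pending = s->count;
            s->have_count = 0;
        }
    }
}

/* Feed the blocks one at a time, much as a filesystem could feed one
 * buffer_head at a time. Returns the number of bytes produced. */
static size_t decode_blocks(const unsigned char *const *blocks,
                            const size_t *lens, size_t nblocks,
                            unsigned char *out, size_t out_len)
{
    struct rle_stream s;
    size_t i;

    assert(out != NULL);
    memset(&s, 0, sizeof(s));
    s.next_out = out;
    s.avail_out = out_len;

    for (i = 0; i < nblocks; i++) {
        s.next_in = blocks[i];
        s.avail_in = lens[i];
        rle_decode(&s);
        if (s.avail_in > 0)
            break;                  /* output buffer is full */
    }
    return out_len - s.avail_out;
}
```

In this sketch a block can end between the count byte and the value byte of a pair; the have_count flag carries that across the call boundary, which is the property a multi-call decoder has to provide.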

Currently, as mentioned above, Squashfs decompresses into a single contiguous output buffer. But, due to the Linux kernel mailing list's dislike of vmalloc, this is being changed. In future Squashfs will decompress into a sequence of 4 KiB output buffers (possibly in the page cache).

One-shot LZMA decoding therefore isn't going to work very well with future versions of Squashfs. An obvious solution (as is currently done with the Squashfs-LZMA patches) is to use separately allocated contiguous input/output buffers, and memcpy into and out of them, but this isn't ideal.

The idea discussed earlier, of using the output buffer as the temporary workspace (as it isn't touched until after decompression is completely finished), will work with the current version of Squashfs, but it isn't going to work with later versions unless the LZMA code can be changed to work with a list of discontiguous output buffers (i.e. a scatter-gather type list).
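
As a rough illustration of what working with discontiguous output buffers would involve, here is a minimal userspace sketch. It is not taken from the LZMA code or from Squashfs: struct sg_out, sg_at(), sg_put() and sg_copy_match() are hypothetical names, and the 4 KiB page size is an assumption. An LZ-style decoder reads back its own earlier output when it copies a match, so every such access has to be translated from a linear offset into a (page, offset) pair.

```c
#include <assert.h>
#include <stddef.h>

#define SG_PAGE_SIZE 4096u

/* Hypothetical output sink: an array of separately allocated pages. */
struct sg_out {
    unsigned char **pages;
    size_t npages;
    size_t pos;                     /* total bytes written so far */
};

/* Translate a linear output offset into an address inside a page. */
static unsigned char *sg_at(const struct sg_out *o, size_t off)
{
    assert(off < o->npages * SG_PAGE_SIZE);
    return &o->pages[off / SG_PAGE_SIZE][off % SG_PAGE_SIZE];
}

/* Append one literal byte. Returns 0, or -1 if the pages are full. */
static int sg_put(struct sg_out *o, unsigned char b)
{
    if (o->pos >= o->npages * SG_PAGE_SIZE)
        return -1;
    *sg_at(o, o->pos) = b;
    o->pos++;
    return 0;
}

/* Copy `len` bytes starting `dist` bytes back in the output. The copy
 * goes byte by byte so that overlapping matches (dist < len) repeat
 * the pattern, as LZ77-style formats expect. Source and destination
 * may each sit in a different page. */
static int sg_copy_match(struct sg_out *o, size_t dist, size_t len)
{
    if (dist == 0 || dist > o->pos)
        return -1;
    while (len-- > 0) {
        if (sg_put(o, *sg_at(o, o->pos - dist)) < 0)
            return -1;
    }
    return 0;
}
```

The division and modulo on every byte are the naive form; a real decoder would presumably cache the current page pointer. The sketch only shows that match copying remains possible when the output is a list of pages rather than one vmalloced block.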

So it looks inevitable that a separately vmalloced workspace buffer will be required.


Phillip


Jörn


