On Mon, May 12, 2014 at 5:47 AM, Alexander Tsoy <alexan...@tsoy.me> wrote:

> В Sun, 11 May 2014 18:26:32 -0500
> Gordon Pettey <petteyg...@gmail.com> пишет:
>
> > A lot of small files (e.g. AUTHORS, ChangeLog
> >
> > FWIW: On my system, I have 59M of bz2 files in /usr/share/man and
> > /usr/share/doc. A short script to decompress those and recompress with xz
> > -6e reduced that to 36M.
>
> Very strange o_O
>
> Here is my test results. xz options: "--lzma2=preset=6e,dict=4MiB".
> Larger dictionary size does not improve compression ratio, I get
> even worse results with just "-6e" or "-9e". man-bz2 is a full copy of
> my /usr/share/man, man-xz is a recompressed one.
>
> Size comparison:
>
> $ du -s man-bz2/ man-xz/
> 82032   man-bz2/
> 82308   man-xz


Did you skip all the files that weren't bz2 in the first place, and
decompress bz2 before compressing with xz? My comparison script does not
include uncompressed files. It copies all the bz2 files to a new folder,
pipes those through bzip -d to xz -6e to files in another new folder, then
compares the total size of those folders. Out of 8576 compressed files,
only 464 were larger in xz than in bz2. A very bad timing test I just did
showed the total decompression time of all the xz files to be half that of
decompressing all the bz2 files. Working on getting that data per-file and
averages.

Reply via email to