Ximin Luo:
> Holger Levsen:
>> Hi Ximin,
>>
>> On Mon, Jun 19, 2017 at 04:08:32PM +0200, Ximin Luo wrote:
>>> Since 83 diffoscope supports a --json format output that can be read back 
>>> into
>>> another instance of diffoscope later. This allows people to experiment with
>>> generating different format outputs, using different settings such as higher
>>> limits, or even their local custom presenters that they haven't released 
>>> yet,
>>> testing them with "real" output that is hard to make up manually.
>>
>> I fear this might take a little while, we're currently a bit limited
>> on diskspace:
>>
>>          * h01ger wonders if we can compress all those diffoscope .txt and 
>> .html files…
>> < mapreri> the .txt, definitely
>> < mapreri> .html… I fear apache needs teaching for those
>> <  h01ger> 72G     dbd
>> <  h01ger> 58G     dbdtxt
>> <  h01ger> out of 190G used in total…
>> <  h01ger> adding buster was quite expensive :)
>> < mapreri> and infinity0 asked for json too now /o\
>> < mapreri> actually, I'm kind of surprised we're not compressing the txt 
>> already
>> <  h01ger> definitly first should compress the above
>>
>> …and that partition is "only" 220GB in size…
>>
> 
> Compressing text and json without worrying about apache would definitely be 
> fine for me. I don't think anyone is reading those, or would want to read 
> those, in their web browser. The primary purpose of the json is to be 
> downloaded and read back into diffoscope anyways.
> 

xz compression would save even more disk space. For diffoscope we probably want 
to pass in a bigger dictionary size, since some diffs are big and the default 
xz settings don't work on those.

With the GCC-6 diff I recently did, I experienced:

xz --lzma2=dict=1536MiB -9 # compresses the best, but very very slow
lrzip -L9                  # compresses slightly worse than the previous, but 
very quick
xz --lzma2=dict=1536MiB -2 # similar to lrzip

The nice thing about lrzip is it will auto-adjust dictionary size based on 
available RAM, but ofc users have to install a non-standard tool to decompress 
it. 1536MiB is the max size for xz.

X

-- 
GPG: ed25519/56034877E1F87C35
GPG: rsa4096/1318EFAC5FBBDBCE
https://github.com/infinity0/pubkeys.git

Reply via email to