On 03/27/09 02:21, Ilmari Karonen wrote:
> Brion Vibber wrote:
>> On 3/26/09 12:30 PM, Ilmari Karonen wrote:
>>> The Wikipedia article (what else?) on the format says the blocks are
>>> padded to byte boundaries, and some quick testing seems to support that.
>> That is a filthy lie. :)
>>
>> There is indeed no byte padding between blocks; it made my 
>> implementation of a parallel bzip2 compressor much harder and I never 
>> got round to finishing the decompressor.
> 
> You're right, I misread it.  Only the whole stream is padded. :(
> 
> (In my defense, that seems like such a moronic design choice that I 
> couldn't believe it could be true.  If you're going to waste 48 bits per 
> block on pi-in-BCD anyway, it seems silly to skimp on the 4 bits of per 
> block that'd be needed on average to pad to a byte boundary.)

http://www.bzip.org/1.0.5/bzip2-manual-1.0.5.html#limits

"Much of this complexity could have been avoided if the compressed size 
of each block of data was recorded in the data stream."

lacos

_______________________________________________
Wikitech-l mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/wikitech-l

Reply via email to