On Dienstag, 20. Oktober 2009 13:57:08 [email protected] wrote:
> Hi,
>
> I help currently someone wanting to build his own ZIM parser.
> I know this is not necessary... but he wants to do it and I find this is
> interesting to have someone trying to do that.
>
> I have remarked that on the wiki nothing is written about the compression:
> * How do we know if a cluster is compressed or not ?
> * What are the possible value (for different compression methods) ?
>
> Regards
> Emmanuel
>
> _______________________________________________
> dev-l mailing list
> [email protected]
> https://intern.openzim.org/mailman/listinfo/dev-l
Hi,
unfortunately there are several areas, which are not specified completely. But
luckily this compression flag is at least partly ;-)
The first byte in a cluster specifies the compression. The value is:
0 default (no compression)
1 none (also no compression, I don't know, why vlado specified
this in zeno,
but I take ist over to zim)
2 zip (zlib)
3 bzip2 (currently used in writer)
4 lzma (not implemented in reader or writer due to lack of
compression
library)
You can find the flag here: http://openzim.org/ZIM_File_Format#Clusters. The
actual values are not documentated but can be found in the header zim/zim.h as
a enum.
It is really no necessary to implement a parser but it does not hurt. It helps
discussing ideas for improvements. And it helps pushing zim as a standard, if
we encourage people to work with it.
Tommi
_______________________________________________
dev-l mailing list
[email protected]
https://intern.openzim.org/mailman/listinfo/dev-l