René Peinl wrote:
Hi guys, I'm sorry to bother you with this, but you seem to be the
only people reachable outside Microsoft that know about the XLS file
format. I've dealt myself a lot with the XML versions and also did
some minor enhancements to the OO XSLTs that convert WordML to OO,
but now I need some help with the binary version. I'm trying to write
a comparison function that compares two versions of a document with
each other and should return true if the documents have the same
content and false otherwise. I'm using an MD5 hash to do this. The
reason is, that I want to eliminate versions of documents in
Sharepoint where only metadata has changed. Unfortunately, Sharepoint
is so clever that it writes Metadata not only into its own database,
but also inside the document itself, if it is an office document
type. Therefore I want to strip off the header (and trailer) that
contains metadata. For doc files this is quite easy. I just had to
remove (or overwrite with zeros) the first 2554 and the last 1520
bytes and compare the files afterwards. Unfortunately this strategy
does not work with XLS files. It seems that every sheet inside the
file has it's own copy of metadata. Can you give me any advice, how
to get rid of the metadata (just for the comparison). Is there any
byte sequence I can search for and then overwrite the next x byte
with zeros? I would be really thankful for any help. Thanks a lot and


We have a complete description of the OLE2 container file format:
http://sc.openoffice.org/compdocfileformat.pdf

Regards
Daniel

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to