Hello all,

nowadays I have implemented conv_doc.pl as general parser for PDF,
PostScript and M$ Word documents.
>From time to time I get error messages like:

--8<--8<--8<--

Error (0): PDF file is damaged - attempting to reconstruct xref table...
Error: Couldn't find trailer dictionary
Error: Couldn't read xref table
Error (0): PDF file is damaged - attempting to reconstruct xref table...
Error: Couldn't find trailer dictionary
Error: Couldn't read xref table
Error (139803): Bad colorspace

--8<--8<--8<--

Even though the 'max_doc_size' is set high enough for all PDFs to be parsed
correctly and the files are safe and sound (users can open/read them without
problems).
Therefore I wonder if this is a parser-dependant issue rather than a
configuration one. Maybe you have better experiences with other parsers
giving best results... I'd like to hear some before
downloading/installing/reconfiguring things here...

Thanks and best regards,

Martin


------------------------------------
To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED]
You will receive a message to confirm this.
List archives:  <http://www.htdig.org/mail/menu.html>
FAQ:            <http://www.htdig.org/FAQ.html>

Reply via email to