> Most likely these particular PDF files are larger than the max_doc_size
> attribute you have set in your configuration file.  Htdig is only
> fetching part of the file and passing this to the conversion utility.
> Truncated PDF files cannot be parsed.

Yes, this, combined with a few stupid problems, fixed this. I
underestimated the size of the files...

> You should also consider switching from parse_doc to doc2html, a new
> version of which should be available to download shortly.

Looking forward to it!

Thanks,
Dave
--
Dave Wreski
Lead Systems Engineer                       Guardian Digital, Inc.
(201) 934-9230                Pioneering.  Open Source.  Security.
[EMAIL PROTECTED]            http://www.guardiandigital.com

------------------------------------
To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED]
You will receive a message to confirm this.
List archives:  <http://www.htdig.org/mail/menu.html>
FAQ:            <http://www.htdig.org/FAQ.html>

Reply via email to