> Most likely these particular PDF files are larger than the max_doc_size
> attribute you have set in your configuration file. Htdig is only
> fetching part of the file and passing this to the conversion utility.
> Truncated PDF files cannot be parsed.
Yes, this, combined with a few stupid problems, fixed this. I
underestimated the size of the files...
> You should also consider switching from parse_doc to doc2html, a new
> version of which should be available to download shortly.
Looking forward to it!
Thanks,
Dave
--
Dave Wreski
Lead Systems Engineer Guardian Digital, Inc.
(201) 934-9230 Pioneering. Open Source. Security.
[EMAIL PROTECTED] http://www.guardiandigital.com
------------------------------------
To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED]
You will receive a message to confirm this.
List archives: <http://www.htdig.org/mail/menu.html>
FAQ: <http://www.htdig.org/FAQ.html>