On Fri, 25 Jan 2002, Gilles Detillieux wrote:
> Date: Fri, 25 Jan 2002 15:33:46 -0600 (CST)
> From: Gilles Detillieux <[EMAIL PROTECTED]>
> To: garsila Ndzmande <[EMAIL PROTECTED]>
> Cc: "ht://Dig mailing list" <[EMAIL PROTECTED]>
> Subject: Re: [htdig] rundig fails during PDF indexing
>
> According to garsila Ndzmande:
> > How can I prevent rundig from crashing and dumping core, due to Bad or
> > protected PDF files?
>
> Set your max_doc_size appropriately. See http://www.htdig.org/FAQ.html#q5.2
This may be the kind of problem that can only be solved on a case-by-case
basis, and unfortunately the offending file cannot be isolated without
re-running the dig with -v ;( There are many PDF and MS Word documents,
Excel sheets, etc., on our site that we can browse without a problem, yet
htdig still spits out error messages during the dig ;(
For example, there is an Excel file on our site, about 40K (well under
max_doc_size), which causes rundig to generate an error message reading
something like "malloc could not allocate memory...", but it does not
dump core or stop digging. I ended up running htdig with -v to find the
culprit's file name and excluding it from the dig ;)
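As a rough sketch of the exclusion step, something along these lines in
htdig.conf should do it. The file path is a made-up placeholder, and the
first two patterns are only what I believe the stock default to be, so
check your own config before copying this:

```
# htdig.conf fragment (sketch; the .xls path below is hypothetical)
# exclude_urls is a space-separated list of substrings; any URL that
# contains one of them is skipped during the dig.
# Setting the attribute replaces the previous value, so keep the
# existing patterns and append the culprit found with -v.
exclude_urls:   /cgi-bin/ .cgi /reports/problem-sheet.xls
```

After changing it, re-run the dig and check that the error message is gone.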
Regards,
Joe
--
_/ _/_/_/ _/ ____________ __o
_/ _/ _/ _/ ______________ _-\<,_
_/ _/ _/_/_/ _/ _/ ......(_)/ (_)
_/_/ oe _/ _/. _/_/ ah [EMAIL PROTECTED]
_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html