According to Olivier Dupuis:
>     I'm presently trying to add pdf indexing to htdig.  Everything's 
> installed, everything seems to be well configured but when running 
> rundig -vvv, I have the following errors:
> 
>     External parser error in line:... mais �a peut co�ter cher** !
>  URL: http://www.uottawa.ca/services/ccs/htdig/plagiat.pdf
> External parser error in line:Attention
>  URL: http://www.uottawa.ca/services/ccs/htdig/plagiat.pdf
> External parser error in line:au plagiat* !
>  URL: http://www.uottawa.ca/services/ccs/htdig/plagiat.pdf
> External parser error in line:
>  URL: http://www.uottawa.ca/services/ccs/htdig/plagiat.pdf
> ...........
> ...........
> 
>     Is there any way to get more details as to what exactly is the 
> problem?

It would help to know what your external_parsers attribute is set to,
but based on the errors above, I'm fairly confident that you're trying
to use an external converter as an external parser.  Here are correct
examples of each one...

External converter...

  external_parsers:     application/pdf->text/html /usr/local/bin/conv_doc.pl

External parser...

  external_parsers:     application/pdf /usr/local/bin/parse_doc.pl

Note that for external converters, you must explicitly specify what
type of output htdig should expect form the converter.  If that isn't
specified, htdig assumes it's an external parser that will produce
records in the very specific format that htdig expects.  The conv_doc.pl
and doc2html.pl scripts are two commonly used external converters, and
both of them include examples of their correct usage in their comments.
For more details on this, have a closer look at

    http://www.htdig.org/attrs.html#external_parsers  and
    http://www.htdig.org/FAQ.html#q4.9

-- 
Gilles R. Detillieux              E-mail: <[EMAIL PROTECTED]>
Spinal Cord Research Centre       WWW:    http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba  Phone:  (204)789-3766
Winnipeg, MB  R3E 3J7  (Canada)   Fax:    (204)789-3930

_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a 
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html

Reply via email to