>At 6:30 AM -0400 1/11/99, Shyam B S wrote:
>>I am trying to index and Search MS Word and PDF files. I am using
catdoc
>>and acrobat as the external parsers for these documents. htsearch
finds
>
>Do you mean that you've specified catdoc and acrobat in the external
parser
>attribute? If so, it's not going to work reliably (if at all). The
external
>parser support expects output to follow certain guidelines documented
in
>http://www.htdig.org/attrs.html#external_parser so you can't just plug
any
>program in.
>
>If you're running any of the 3.1.0bX series, they include a PDF parser
that
>works with acrobat (and should work out of the box). More recent betas
>include scripts to handle Word documents using catdoc.
>
Thanks. I am using htparsedoc as the external parser which calls catdoc
for word docs. I could solve the problem, by modifying the htparsedoc to
return record type h along with record types title(t) and words(w).
Shyam
Shyam
______________________________________________________
Get Your Private, Free Email at http://www.hotmail.com
----------------------------------------------------------------------
To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED] containing the single word "unsubscribe" in
the body of the message.