According to Dirk Kutsche:
> I have two problems with htdig, running on our SuSe-Linux-Server:
>
> 1. an example:
>
> Search results for '(camping or camp or camped or camper or camps or
> campers)'
>
> Match: ALL Format: LONG
> Refine search: camping
>
> Documents 1 - 6 of 6 matches. More 's indicate a better match.
>
> Pages:
>
> ht://Dig 3.1.0b2
>
> There are 6 matches - but no page is shown. What's wrong?
This is usually an indication of a corrupted database. If it's finding
matches, it's because it found the matching words in db.words.db.
However, it isn't finding the document records themselves in db.docdb,
which would suggest that either db.docdb, or db.docs.index (which maps
document IDs used in db.words.db to URLs used to look up records in
db.docdb), is messed up. You'll likely need to rebuild your database
from scratch.
There have been a LOT of bug fixes since 3.1.0b2 (which was released last
October). I'd recommend upgrading to 3.1.2 (released this April). You'll
need to rebuild your database anyway, so this might be the best time to
do it. 3.1.2 is much more stable than last year's beta releases, so you
should have less problems with database corruption.
> 2. I still have no experience how to search pdf-files. Is there a how-to
> somewhere what I have to do under the Linux-version?
No, there's no how-to, but there are a few FAQ entries:
http://www.htdig.org/FAQ.html#q5.2
http://www.htdig.org/FAQ.html#q4.9
In general, it comes down to a choice between using acroread (Adobe
Acrobat Reader 3 or 4), or using parse_doc.pl (with pdftotext from
the xpdf 0.80 package). The parse_doc.pl script seems to be faster for
small PDFs, but bogs down for larger documents.
If you use acroread, see
http://www.htdig.org/attrs.html#pdf_parser
and if you decide to use parse_doc.pl, see
http://www.htdig.org/attrs.html#external_parsers
as well as FAQ 4.9.
--
Gilles R. Detillieux E-mail: <[EMAIL PROTECTED]>
Spinal Cord Research Centre WWW: http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba Phone: (204)789-3766
Winnipeg, MB R3E 3J7 (Canada) Fax: (204)789-3930
------------------------------------
To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED] containing the single word "unsubscribe" in
the SUBJECT of the message.