Hi,
htdig 3.1.5 is installed on our SuSE 7.0
server, which runs as a guest under the VM operating system.
The indexing has been great up until a couple of
weeks ago.
htdig follows the links on file.html to Word
document files.
It now randomly reports that a certain Word
document cannot be found, but when I click on the link to this document, it
opens up. Almost every day, htdig reports that one or two documents are
not found, and consequently the word database shrinks.
When I run htdig from the command line with the same
options that I have been using since I first put htdig into production,
robots.txt now prevents indexing because the directories are
disallowed:
/opt/www/htdig/bin/htdig -u username:password -s -c /opt/www/htdig/conf/htdig.conf
I understand the concept behind the robots.txt
file, but why does it now block indexing when it allowed access
before? Nothing has changed in the robots.txt file or on the web server.
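To check the robots.txt rules independently of htdig, something like the following minimal sketch could be used. It relies on Python's standard `urllib.robotparser`, which may not match htdig's own parsing exactly. The sample rules, the host `example.com`, the paths, and the agent name `htdig` are all assumptions for illustration and are not taken from our server.

```python
from urllib.robotparser import RobotFileParser


def is_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if robots_txt permits user_agent to fetch url."""
    parser = RobotFileParser()
    # parse() takes an iterable of lines, so no network access is needed.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


# Hypothetical rules: every robot is barred from /docs/.
SAMPLE_ROBOTS = """\
User-agent: *
Disallow: /docs/
"""

if __name__ == "__main__":
    # Hypothetical URLs; substitute the real links found on file.html.
    for url in ("http://example.com/docs/report.doc",
                "http://example.com/file.html"):
        print(url, is_allowed(SAMPLE_ROBOTS, "htdig", url))
```

Pasting in the real robots.txt contents and the URLs of the documents reported as missing would show whether the rules themselves disallow them, or whether the problem lies elsewhere.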
Regards,
Darren

