Hi,
 
Htdig 3.1.5 is installed on our SuSE 7.0 Server running as a guest under VM operating system.
The indexing has been great up until a couple of weeks ago.
Htdig follows the links to word document files on file.html.
It now randomly says that a certain word document cannot be found, but when I click on the link to this document, it opens up.  Usually everyday, htdig reports that one or two documents are not found and consequently the word database shrinks.
When I run htdig from commandline with same options that I have been using since I first put htdig into production, the robots text file prevents the index b/c the directories are disallowed.
 
/opt/www/htdig/bin/htdig -u username:password -s -c /opt/www/htdig/conf/htdig.conf
 
I understand the concept behind the robots.txt file, but why now does it not allow indexing when before it allowed access.  Nothing has changed in the robots.txt file or with the web server.
 
Regards,
Darren
 

Reply via email to