I've created a new dB directory and config file.
My server's directory tree consists of about 30 sub-directories.
The main directory uses index.html
Each sub-directory uses either index.cgi, or index.html
If it uses index.cgi, it ultimately points to main.html
I can't for the life of me get Htdig to dig though the entire tree.
These are the pertinent parts of my config file:
database_dir: /www/htdig/db_com
start_url: http://com.sb.usps.org/
limit_urls_to: com.sb.usps.org
exclude_urls: /cgi-bin/
max_head_length: 60000
search_algorithm: exact:1 synonyms:0.5 endings:0.1 soundex:0.5
metaphone:0.5
maintainer: [EMAIL PROTECTED]
allow_numbers: true
allow_virtual_hosts: true
local_urls: http://com.sb.usps.org=/nat/
create_url_list: true
url_list: /www/htdig/url_list_com
timeout: 42
Any simple ideas (I have several more url's that need similar treatment.
----------------------------------------------------------------------
To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED] containing the single word "unsubscribe" in
the body of the message.