I am trying to determine if I have an incorrect configuration or if there is a
bug. When I search for an exact phrase, it will return some matches that don't
have the exact match, such as searching for "htdig exclude" on the
http://www.htdig.org site. It will return 6 matches, only 2 have exact matches
in them, the others are two ChangeLog files and two FAQ files. Any ideas?
I have tried this on Solaris 8, with htdig-3.2.0b4-20020714,
htdig-3.2.0b4-20020707, and htdig-3.2.0b3. Also on Solaris 2.6 with htdig-3.2.0b3.
rundig -vvv -s
ht://dig Start Time: Thu Jul 18 13:13:49 2002
0:1:http://www.htdig.org/THANKS.html
New server: www.htdig.org, 80
- Persistent connections: enabled
- HEAD before GET: disabled
- Timeout: 30
- Connection space: 0
- Max Documents: -1
- TCP retries: 1
- TCP wait time: 5
htdig.conf:
database_dir: /search/cassini/db
start_url: http://www.htdig.org/
limit_urls_to: ${start_url}
exclude_urls: /cgi-bin/ .cgi
bad_extensions: .wav .gz .z .sit .au .zip .tar .hqx .exe .com .gif \
.jpg .jpeg .aiff .class .map .ram .tgz .bin .rpm .mpg .mov .avi .css
maintainer: [EMAIL PROTECTED]
max_head_length: 10000
max_doc_size: 200000
no_excerpt_show_top: true
search_algorithm: exact:1
--
Rob Kremer
JPL Cassini SA
818-393-1283 Fax: 393-4658
Office 230-311 M/S 230-310
--
-------------------------------------------------------
This sf.net email is sponsored by:ThinkGeek
Welcome to geek heaven.
http://thinkgeek.com/sf
_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html