Hello,
there is something wrong when i run the patched program.
The old version show the following output:
htdig: Run complete
htdig: 1 server seen:
htdig: www.dlrg.de:80 16530 documents
htdig: Errors to take note of:
[lot of "Not found"- errors... :-/ ]
htmerge: Total word count: 27441
htmerge: Total documents: 0
htmerge: Total doc db size (in K): 0
The new version show:
htdig: Run complete
htdig: 1 server seen:
htdig: www.dlrg.de:80 2 documents
htmerge: Total word count: 54429
htmerge: Total documents: 1
htmerge: Total doc db size (in K): 5
and /usr/local/htdig/db is:
total 39633
-rw-r--r-- 1 root root 12162048 Oct 21 03:01 db.docdb
-rw-r--r-- 1 root root 216064 Oct 21 03:01 db.docs.index
-rw-r--r-- 1 root root 480256 Oct 19 14:36 db.soundex.db
-rw-rw-r-- 1 root root 13466247 Oct 21 03:01 db.wordlist
-rw-r--r-- 1 root root 14095360 Oct 21 03:01 db.words.db
(I've mailed this mail first time only to Geoff)
With the patch much more searchkeys get hits (, but not all yet).
but all the "Not found"- errors were gone.
So my question is if I had a configuration-error or if the patch has
something broken...
my config is:
allow_numbers: true
database_dir: /usr/local/htdig/db
exclude_urls: /cgi-bin/ .cgi
htnotify_sender: [EMAIL PROTECTED]
keywords_meta_tag_names: keywords description
limit_urls_to: ${start_url}
local_default_doc: index.html
local_urls: http://www.dlrg.de/=/home/www/
maintainer: [EMAIL PROTECTED]
max_description_length: 500
max_doc_size: 250000
max_head_length: 80000
pdf_parser: /usr/X11R6/bin/acroread
start_url: http://www.dlrg.de/
search_algorithm: exact:1 synonyms:0.6 soundex:0.4 endings:0.3
substring_max_words: 200
use_meta_description: true
valid_punktation: .!?$%&#*+�`"^
<and the settings for the pages>
Reiner
--
------------------------------------------------------------------------
Reiner Keller e-mail: [EMAIL PROTECTED]
WWW : http://www.cs.TU-Berlin.de/~dlrg
------------------------------------------------------------------------
----------------------------------------------------------------------
To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED] containing the single word "unsubscribe" in
the body of the message.