Hello,

there is something wrong when i run the patched program.
The old version show the following output:

htdig: Run complete
htdig: 1 server seen:
htdig:     www.dlrg.de:80 16530 documents

htdig: Errors to take note of:
[lot of "Not found"- errors... :-/ ]
htmerge: Total word count: 27441
htmerge: Total documents: 0
htmerge: Total doc db size (in K): 0


The new version show:

htdig: Run complete
htdig: 1 server seen:
htdig:     www.dlrg.de:80 2 documents
htmerge: Total word count: 54429
htmerge: Total documents: 1
htmerge: Total doc db size (in K): 5


and /usr/local/htdig/db is:
total 39633
-rw-r--r--   1 root     root     12162048 Oct 21 03:01 db.docdb
-rw-r--r--   1 root     root       216064 Oct 21 03:01 db.docs.index
-rw-r--r--   1 root     root       480256 Oct 19 14:36 db.soundex.db
-rw-rw-r--   1 root     root     13466247 Oct 21 03:01 db.wordlist
-rw-r--r--   1 root     root     14095360 Oct 21 03:01 db.words.db
(I've mailed this mail first time only to Geoff)

With the patch much more searchkeys get hits (, but not all yet). 
but all the "Not found"- errors were gone.
So my question is if I had a configuration-error or if the patch has 
something broken...

my config is:
allow_numbers:          true
database_dir:           /usr/local/htdig/db
exclude_urls:           /cgi-bin/ .cgi
htnotify_sender:        [EMAIL PROTECTED]
keywords_meta_tag_names:        keywords description
limit_urls_to:          ${start_url}
local_default_doc:      index.html
local_urls:             http://www.dlrg.de/=/home/www/
maintainer:             [EMAIL PROTECTED]
max_description_length: 500
max_doc_size:           250000
max_head_length:        80000
pdf_parser:             /usr/X11R6/bin/acroread
start_url:              http://www.dlrg.de/
search_algorithm:       exact:1 synonyms:0.6 soundex:0.4 endings:0.3
substring_max_words:    200
use_meta_description:   true
valid_punktation:       .!?$%&#*+�`"^
<and the settings for the pages>


Reiner
-- 
------------------------------------------------------------------------
Reiner Keller                   e-mail: [EMAIL PROTECTED]
                                WWW   : http://www.cs.TU-Berlin.de/~dlrg
------------------------------------------------------------------------
----------------------------------------------------------------------
To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED] containing the single word "unsubscribe" in
the body of the message.

Reply via email to