Hello,

The site I want to index with htdig has some very important words
containing a '-' sign. One of this word is 'nmr-check' which will I will
use as an example search:

If '-' is included in valid_punctuation a search for 'nmr-check' or
'nmrcheck' results  in three hits. But none excerpt is displayed though
max_head_length is set to 50000 and all pages containing 'nmr-check' are
smaller than 50kB.

If '-' is not included in valid_punctuation searching for 'nmrcheck'
gives the same result as mentioned before. Searching for 'nmr-check'
(using method=and) causes htsearch to display the three correct hits
including the excerpt. But in addition some 'wrong' results occur,
because htsearch interprets 'nmr-check' in this case as 'nmr and (check
or checked or checking or checker or checks or checkers)' what is
clearly not what the user is searching for.

Unfortunately my site is not publicly accessible, but I think the same
problem arises if you are searching e.g. for 'allow_numbers' at
www.htdig.org.

Any help or comment would be very appreciated.

Matthias
-- 
___________________________________________________________________
Matthias Waffenschmidt     E-Mail: [EMAIL PROTECTED]
NMR Software Department
Bruker Analytik GmbH
Rudolf-Plank-Str. 23                          Tel: +49-7243-504-483
D-76275 Ettlingen                             Fax: +49-7243-504-480
----------------------------------------------------------------------
To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED] containing the single word "unsubscribe" in
the body of the message.

Reply via email to