Won't this just reject the whole URL?  If I understand the problem, Uta
wants to throw out the session parameter but leave the rest of the URL
intact, so that the page is fetched only once.  I've come across this
issue before, and I've got a patch for 3.2.x (which I ported from our
3.1.6 version), but I think you could use url_rewrite_rules too.
My patch is somewhat more specific than url_rewrite_rules, in that it
just removes unwanted parameters and their values from the URL.  I'll
post it in a few days, along with my patch for ignoring the alt text
from images (which is driven by a config option).  I'm currently
experimenting with tweaking the scoring to make an alternate 'or'
behaviour which scores up results that contain more than one search
term.
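
To illustrate the parameter-stripping idea mentioned above, here is a
minimal sketch in Python.  It is not the patch itself (which is for the
ht://Dig sources); the function name strip_params, the parameter name
"sessionid" and the example URL are all assumptions made up for
illustration.

```python
from urllib.parse import urlsplit, urlunsplit, parse_qsl, urlencode

def strip_params(url, unwanted):
    """Return url with every query parameter named in `unwanted` removed.

    Hypothetical helper: only the named parameters (and their values)
    are dropped; the rest of the URL is left as it was.
    """
    parts = urlsplit(url)
    # Keep parameters in their original order, including blank values.
    kept = [(name, value)
            for name, value in parse_qsl(parts.query, keep_blank_values=True)
            if name not in unwanted]
    return urlunsplit((parts.scheme, parts.netloc, parts.path,
                       urlencode(kept), parts.fragment))

# Two visits to the same page with different (made-up) session ids
# normalise to a single URL, so the page would be fetched only once.
a = strip_params("http://www.example.edu/page.php?id=7&sessionid=abc123", {"sessionid"})
b = strip_params("http://www.example.edu/page.php?id=7&sessionid=xyz789", {"sessionid"})
print(a, a == b)
```

The difference from rejecting the URL outright is that the page is
still indexed, just under one canonical address.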

Short explanation by way of example: say I'm indexing a university
website.  The chemistry department has a whole bunch of pages with the
word 'chemistry' all through them, and one page telling students where
to buy replacement lab glassware.  An 'and' search for 'chemistry
glassware' finds this page and nothing else; an 'and' search for
'chemistry glassware sales' finds nothing.  An 'or' search for
'chemistry glassware sales' is swamped by the occurrences of
'chemistry', and 'glassware' is lost in the noise.  What I'm doing is
factoring in the number of distinct words from the search phrase found
in each result, to bump up the score for pages with more than one
search term.  Initial results look quite promising, but it will need a
bit of tuning to improve search speed.
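
A rough sketch of the scoring idea in Python, for anyone who wants to
play with it.  This is only an illustration under stated assumptions:
the plain 'or' score is taken to be a simple count of matching words,
and the multiplicative boost (and its value of 4.0) is a made-up
formula, not what the actual code does.

```python
from collections import Counter

def or_score(page_words, query_terms):
    """Plain 'or' score: total occurrences of any query term (assumed)."""
    counts = Counter(page_words)
    return sum(counts[term] for term in query_terms)

def boosted_score(page_words, query_terms, boost=4.0):
    """'or' score multiplied by a boost for each extra distinct term found.

    The formula base * boost ** (distinct - 1) is a hypothetical choice.
    """
    counts = Counter(page_words)
    base = sum(counts[term] for term in query_terms)
    distinct = sum(1 for term in set(query_terms) if counts[term] > 0)
    if distinct == 0:
        return 0.0
    return base * boost ** (distinct - 1)

query = ["chemistry", "glassware", "sales"]
general_page = ["chemistry"] * 6 + ["department"]    # lots of 'chemistry'
glassware_page = ["chemistry", "glassware", "lab"]   # two distinct terms

# Plain 'or' ranks the general page first; the boost reverses that.
print(or_score(general_page, query), or_score(glassware_page, query))
print(boosted_score(general_page, query), boosted_score(glassware_page, query))
```

With these toy pages the glassware page only overtakes the general one
once the number of distinct matching terms is factored in.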


Jamie Anstice
Search Engineer
S.L.I. Systems
[EMAIL PROTECTED]
ph:  64 961 3262
mobile: 64 21 264 9347




Geoff Hutchison <[EMAIL PROTECTED]>
Sent by: [EMAIL PROTECTED]
26/10/01 01:49

 
        To:     "Uta Becht" <[EMAIL PROTECTED]>
        cc:     "htdig" <[EMAIL PROTECTED]>
        Subject:        Re: [htdig-dev] Problemes with bad query-string


At 11:12 AM +0200 10/25/01, Uta Becht wrote:
>Can someone give me an idea at which position of htdig I should 
>eliminate this bad query_parameter ??

Why not use bad_querystr:

<http://www.htdig.org/attrs.html#bad_querystr>

-- 
--
-Geoff Hutchison
Williams Students Online
http://wso.williams.edu/

_______________________________________________
htdig-dev mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/htdig-dev



