On Tue, 25 Sep 2001 [EMAIL PROTECTED] wrote:
> I know the plan is to allow phrase searching in 3.2 (which I *still*
> contend should be called 4.0)
Perhaps so, but there was previously a "version 4.0" and this is not the
same thing. Besides, why should version numbers increase so much? Should
we call this ht://Dig 2002? Nah.
> my search log analyses, I find that most of the multiword searches
> are for reasonable phrases, and that pages matching those phrases
I'm assuming you're talking about some sort of proximity ranking. In other
words, if you performed a regular query and the queried words fell close
together on the page, it would score higher.
Yes, this is certainly being considered. The catch is coming up with a way
to score it quickly. Mathematically, it seems you want to compute something
like the minimum distance spanning all the query words on a page, and doing
that for every matching document looks a bit costly. If you know of
references on computing this proximity quickly, I'd be interested to read
them.
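
To make the "minimum distance" idea concrete, here is a rough sketch in
Python. It is not ht://Dig code: the function name min_span and the input
layout (one sorted list of word offsets per query term) are assumptions
made only for illustration. It walks the position lists with a heap and
keeps the smallest window that contains one occurrence of every term.

```python
import heapq

def min_span(position_lists):
    """Smallest window (in word offsets) holding one hit from each list.

    position_lists: one sorted list of word offsets per query term
    (hypothetical input layout). Returns None if a term has no hits.
    """
    if not position_lists or any(not p for p in position_lists):
        return None
    # Each heap entry is (offset, term index, index into that term's list).
    heap = [(plist[0], t, 0) for t, plist in enumerate(position_lists)]
    heapq.heapify(heap)
    current_max = max(plist[0] for plist in position_lists)
    best = current_max - heap[0][0]
    while True:
        pos, t, i = heapq.heappop(heap)
        # Window runs from the smallest offset to the largest one in play.
        best = min(best, current_max - pos)
        if i + 1 == len(position_lists[t]):
            # The term with the smallest offset is exhausted, so no
            # later window can still contain every term.
            return best
        nxt = position_lists[t][i + 1]
        current_max = max(current_max, nxt)
        heapq.heappush(heap, (nxt, t, i + 1))

# Made-up offsets for three query terms on one page.
print(min_span([[3, 40], [4, 90], [5, 41]]))
```

With N offsets in total across k query terms, this does at most N heap
operations of O(log k) each, so the cost per page is O(N log k). Whether
that is fast enough at search time is exactly the open question above.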
--
-Geoff Hutchison
Williams Students Online
http://wso.williams.edu/