On Tue, 25 Sep 2001 [EMAIL PROTECTED] wrote:

> I know the plan is to allow phrase searching in 3.2 (which I *still* 
> contend should be called 4.0)

Perhaps so, but there was previously a "version 4.0" and this is not the
same thing. Besides, why should version numbers increase so much? Should
we call this ht://Dig 2002? Nah.

> my search log analyses, I find that most of the multiword searches 
> are for reasonable phrases, and that pages matching those phrases 

I'm assuming you're talking about some sort of proximity ranking. In other
words, if you performed a regular query and the queried words fell close
together on a page, that page would score higher.

Yes, this has certainly been considered. The catch is coming up with a way
to score it quickly. Mathematically, it seems you want to compute something
like the minimum distance spanning all the words in the query, but that
looks a bit costly. If you know of references on computing this kind of
proximity quickly, I'd be interested to read them.
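
To make the quantity concrete, here is a rough sketch of one way to compute
it. This is not ht://Dig code. It assumes the index can hand back, for each
query word, a sorted list of that word's positions in a document; the name
min_span and the input layout are made up for illustration. It walks the
lists with a heap and returns the width of the smallest window containing
every query word at least once, in roughly O(N log k) time for N positions
across k words.

```python
import heapq

def min_span(position_lists):
    """Width of the smallest window holding one position from every list.

    position_lists: one sorted list of word positions per query word
    (a hypothetical layout, not ht://Dig's actual index format).
    Returns None if any query word is missing from the document.
    """
    if not position_lists or any(not plist for plist in position_lists):
        return None

    # Heap entries are (position, list index, offset into that list).
    heap = [(plist[0], i, 0) for i, plist in enumerate(position_lists)]
    heapq.heapify(heap)
    current_max = max(plist[0] for plist in position_lists)
    best = current_max - heap[0][0]

    while True:
        # The heap minimum and current_max bound a window covering all words.
        pos, i, j = heapq.heappop(heap)
        best = min(best, current_max - pos)
        if j + 1 == len(position_lists[i]):
            # This word has no later occurrence, so no further window exists.
            return best
        nxt = position_lists[i][j + 1]
        current_max = max(current_max, nxt)
        heapq.heappush(heap, (nxt, i, j + 1))
```

A smaller span could then feed a ranking bonus, for example by adding
something proportional to 1 / (1 + span) to the document's score. The
choice of bonus is a separate assumption and would need tuning.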

--
-Geoff Hutchison
Williams Students Online
http://wso.williams.edu/



_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a 
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html