Chris Hostetter wrote:
(Assuming *I* understand it) what he's talking baout, is the ability for
his search GUI to display suggested phrase searches you may want to try
which consist of the words you just typed in grouped into phrases.

Yes, that's precisely what I am talking about. Sorry for being unclear.


Presumably, if multiple phrases in the source data can be found in the
permutations of hte search words, the least common are the ones you'd want
to sugggest -- which makes the problem a sort of SIP problem (ie: given an
extremely limited set of words, find the Statistically imporbably phrases
in the corpus made using only subsets of those words)

I'd already be happy to get *any* phrases :-)

If the phrases could be ranked, I might prefer to pick the *most frequent* phrases. For example:

  anopheles anopheles malaria

("anopheles anopheles" is the latin name for the common mosquito)

I'd like to be able to suggest quoting this name to eliminate all the other mosquito species that also contain "anopheles" in their name.

There are lots of documents with "anopheles anopheles". There may also be a document or two where "anopheles" happens to appear next to "malaria", but these are less interesting here.

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to