Re: Out-of-order distinct

Grant Ingersoll Wed, 14 Jun 2006 16:17:28 -0700

You could implement your own HitCollector interface and remove lower-scoring duplicates as you come across them by using a Map or something to keep track as you go.
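
A rough sketch of that idea, written as a standalone class so it runs without Lucene on the classpath. The collect(int doc, float score) method mirrors the shape of HitCollector.collect; in real code the class would extend org.apache.lucene.search.HitCollector. The names DistinctCollector, Hit, keyOf and topDocs are made up for illustration, and keyOf stands in for however you look up the de-duplication field for a document, which is an assumption here:

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.function.IntFunction;

/** Keeps only the best-scoring hit for each distinct key (illustrative sketch). */
class DistinctCollector {
    /** One retained hit: document id plus its score. */
    static final class Hit {
        final int doc;
        final float score;
        Hit(int doc, float score) { this.doc = doc; this.score = score; }
    }

    // Hypothetical lookup: maps a document id to the value to de-duplicate on.
    private final IntFunction<String> keyOf;
    // Best hit seen so far for each key.
    private final Map<String, Hit> best = new HashMap<>();

    DistinctCollector(IntFunction<String> keyOf) { this.keyOf = keyOf; }

    /** Same shape as HitCollector.collect(int doc, float score). */
    public void collect(int doc, float score) {
        String key = keyOf.apply(doc);
        Hit seen = best.get(key);
        // Replace only on a strictly higher score, so ties keep the earlier hit.
        if (seen == null || score > seen.score) {
            best.put(key, new Hit(doc, score));
        }
    }

    /** Surviving document ids, highest score first (ties broken by lower doc id). */
    public List<Integer> topDocs() {
        List<Hit> hits = new ArrayList<>(best.values());
        hits.sort((a, b) -> {
            int byScore = Float.compare(b.score, a.score);
            return byScore != 0 ? byScore : Integer.compare(a.doc, b.doc);
        });
        List<Integer> docs = new ArrayList<>();
        for (Hit h : hits) {
            docs.add(h.doc);
        }
        return docs;
    }
}
```

Fed the five hits from the question below, with scores that decrease down the list, this would keep documents 1, 2 and 5. Note that the collector sees hits in whatever order the searcher visits them, not in rank order, so the sort by score has to happen at the end.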


Ken Kinder wrote:

I've poked around on Google and the archives quite a bit, but I can't
find exactly what I need. Say I have a query that would normally
return a set of documents:


1 002 (text...)
2 001 (text...)
3 001 (text...)
4 002 (text...)
5 004 (text...)

I'd like that modified to be:

1 002 (text...)
2 001 (text...)
5 004 (text...)

So the ordering is the same, but I only want the first occurrence of
each value (the first 001, the first 002) in the result set -- skip
all the rest.

Does this make sense? Is there a way to do it?

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

--

Grant Ingersoll
Sr. Software Engineer
Center for Natural Language Processing
Syracuse University
School of Information Studies
335 Hinds Hall
Syracuse, NY 13244
http://www.cnlp.org
Voice: 315-443-5484
Fax: 315-443-6886


