Erik Hatcher wrote:
Yeah, the Nutch code is highly intertwined with its unique configuration
infrastructure and makes it hard to pull pieces of it out like this.
This is a critique that has been heard a lot (mainly because its true :)
It would be really cool if different camps of lucene could
Very nice. This will help me also. I will try this out and let you know how
it goes. (Windows XP with a custom request handler and some other custom
classes)
Hi -
Recap:
I'd like to be able to limit the number of documents returned from
any particular group of documents, much as Google only shows a max of
two results from any one website.
The docs are all marked as to which group they belong to. There will
probably be multiple
On 11/15/06, Tom [EMAIL PROTECTED] wrote:
It looks like that for trimming, the places I want to modify are in
ScorePriorityQueue and FieldSortedHitQueue. When trimming, I want to
return the top item in the group that matches, whether by score or
sort, not just the first one that goes through the
At 01:35 PM 11/15/2006, you wrote:
On 11/15/06, Tom [EMAIL PROTECTED] wrote:
It looks like that for trimming, the places I want to modify are in
ScorePriorityQueue and FieldSortedHitQueue. When trimming, I want to
return the top item in the group that matches, whether by score or
sort, not just
On 11/15/06, Tom [EMAIL PROTECTED] wrote:
One hack: implement a simple query that delegates to another query and
encapsulates the trim value... that way hashCode/equals won't match
unless the trim does.
Not sure what you mean by delegates to another query. Could you
clarify or give me a
One other thing you'll need to watch out for is the filterCache ... Solr
has a setting (i forget the name at the moment) which tells the
SolrIndexSearcher that for sorted queries, it can reuse the DocSet from a
previous invocation of the Query and sort the cached DocSet to generate
the list --
: Yeah, the Nutch code is highly intertwined with its unique configuration
: infrastructure and makes it hard to pull pieces of it out like this.
that CacheGrams inner Filter classe seemed like it could be extracted
easily enough.
: This is a critique that has been heard a lot (mainly because