It looks like field collapsing may be the key:
http://issues.apache.org/jira/browse/SOLR-236

But it also doesn't seem to be 'finalized' yet. I wonder how
performant it is with indexes of 50 million documents+?

On Thu, Jul 9, 2009 at 9:42 PM, shb<suh...@gmail.com> wrote:
> you can refer to the facet search of solr, that might help you.
>
> 2009/7/10 Bradford Stephens <bradfordsteph...@gmail.com>
>
>> Greetings,
>>
>> We've been experimenting with grouping fields returned from document
>> search results in Lucene, and we haven't gotten anything very
>> encouraging. Basically, the more results we return, the longer it
>> takes -- tens of seconds. Probably because we're doing expensive disks
>> seeks. I'm hoping the SOLR crew out there may provide some insight :)
>>
>> What we're trying to do is similar to SQL's "GROUP BY".  Let's say we
>> have documents indexed by keyword for a content body, and also indexed
>> by an Author name. If I search our document store (very large) for the
>> word "laptop", I would like to be able to calculate the 10 authors
>> that appeared the most.
>>
>> I've done some searching through the mailing list, but couldn't glean
>> much insight. What do you think?
>>
>> --
>> http://www.roadtofailure.com -- The Fringes of Scalability, Social
>> Media, and Computer Science
>>
>



-- 
http://www.roadtofailure.com -- The Fringes of Scalability, Social
Media, and Computer Science

Reply via email to