On Jul 11, 2008, at 1:13 PM, xin liu wrote:
I have individual index files for Audio, Image and PDF files. We build common meta fields for these different data types. When I search for a string, I want the search to return mixed search results from these 3 different index based on relevancy. So I use ParallelMultiSearcher class to do the search. But I also wants to know individual hit count for each individual index type. For example, I want to get:
   Mixed together total hit count: 103, with the first 10 HitItem.
   Total hit in Audio: 73
   Total hit in Image: 17
   Total hit in PDF: 13

Right now, I'm doing the following way:
1. Gets one Searchable instance for Audio, one for Image, and one for PDF index; 2. construct ParallelMultiSearcher s with above 3 Searchable as parameters; call its search to get total hit count and first 10 hit items;
3. Call Audio searchable to get total hit count in Audio;
4. Call Image searchable to get total hit count for Image;
5. Call PDF searchable to get total hit count for Image.

So, Lucene will need do 6 search operations for these 3 index. Definitely, the performance will be an issue.

Any better solution for this? Thanks!

Solr - <http://lucene.apache.org/solr> - features faceting along the lines of your needs.

However, Solr does not currently support ParallelMultiSearcher, but it does support distributed searching across sharded Solr instances. Under the covers of Solr is simply a Lucene index. There's no reason Solr couldn't be enhanced to support ParallelMultiSearcher, I don't think, but right now it only uses a single file based IndexSearcher.

        Erik


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to