I personally use SpanNearQuey (span positions are always needed), and for
different fields I use FieldMaskingSpanQuery class.
I just choose one field name and then mask each SpanTermQuery's real field name
with this field via wrapper.
Maybe it can help.
--
Igor
03.04.2013, 06:59, Paul
Hello!
I have a ~20GB index and try to make a concurrent search over it.
The index has 16 segments, I run SpanQuery.getSpans() on each segment
concurrently.
I see really small performance improvement of searching concurrently. I
suppose, the reason is that the sizes of the segments are very
On Tue, Apr 2, 2013 at 2:29 PM, Igor Shalyminov
ishalymi...@yandex-team.ru wrote:
Hello!
Hi Igor,
I have a ~20GB index and try to make a concurrent search over it.
The index has 16 segments, I run SpanQuery.getSpans() on each segment
concurrently.
I see really small performance improvement
Yes, the number of documents is not too large (about 90 000), but the queries
are very hard. Although they're just boolean, a typical query can produce a
result with tens of millions of hits.
Single-threadedly such a query runs ~20 seconds, which is too slow. therefore,
multithreading is vital
On Tue, Apr 2, 2013 at 4:39 PM, Igor Shalyminov
ishalymi...@yandex-team.ru wrote:
Yes, the number of documents is not too large (about 90 000), but the queries
are very hard. Although they're just boolean, a typical query can produce a
result with tens of millions of hits.
How can there be
These are not document hits but text hits (to be more specific, spans).
For the search result it is necessary to have the precise number of document
and text hits and a relatively small number of matched text snippets.
I've tried several approaches to optimize the search algorithm but they
-Original Message-
From: Igor Shalyminov [mailto:ishalymi...@yandex-team.ru]
Sent: Tuesday, April 02, 2013 9:58 PM
To: java-user@lucene.apache.org
Subject: Re: How to use concurrency efficiently
These are not document hits but text hits (to be more specific, spans).
For the search result
Hi,
I've experimented a bit with MultiFieldQueryParser
(http://lucene.apache.org/core/4_2_0/queryparser/org/apache/lucene/queryparser/classic/MultiFieldQueryParser.html)
But it seems to search for each of a query's terms in each field specified in
the constructor. So, as the doc says, if you