Re: Performance of Boolean query with hundreds of OR clauses.

2014-09-07 Thread Yonik Seeley
Solr 4.10 has added a {!terms} query that should speed up these cases.

Benchmarks here:
http://heliosearch.org/solr-terms-query/

-Yonik
http://heliosearch.org - native code faceting, facet functions,
sub-facets, off-heap data
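
For readers who want to try this, here is a minimal sketch of turning a descriptor list into a {!terms} query. It assumes the field name MyImage from the original post; the helper names and descriptor values are made up for illustration. One caveat, as far as I understand the parser: it avoids scoring by default, so it may suit filtering better than ranking images by how many descriptors they share.

```python
from urllib.parse import urlencode


def build_terms_query(field, descriptors):
    """Return a {!terms} query string matching any of the given descriptors."""
    # Drop duplicates while keeping order; the parser splits values on commas by default.
    unique = list(dict.fromkeys(descriptors))
    for d in unique:
        if "," in d:
            raise ValueError(f"descriptor contains the separator: {d!r}")
    return "{!terms f=" + field + "}" + ",".join(unique)


def build_boolean_query(field, descriptors):
    """Return the original-style boolean OR query, for comparison."""
    return field + ":(" + " ".join(descriptors) + ")"


# Hypothetical 6-character descriptors, for illustration only.
descriptors = ["a1b2c3", "d4e5f6", "a1b2c3", "0f9e8d"]
terms_q = build_terms_query("MyImage", descriptors)
bool_q = build_boolean_query("MyImage", descriptors)
print(terms_q)
print(bool_q)

# URL-encoded request parameters, e.g. for the body of a POST to the select handler.
params = urlencode({"q": terms_q, "rows": 10})
print(params)
```

With around 1000 descriptors the request is probably easier to send as a POST body than as a GET URL.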



Performance of Boolean query with hundreds of OR clauses.

2014-08-19 Thread SolrUser1543
I am using Solr to search for similar pictures.

For this purpose, every image is indexed as a set of descriptors (each
descriptor is a 6-character string).
The number of descriptors per image varies, from a few to many thousands.

When I want to search for a similar image, I extract its descriptors
and create a query like:
MyImage:( desc1 desc2 ... desc n )

The number of descriptors in the query also varies. Usually it is about 1000.

Of course, the performance of this query is very bad, and it may take a few
minutes to return.

Any ideas for performance improvement?

P.S. I also tried LIRE, but it does not fit my use case.



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Performance-of-Boolean-query-with-hundreds-of-OR-clauses-tp4153844.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Performance of Boolean query with hundreds of OR clauses.

2014-08-19 Thread Jack Krupansky
A large number of query terms is definitely an anti-pattern and not a
recommended use case for Solr, but I'm a little surprised that it takes
minutes, as opposed to 10 to 20 seconds.

Does your index fit entirely in the OS memory available for file
caching?

IOW, are those few minutes CPU-bound or I/O-bound?

-- Jack Krupansky
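
One rough way to approach the file-caching question is to compare the on-disk size of the index directory with the RAM left over after the JVM heap and other processes. The sketch below is only an approximation under those assumptions: the function names are hypothetical, the reserved amount is something you would have to estimate yourself, and the sysconf call works on Linux and most Unix systems but not on Windows.

```python
import os
import tempfile


def directory_size_bytes(path):
    """Sum the sizes of all regular files under path."""
    total = 0
    for root, _dirs, files in os.walk(path):
        for name in files:
            full = os.path.join(root, name)
            if os.path.isfile(full):
                total += os.path.getsize(full)
    return total


def physical_memory_bytes():
    """Total physical RAM via sysconf (Linux and most Unix systems)."""
    return os.sysconf("SC_PAGE_SIZE") * os.sysconf("SC_PHYS_PAGES")


def index_fits_in_cache(index_bytes, ram_bytes, reserved_bytes):
    """True if the index is no larger than RAM minus the reserved amount."""
    return index_bytes <= ram_bytes - reserved_bytes


# Small self-contained demo on a temporary directory standing in for an index.
with tempfile.TemporaryDirectory() as demo_dir:
    with open(os.path.join(demo_dir, "segment.bin"), "wb") as fh:
        fh.write(b"\0" * 4096)
    demo_size = directory_size_bytes(demo_dir)
print(demo_size)
```

On the Solr host you would pass the real index directory to directory_size_bytes and compare it against physical_memory_bytes() with your own estimate of reserved memory. If the index does not fit, the slow queries are more likely to be waiting on disk than on CPU.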



