Re: Improving search performance

Jason Rutherglen Thu, 22 May 2008 14:41:06 -0700

Query time boosting has no bottlenecks.  Storing will not affect
performance.  You will probably want to use PrefixFilter and
ConstantScoreRangeQuery.  Solr has ConstantScorePrefixQuery.  Simply means
if the document contains the term, the result will show, the scoring will
not be quite the same because there is one score no matter how many times
the term is in a document.


On Thu, May 22, 2008 at 1:57 PM, Rakesh Shete <[EMAIL PROTECTED]>
wrote:

>
> Hi Otis,
>
> Thanks for the quick response.
>
> Yes my application requirements mandate me to perform wildcard and range
> matches. I can't avoid it.
>
> HibernateSearch(HS) uses a kind of IndexReader (HS specific) for optimizing
> querying over indexes. I'll explore on the idea of using a pool of
> searchers.
>
> However, should just indexing the data for each fields without storing them
> improve performance? My understanding is that this will reduce the size of
> the index and hence improve performance.
>
> Also does query time boosting has any performance bottlenecks?
>
> Like: +(t:boss*^10 d:boss*^5 dd:boss*^3 tg:boss*^2) +st:act +ntid:0 +cid:1
> +dr:[20080410 TO 20081010] +rT:[002 TO 005]
>
> Thanks,
> Rakesh S
>
>
> > Date: Thu, 22 May 2008 10:22:27 -0700
> > From: [EMAIL PROTECTED]
> > Subject: Re: Improving search performance
> > To: [email protected]
> >
> > Some quick feedback.  Those are all very expensive queries (wildcards and
> ranges).  The first thing I'd do is try without Hibernate Search (to make
> sure HS is not the bottleneck).  100 threads is a lot, I'm guessing you are
> reusing your searcher, which is good, but you will actually improve
> performance a bit if you work with a small pool of searchers instead of a
> single searcher.
> >
> > Otis
> > --
> > Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch
> >
> >
> > ----- Original Message ----
> > > From: Rakesh Shete <[EMAIL PROTECTED]>
> > > To: [EMAIL PROTECTED]; [email protected]
> > > Sent: Thursday, May 22, 2008 1:16:13 PM
> > > Subject: Improving search performance
> > >
> > >
> > > Hi all,
> > >
> > > I have index of size 85MB. My query looks as follows:
> > >
> > > +(t:boss* d:boss* dd:boss* tg:boss*) +st:act +ntid:0 +cid:1
> +dr:[20080410 TO
> > > 20081010] +rT:[002 TO 005]
> > >
> > > All the fields used in the query are stored in the indexes (Indexed &
> Stored)
> > >
> > > The query response time for me is around 30 seconds when running
> mutliple
> > > simultanoeous threads (~100). The no. of matches is ~30k but I retrieve
> only the
> > > top 100 results. I am using Hibernate Search which is a wrapper around
> Lucene. I
> > > retrieve the "id" filed from the index which is also indexex and
> stored.
> > >
> > > What is the approach that I should take for improving the performance?
> > >
> > > Will just indexing the values without storing them work (Index &
> UnStored)?
> > >
> > > My machine configuration is:
> > > P4 2.66GHz 1.99 GB RAM
> > >
> > > The code for searching runs in JBoss application server which has a
> maximum heap
> > > size of 1024MB. When these 100 threads are running in the application
> server the
> > > CPU utilization is 100% and JBoss consumes all of the heap size.
> > >
> > > Any pointers on index optimization would be really appreciated.
> > >
> > > --Regards,
> > > Rakesh Shete
> > >
> > > _________________________________________________________________
> > > No Harvard, No Oxford. We are here. Find out !!
> > > http://ss1.richmedia.in/recurl.asp?pid=500
> >
> >
> > ---------------------------------------------------------------------
> > To unsubscribe, e-mail: [EMAIL PROTECTED]
> > For additional commands, e-mail: [EMAIL PROTECTED]
> >
>
> _________________________________________________________________
> Watch hottest Bollywood videos, clips, movie tailors, star interviews,
> songs and more on MSN videos.
> http://video.msn.com/?mkt=en-in
>

Re: Improving search performance

Reply via email to