1- SOLR 4.6
2- We do but right now I am talking about plain keyword queries just sorted
by date. Once this is better will start looking into caches which we
already changed a little.
3- As I said the contents are not stored in this index. Some other metadata
fields are but with normal queries its super fast so I guess even if I
change there it will be a minor difference. We have SSD and quite fast too.
4- That's something we need to do but even in low workload those queries
take a lot of time
5- Every 10 mins and currently no auto warming as user queries are rarely
same and also once its fully warmed those queries are still slow.
6- Nops.

On Thu, Mar 13, 2014 at 5:38 PM, Dmitry Kan <solrexp...@gmail.com> wrote:

> 1. What is your solr version? In 4.x family the proximity searches have
> been optimized among other query types.
> 2. Do you use the filter queries? What is the situation with the cache
> utilization ratios? Optimize (= i.e. bump up the respective cache sizes) if
> you have low hitratios and many evictions.
> 3. Can you avoid storing some fields and only index them? When the field is
> stored and it is retrieved in the result, there are couple of disk seeks
> per field=> search slows down. Consider SSD disks.
> 4. Do you monitor your system in terms of RAM / cache stats / GC? Do you
> observe STW GC pauses?
> 5. How often do you commit & do you have the autowarming / external warming
> configured?
> 6. If you use faceting, consider storing DocValues for facet fields.
>
> some solr wiki docs:
>
> https://wiki.apache.org/solr/SolrPerformanceProblems?highlight=%28%28SolrPerformanceFactors%29%29
>
>
>
>
>
> On Thu, Mar 13, 2014 at 8:52 AM, Salman Akram <
> salman.ak...@northbaysolutions.net> wrote:
>
> > Well some of the searches take minutes.
> >
> > Below are some stats about this particular index that I am talking about:
> >
> > Index size = 400GB (Using CommonGrams so without that the index is around
> > 180GB)
> > Position File = 280GB
> > Total Docs = 170 million (just indexed for searching - for highlighting
> > contents are stored in another index)
> > Avg Doc Size = Few hundred KBs
> > RAM = 384GB (it has other indexes too but still OS cache can have 60-80%
> of
> > the total index cached)
> >
> > Phrase queries run pretty fast with CG but complex versions of wildcard
> and
> > proximity queries can be really slow. I know using CG will make them slow
> > but they just take too long. By default sorting is on date but users have
> > few other parameters too on which they can sort.
> >
> > I wanted to avoid creating multiple indexes (maybe based on years) but
> > seems that to search on partial data that's the only feasible way.
> >
> >
> >
> >
> > On Wed, Mar 12, 2014 at 2:47 PM, Dmitry Kan <solrexp...@gmail.com>
> wrote:
> >
> > > As Hoss pointed out above, different projects have different
> > requirements.
> > > Some want to sort by date of ingestion reverse, which means that having
> > > posting lists organized in a reverse order with the early termination
> is
> > > the way to go (no such feature in Solr directly). Some other projects
> > want
> > > to collect all docs matching a query, and then sort by rank, but you
> > cannot
> > > guarantee, that the most recently inserted document is the most
> relevant
> > in
> > > terms of your ranking.
> > >
> > >
> > > Do your current searches take too long?
> > >
> > >
> > > On Tue, Mar 11, 2014 at 11:51 AM, Salman Akram <
> > > salman.ak...@northbaysolutions.net> wrote:
> > >
> > > > Its a long video and I will definitely go through it but it seems
> this
> > is
> > > > not possible with SOLR as it is?
> > > >
> > > > I just thought it would be quite a common issue; I mean generally for
> > > > search engines its more important to show the first page results,
> > rather
> > > > than using timeAllowed which might not even return a single result.
> > > >
> > > > Thanks!
> > > >
> > > >
> > > > --
> > > > Regards,
> > > >
> > > > Salman Akram
> > > >
> > >
> > >
> > >
> > > --
> > > Dmitry
> > > Blog: http://dmitrykan.blogspot.com
> > > Twitter: http://twitter.com/dmitrykan
> > >
> >
> >
> >
> > --
> > Regards,
> >
> > Salman Akram
> >
>
>
>
> --
> Dmitry
> Blog: http://dmitrykan.blogspot.com
> Twitter: http://twitter.com/dmitrykan
>



-- 
Regards,

Salman Akram

Reply via email to