Re: Tuning Solr caches with high commit rates (NRT)

2010-12-02 Thread stockii
not only the old index. automatic refresh ? XD -- View this message in context: http://lucene.472066.n3.nabble.com/Tuning-Solr-caches-with-high-commit-rates-NRT-tp1461275p2005738.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Tuning Solr caches with high commit rates (NRT)

2010-12-02 Thread Peter Sturge
-with-high-commit-rates-NRT-tp1461275p2005738.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Tuning Solr caches with high commit rates (NRT)

2010-11-16 Thread Peter Sturge
Many thanks, Peter K. for posting up on the wiki - great! Yes, fc = field cache. Field Collapsing is something very nice indeed, but is entirely different. As Erik mentions in the wiki post, using per-segment faceting can be a huge boon to performance. It does require the latest Solr trunk build

Re: Tuning Solr caches with high commit rates (NRT)

2010-11-15 Thread Peter Karich
Just in case someone is interested: I put the emails of Peter Sturge with some minor edits in the wiki: http://wiki.apache.org/solr/NearRealtimeSearchTuning I found myself search the thread again and again ;-) Feel free to add and edit content! Regards, Peter. Hi Erik, I thought this

Re: Tuning Solr caches with high commit rates (NRT)

2010-11-15 Thread Jonathan Rochkind
Awesome. I'm not sure his point 1 about facet.method=enum is still valid in Solr 1.4+. The fc facet.method was changed significantly in 1.4, and generally no longer takes a lot of memory -- for facets with many unique values, method fc in fact should take less than enum, I think? Peter

Re: Tuning Solr caches with high commit rates (NRT)

2010-11-15 Thread Peter Karich
Hi Jonathan, I am too using fc because it simply was faster. Not sure if this can be applied in general. I will add this info to the wiki. Regards, Peter. Awesome. I'm not sure his point 1 about facet.method=enum is still valid in Solr 1.4+. The fc facet.method was changed significantly

Re: Tuning Solr caches with high commit rates (NRT)

2010-11-15 Thread Dennis Gearon
=nl.e036' EARTH has a Right To Life, otherwise we all die. - Original Message From: Peter Karich peat...@yahoo.de To: solr-user@lucene.apache.org Sent: Mon, November 15, 2010 1:37:00 PM Subject: Re: Tuning Solr caches with high commit rates (NRT) Hi Jonathan, I am too using fc

Re: Tuning Solr caches with high commit rates (NRT)

2010-11-15 Thread Jonathan Rochkind
://blogs.techrepublic.com.com/security/?p=4501tag=nl.e036' EARTH has a Right To Life, otherwise we all die. - Original Message From: Peter Karich peat...@yahoo.de To: solr-user@lucene.apache.org Sent: Mon, November 15, 2010 1:37:00 PM Subject: Re: Tuning Solr caches with high commit rates (NRT) Hi

Re: Tuning Solr caches with high commit rates (NRT)

2010-11-15 Thread Peter Karich
, November 15, 2010 1:37:00 PM Subject: Re: Tuning Solr caches with high commit rates (NRT) Hi Jonathan, I am too using fc because it simply was faster. Not sure if this can be applied in general. I will add this info to the wiki. Regards, Peter. Awesome. I'm not sure his point 1 about facet.method

Re: Tuning Solr caches with high commit rates (NRT)

2010-11-15 Thread Koji Sekiguchi
(10/11/16 6:43), Dennis Gearon wrote: fc='field collapsing'? fc of facet.method=fc stands for Lucene's FieldCache. enum of facet.method=enum stands for Lucene's TermEnum. Usually, you do not need to set facet.method because Solr automatically uses most appropriate facet method for each field

Re: Tuning Solr caches with high commit rates (NRT)

2010-11-15 Thread Jonathan Rochkind
Koji Sekiguchi wrote: Usually, you do not need to set facet.method because Solr automatically uses most appropriate facet method for each field type: boolean: TermEnum multiValued/tokenized: UnInvertedField other than those above: FieldCache As I understand it, in Solr 1.4, (and I may NOT

Re: Tuning Solr caches with high commit rates (NRT)

2010-11-15 Thread Koji Sekiguchi
(10/11/16 8:36), Jonathan Rochkind wrote: In Solr 1.4, facet.method=enum DOES work on multi-valued fields, I'm pretty certain. Correct, and I didn't say that facet.method=enum doesn't work for multiValued/tokenized field in my previous mail. I think Koji's explanation is based on before

Re: Tuning Solr caches with high commit rates (NRT)

2010-09-17 Thread Peter Sturge
Hi, It's great to see such a fantastic response to this thread - NRT is alive and well! I'm hoping to collate this information and add it to the wiki when I get a few free cycles (thanks Erik for the heads up). In the meantime, I thought I'd add a few tidbits of additional information that

Re: Tuning Solr caches with high commit rates (NRT)

2010-09-17 Thread Dennis Gearon
Subject: Re: Tuning Solr caches with high commit rates (NRT) To: solr-user@lucene.apache.org Date: Friday, September 17, 2010, 2:18 AM Hi, It's great to see such a fantastic response to this thread - NRT is alive and well! I'm hoping to collate this information and add it to the wiki

Re: Tuning Solr caches with high commit rates (NRT)

2010-09-17 Thread Erick Erickson
--- On Fri, 9/17/10, Peter Sturge peter.stu...@gmail.com wrote: From: Peter Sturge peter.stu...@gmail.com Subject: Re: Tuning Solr caches with high commit rates (NRT) To: solr-user@lucene.apache.org Date: Friday, September 17, 2010, 2:18 AM Hi, It's great to see such a fantastic

Re: Tuning Solr caches with high commit rates (NRT)

2010-09-17 Thread Dennis Gearon
: From: Erick Erickson erickerick...@gmail.com Subject: Re: Tuning Solr caches with high commit rates (NRT) To: solr-user@lucene.apache.org Date: Friday, September 17, 2010, 10:05 AM Near Real Time... Erick On Fri, Sep 17, 2010 at 12:55 PM, Dennis Gearon gear...@sbcglobal.netwrote: BTW

Re: Tuning Solr caches with high commit rates (NRT)

2010-09-17 Thread Andy
Does Solr use Lucene NRT? --- On Fri, 9/17/10, Erick Erickson erickerick...@gmail.com wrote: From: Erick Erickson erickerick...@gmail.com Subject: Re: Tuning Solr caches with high commit rates (NRT) To: solr-user@lucene.apache.org Date: Friday, September 17, 2010, 1:05 PM Near Real Time

Re: Tuning Solr caches with high commit rates (NRT)

2010-09-17 Thread Peter Sturge
wrote: From: Erick Erickson erickerick...@gmail.com Subject: Re: Tuning Solr caches with high commit rates (NRT) To: solr-user@lucene.apache.org Date: Friday, September 17, 2010, 1:05 PM Near Real Time... Erick On Fri, Sep 17, 2010 at 12:55 PM, Dennis Gearon gear...@sbcglobal.netwrote: BTW

Re: Tuning Solr caches with high commit rates (NRT)

2010-09-14 Thread Peter Karich
Hi Peter, this scenario would be really great for us - I didn't know that this is possible and works, so: thanks! At the moment we are doing similar with replicating to the readonly instance but the replication is somewhat lengthy and resource-intensive at this datavolume ;-) Regards, Peter.

Re: Tuning Solr caches with high commit rates (NRT)

2010-09-13 Thread Dennis Gearon
Rutherglen jason.rutherg...@gmail.com wrote: From: Jason Rutherglen jason.rutherg...@gmail.com Subject: Re: Tuning Solr caches with high commit rates (NRT) To: solr-user@lucene.apache.org Date: Sunday, September 12, 2010, 7:52 PM Yeah there's no patch... I think Yonik can write it. :-)  Yah

Re: Tuning Solr caches with high commit rates (NRT)

2010-09-13 Thread Peter Sturge
The balanced segment merging is a really cool idea. I'll definetely have a look at this, thanks! One thing I forgot to mention in the original post is we use a mergeFactor of 25. Somewhat on the high side, so that incoming commits aren't trying to merge new data into large segments. 25 is a good

Re: Tuning Solr caches with high commit rates (NRT)

2010-09-13 Thread Peter Sturge
1. You can run multiple Solr instances in separate JVMs, with both having their solr.xml configured to use the same index folder. You need to be careful that one and only one of these instances will ever update the index at a time. The best way to ensure this is to use one for writing only, and

Re: Tuning Solr caches with high commit rates (NRT)

2010-09-13 Thread Peter Sturge
Hi Erik, I thought this would be good for the wiki, but I've not submitted to the wiki before, so I thought I'd put this info out there first, then add it if it was deemed useful. If you could let me know the procedure for submitting, it probably would be worth getting it into the wiki (couldn't

Re: Tuning Solr caches with high commit rates (NRT)

2010-09-13 Thread Peter Sturge
: Jason Rutherglen jason.rutherg...@gmail.com Subject: Re: Tuning Solr caches with high commit rates (NRT) To: solr-user@lucene.apache.org Date: Sunday, September 12, 2010, 7:52 PM Yeah there's no patch... I think Yonik can write it. :-)  Yah... The Lucene version shouldn't matter.  The distributed

Re: Tuning Solr caches with high commit rates (NRT)

2010-09-13 Thread Simon Willnauer
...@gmail.com wrote: From: Jason Rutherglen jason.rutherg...@gmail.com Subject: Re: Tuning Solr caches with high commit rates (NRT) To: solr-user@lucene.apache.org Date: Sunday, September 12, 2010, 7:52 PM Yeah there's no patch... I think Yonik can write it. :-)  Yah... The Lucene version shouldn't

Re: Tuning Solr caches with high commit rates (NRT)

2010-09-13 Thread Dennis Gearon
Willnauer simon.willna...@googlemail.com Subject: Re: Tuning Solr caches with high commit rates (NRT) To: solr-user@lucene.apache.org Date: Monday, September 13, 2010, 1:33 AM On Mon, Sep 13, 2010 at 8:02 AM, Dennis Gearon gear...@sbcglobal.net wrote: BTW, what is a segment? On the Lucene

Tuning Solr caches with high commit rates (NRT)

2010-09-12 Thread Peter Sturge
Hi, Below are some notes regarding Solr cache tuning that should prove useful for anyone who uses Solr with frequent commits (e.g. 5min). Environment: Solr 1.4.1 or branch_3x trunk. Note the 4.x trunk has lots of neat new features, so the notes here are likely less relevant to the 4.x

Re: Tuning Solr caches with high commit rates (NRT)

2010-09-12 Thread Erick Erickson
Peter: This kind of information is extremely useful to document, thanks! Do you have the time/energy to put it up on the Wiki? Anyone can edit it by creating a logon. If you don't, would it be OK if someone else did it (with attribution, of course)? I guess that by bringing it up I'm volunteering

Re: Tuning Solr caches with high commit rates (NRT)

2010-09-12 Thread Dennis Gearon
: From: Peter Sturge peter.stu...@gmail.com Subject: Tuning Solr caches with high commit rates (NRT) To: solr-user@lucene.apache.org Date: Sunday, September 12, 2010, 9:26 AM Hi, Below are some notes regarding Solr cache tuning that should prove useful for anyone who uses Solr with frequent

Re: Tuning Solr caches with high commit rates (NRT)

2010-09-12 Thread Peter Karich
Peter, thanks a lot for your in-depth explanations! Your findings will be definitely helpful for my next performance improvement tests :-) Two questions: 1. How would I do that: or a local read-only instance that reads the same core as the indexing instance (for the latter, you'll need

Re: Tuning Solr caches with high commit rates (NRT)

2010-09-12 Thread Jason Rutherglen
Peter, Are you using per-segment faceting, eg, SOLR-1617? That could help your situation. On Sun, Sep 12, 2010 at 12:26 PM, Peter Sturge peter.stu...@gmail.com wrote: Hi, Below are some notes regarding Solr cache tuning that should prove useful for anyone who uses Solr with frequent commits

Re: Tuning Solr caches with high commit rates (NRT)

2010-09-12 Thread Peter Sturge
Hi Jason, I've tried some limited testing with the 4.x trunk using fcs, and I must say, I really like the idea of per-segment faceting. I was hoping to see it in 3.x, but I don't see this option in the branch_3x trunk. Is your SOLR-1606 patch referred to in SOLR-1617 the one to use with 3.1?

Re: Tuning Solr caches with high commit rates (NRT)

2010-09-12 Thread Lance Norskog
Bravo! Other tricks: here is a policy for deciding when to merge segments that attempts to balance merging with performance. It was contributed by LinkedIn- they also run indexsearch in the same instance (not Solr, a different Lucene app).

Re: Tuning Solr caches with high commit rates (NRT)

2010-09-12 Thread Chris Haggstrom
Thanks, Peter. This is really great info. One setting I've found to be very useful for the problem of overlapping onDeskSearchers is to reduce the value of maxWarmingSearchers in solrconfig.xml. I've reduced this to 1, so if a slave is already busy doing pre-warming, it won't try to also

Re: Tuning Solr caches with high commit rates (NRT)

2010-09-12 Thread Jason Rutherglen
Yeah there's no patch... I think Yonik can write it. :-) Yah... The Lucene version shouldn't matter. The distributed faceting theoretically can easily be applied to multiple segments, however the way it's written for me is a challenge to untangle and apply successfully to a working patch. Also