Re: Defaults Merge Policy

2020-04-27 Thread Kayak28
gger and > >> tend to use time on the theory that it’s easier to explain, > >> whereas when commits happen when using maxDocs > >> varies depending on the throughput rate. > >> > >> Best, > >> Erick > >> > >>> On Apr 15, 2020,

Re: Defaults Merge Policy

2020-04-23 Thread Erick Erickson
that it’s easier to explain, >> whereas when commits happen when using maxDocs >> varies depending on the throughput rate. >> >> Best, >> Erick >> >>> On Apr 15, 2020, at 1:28 PM, Kayak28 wrote: >>> >>> Hello, Solr Community: >>&

Re: Defaults Merge Policy

2020-04-23 Thread Kayak28
> > On Apr 15, 2020, at 1:28 PM, Kayak28 wrote: > > > > Hello, Solr Community: > > > > I would like to ask about Default's Merge Policy for Solr 8.3.0. > > My client (SolrJ) makes a commit every 10'000 doc. > > I have not explicitly configured Merge

Re: Defaults Merge Policy

2020-04-15 Thread Erick Erickson
rate. Best, Erick > On Apr 15, 2020, at 1:28 PM, Kayak28 wrote: > > Hello, Solr Community: > > I would like to ask about Default's Merge Policy for Solr 8.3.0. > My client (SolrJ) makes a commit every 10'000 doc. > I have not explicitly configured Merge Polic

Defaults Merge Policy

2020-04-15 Thread Kayak28
Hello, Solr Community: I would like to ask about Default's Merge Policy for Solr 8.3.0. My client (SolrJ) makes a commit every 10'000 doc. I have not explicitly configured Merge Policy via solrconfig.xml For each indexing time, some documents are updated or deleted. I think the Def

Re: Reindex Required for Merge Policy Changes?

2020-02-25 Thread Zimmermann, Thomas
; > Few questions before I tackled an upgrade here. Looking to go from 7.4 to 7.7.2 to take advantage of the improved Tiered Merge Policy and segment cleanup – we are dealing with some high (45%) deleted doc counts in a few cores. Would simply upgrading Solr and setting the cores to u

Re: Reindex Required for Merge Policy Changes?

2020-02-24 Thread Erick Erickson
, at 5:42 PM, Zimmermann, Thomas > wrote: > > Hi Folks – > > Few questions before I tackled an upgrade here. Looking to go from 7.4 to > 7.7.2 to take advantage of the improved Tiered Merge Policy and segment > cleanup – we are dealing with some high (45%) deleted doc count

Reindex Required for Merge Policy Changes?

2020-02-24 Thread Zimmermann, Thomas
Hi Folks – Few questions before I tackled an upgrade here. Looking to go from 7.4 to 7.7.2 to take advantage of the improved Tiered Merge Policy and segment cleanup – we are dealing with some high (45%) deleted doc counts in a few cores. Would simply upgrading Solr and setting the cores to use

Re: merge policy & autocommit

2019-10-28 Thread Shawn Heisey
g a correctly configured autoCommit would substantially affect indexing speeds. The same can be true also for the merge policy? how the IO speed can affect the merge policy parameters? I kept the default merge policy configuration but it looks like it never merges segments. How can I know if a mer

merge policy & autocommit

2019-10-28 Thread Danilo Tomasoni
y our solr instance to the point of making it not responsive? The same can be true also for the merge policy? how the IO speed can affect the merge policy parameters? I kept the default merge policy configuration but it looks like it never merges segments. How can I know if a merge is happening

Re: Default merge policy

2018-10-12 Thread Shawn Heisey
what Erick said.  Erick's info is completely valid. For the version you are on, specifying a mergeFactor of 25 with no other merge-policy related config effectively results in this config:   25   25   30 I would recommend replacing mergeFactor with an explicit merge policy config.  Sinc

Re: Default merge policy

2018-10-12 Thread Erick Erickson
bq. However we have not specified it in the following way Is that a typo and you mean "have now specified"? There's code in SolrIndexConfig: if (policy instanceof TieredMergePolicy) { if (mergeFactor != -1) { tieredMergePolicy.setMaxMergeAtOnce(mergeFactor); tieredMergePolicy.setSegmen

Default merge policy

2018-10-12 Thread root23
Hi all, I am little bit confused. We are on solr 6. and as per the documentation i think solr 6 uses TieredMergePolicyFactory. However we have not specified it in the following way 10 10 We still use 25. which i understand is not used by TieredMergePolicyFactory. So my confusion is that w

Re: Changing merge policy config on production

2017-12-16 Thread alexpusch
Thanks Erick, good point on maxMergedSegmentMB, many of my segments really are max out. My index isn't 800G, but it's not far from it - it's about 250G per server. I have high confidence in Solr and my EC2 i3-2xl instances, so far I got pretty good results. -- Sent from: http://lucene.472066.n3

Re: Changing merge policy config on production

2017-12-16 Thread Erick Erickson
So I'm guessing you have something on the order of an 800G index? The max segment size is roughly 5G (by default) and assuming all your segments are close to the max size I get 160 * 5G = 800G, but that may be off. I think you're barking up the wrong tree if these numbers are close to correct. Th

Re: Changing merge policy config on production

2017-12-16 Thread alexpusch
To be clear - I'm talking about query performance, not indexing performance. -- Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html

Re: Changing merge policy config on production

2017-12-16 Thread alexpusch
Thanks for the quick answer Erick, I'm hoping to improve performance by reducing the number of segments. Currently I have ~160 segments. Am I wrong thinking it might improve performance? -- Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html

Re: Changing merge policy config on production

2017-12-15 Thread Erick Erickson
The merge rate will be limited by the number of merge threads. You'll merge more often though so the load will change. That said, I wouldn't be concerned unless you have a very high indexing rate. Why do you want to change anyway? Unless you've tried the new settings in a Dev environment, the bigg

Changing merge policy config on production

2017-12-15 Thread alexpusch
Hi, Is it safe to change the mergePolicyFactory config on production servers? Specifically maxMergeAtOnce and segmentsPerTier. How will solr reconcile the current state of the segments with the new config? In case of setting segmentsPerTier to a lower number - will subsequent merges be particulary

Re: Merging is not taking place with tiered merge policy

2017-10-23 Thread Erick Erickson
1> merging takes place up until the max segment size is reached (5G in the default TieredMergePolicy). 2> there are a couple of options, again config changes for TieredMergePolicy 10 might help. You could also try upping this (the default is 5G). 5000 Best, Erick On Mon, Oct 23, 2017 at 10:34

Re: Merging is not taking place with tiered merge policy

2017-10-23 Thread chandrushanmugasundaram
Thanks eric. (Beginner in solr). Few questions. 1. Does merging take place only when we have deleted docs? When my segments reach a count of 35+ the search is getting slow.Only on performing force merge to index the search is efficient. 2. Is there any way we can reduce the number of segments

Re: Merging is not taking place with tiered merge policy

2017-10-23 Thread chandrushanmugasundaram
Amrit, Thanks for your reply. I have removed that 1000 1 15 false 1024 2 2 hdfs 1 0 -- Sent from: http://lucene.472066.n3.nabble.com

Re: Merging is not taking place with tiered merge policy

2017-10-23 Thread Erick Erickson
And please define what you mean by "merging is not working". One parameter is max segments size, which defaults to 5G. Segments at or near that size are not eligible for merging unless they have around 50% deleted docs. Best, Erick On Mon, Oct 23, 2017 at 3:11 AM, Amrit Sarkar wrote: > Chandru,

Re: Merging is not taking place with tiered merge policy

2017-10-23 Thread Amrit Sarkar
Chandru, Didn't try the above config bu whyt have you defined both "mergePolicy" and "mergePolicyFactory"? and pass different values for same parameters? > 10 > 1 > > > 10 > 10 > > Amrit Sarkar Search Engineer Lucidworks, Inc. 415-589-9269 www.lucidworks.com Twitter http://

Merging is not taking place with tiered merge policy

2017-10-22 Thread Chandru Shanmugasundaram
The following is my solrconfig.xml 1000 1 15 false 1024 10 1 10 10 hdfs 1 0 Please let me know if should I tweak something above -- Thanks, Chandru.S

Re: Merge policy

2016-10-28 Thread Walter Underwood
er. That's why I was looking for an optimize. > > Do you have any idea why the merge policy does not merge away the deletions? > Should I tweak some parameters somehow? It's a default installation using the > default settings and parameters. If you need more info, just let

Re: Merge policy

2016-10-28 Thread Emir Arnautovic
ards with replication factor 2 so it's a lot of data. The deletions percentage at the bottom of the segment page is around 25%. So it's quite some space which we could recover. That's why I was looking for an optimize. Do you have any idea why the merge policy does not merge away

Re: Merge policy

2016-10-28 Thread Arkadi Colson
sizes seems odd. Why so many medium-large segments? Are there custom settings for merge policy? I think the default policy would avoid so many segments that are mostly deleted documents. wunder Walter Underwood wun...@wunderwood.org http://observer.wunderwood.org/ (my blog) On Oct 27, 2016,

Re: Merge policy

2016-10-28 Thread Arkadi Colson
imize. Do you have any idea why the merge policy does not merge away the deletions? Should I tweak some parameters somehow? It's a default installation using the default settings and parameters. If you need more info, just let me know... Thx! On 27-10-16 17:40, Erick Erickson wrote:

Re: Merge policy

2016-10-27 Thread Walter Underwood
That distribution of segment sizes seems odd. Why so many medium-large segments? Are there custom settings for merge policy? I think the default policy would avoid so many segments that are mostly deleted documents. wunder Walter Underwood wun...@wunderwood.org http://observer.wunderwood.org

Re: Merge policy

2016-10-27 Thread Shawn Heisey
On 10/27/2016 9:50 AM, Yonik Seeley wrote: > On Thu, Oct 27, 2016 at 9:56 AM, Arkadi Colson > wrote: >> Thanks for the answer! Do you know if there is a way to trigger an >> optimize for only 1 shard and not the whole collection at once? > Adding a "distrib=false" parameter should work I think.

Re: Merge policy

2016-10-27 Thread Yonik Seeley
On Thu, Oct 27, 2016 at 9:56 AM, Arkadi Colson wrote: > Thanks for the answer! > Do you know if there is a way to trigger an optimize for only 1 shard and > not the whole collection at once? > Adding a "distrib=false" parameter should work I think. -Yonik

Re: Merge policy

2016-10-27 Thread Erick Erickson
, "Arkadi Colson" wrote: > >> Hi >> >> As you can see in the screenshot above in the oldest segments there are a >> lot of deletions. In total the shard has about 26% deletions. How can I get >> rid of them so the index will be smaller again? >> Can this o

Re: Merge policy

2016-10-27 Thread Arkadi Colson
re are a lot of deletions. In total the shard has about 26% deletions. How can I get rid of them so the index will be smaller again? Can this only be done with an optimize or does it also depend on the merge policy? If it also depends also on the merge policy which one sho

Re: Merge policy

2016-10-27 Thread Pushkar Raste
> Hi > > As you can see in the screenshot above in the oldest segments there are a > lot of deletions. In total the shard has about 26% deletions. How can I get > rid of them so the index will be smaller again? > Can this only be done with an optimize or does it also depend on the merg

Merge policy

2016-10-27 Thread Arkadi Colson
Hi As you can see in the screenshot above in the oldest segments there are a lot of deletions. In total the shard has about 26% deletions. How can I get rid of them so the index will be smaller again? Can this only be done with an optimize or does it also depend on the merge policy? If it

Re: solr 4.7.2 mergeFactor/ Merge policy issue

2015-03-16 Thread Dmitry Kan
nd just to be really clear, you _only_ seeing more segments being > >> added, right? If you're only counting files in the index directory, it's > >> _possible_ that merging is happening, you're just seeing new files take > >> the place of old ones. > >> > >> Best, > >> Erick > >> > >> On Wed, Mar 4, 2015 at 7:12 PM, Shawn Heisey > wrote: > >>> On 3/4/2015 4:12 PM, Erick Erickson wrote: > >>>> I _think_, but don't know for sure, that the merging stuff doesn't get > >>>> triggered until you commit, it doesn't "just happen". > >>>> > >>>> Shot in the dark... > >>> > >>> I believe that new segments are created when the indexing buffer > >>> (ramBufferSizeMB) fills up, even without commits. I'm pretty sure that > >>> anytime a new segment is created, the merge policy is checked to see > >>> whether a merge is needed. > >>> > >>> Thanks, > >>> Shawn > >>> > > > > -- Dmitry Kan Luke Toolbox: http://github.com/DmitryKey/luke Blog: http://dmitrykan.blogspot.com Twitter: http://twitter.com/dmitrykan SemanticAnalyzer: www.semanticanalyzer.info

Re: solr 4.7.2 mergeFactor/ Merge policy issue

2015-03-13 Thread Summer Shire
t merging is happening, you're just seeing new files take >> the place of old ones. >> >> Best, >> Erick >> >> On Wed, Mar 4, 2015 at 7:12 PM, Shawn Heisey wrote: >>> On 3/4/2015 4:12 PM, Erick Erickson wrote: >>>> I _think_, but don't know for sure, that the merging stuff doesn't get >>>> triggered until you commit, it doesn't "just happen". >>>> >>>> Shot in the dark... >>> >>> I believe that new segments are created when the indexing buffer >>> (ramBufferSizeMB) fills up, even without commits. I'm pretty sure that >>> anytime a new segment is created, the merge policy is checked to see >>> whether a merge is needed. >>> >>> Thanks, >>> Shawn >>> >

Re: solr 4.7.2 mergeFactor/ Merge policy issue

2015-03-06 Thread Summer Shire
7;t get >>> triggered until you commit, it doesn't "just happen". >>> >>> Shot in the dark... >> >> I believe that new segments are created when the indexing buffer >> (ramBufferSizeMB) fills up, even without commits. I'm pretty sure that >> anytime a new segment is created, the merge policy is checked to see >> whether a merge is needed. >> >> Thanks, >> Shawn >>

Re: solr 4.7.2 mergeFactor/ Merge policy issue

2015-03-05 Thread Erick Erickson
sure, that the merging stuff doesn't get >> triggered until you commit, it doesn't "just happen". >> >> Shot in the dark... > > I believe that new segments are created when the indexing buffer > (ramBufferSizeMB) fills up, even without commits. I&

Re: solr 4.7.2 mergeFactor/ Merge policy issue

2015-03-04 Thread Shawn Heisey
ffer (ramBufferSizeMB) fills up, even without commits. I'm pretty sure that anytime a new segment is created, the merge policy is checked to see whether a merge is needed. Thanks, Shawn

Re: solr 4.7.2 mergeFactor/ Merge policy issue

2015-03-04 Thread Summer Shire
actually after every commit a new segment gets created. I don't see them merging down. what all could i do to debug this better. Hasn't anyone else tried to merge their segments down to a specific range :) ? On Wed, Mar 4, 2015 at 3:12 PM, Erick Erickson wrote: > I _think_, but don't know for s

Re: solr 4.7.2 mergeFactor/ Merge policy issue

2015-03-04 Thread Erick Erickson
I _think_, but don't know for sure, that the merging stuff doesn't get triggered until you commit, it doesn't "just happen". Shot in the dark... Erick On Wed, Mar 4, 2015 at 1:15 PM, Summer Shire wrote: > Hi All, > > I am using solr 4.7.2 is there a bug wrt merging the segments down ? > > I rec

solr 4.7.2 mergeFactor/ Merge policy issue

2015-03-04 Thread Summer Shire
Hi All, I am using solr 4.7.2 is there a bug wrt merging the segments down ? I recently added the following to my solrConfig.xml false 100 1000 5 But I do not see any merging of the segments happening. I saw some other people have the same issue but there wasn’t much info

Re: lucene merge policy in solr

2013-03-07 Thread Erick Erickson
; In earlier lucene version it merges segements periodically > according to merge policy, when it reached merge time, indexing > request may take longer time to finish (in my test it may delay > 10-30 seconds, depending on indexed data size). > > I read solr 3.6 - 4.1 doc and w

lucene merge policy in solr

2013-03-05 Thread Zhang, Lisheng
Hi, In earlier lucene version it merges segements periodically according to merge policy, when it reached merge time, indexing request may take longer time to finish (in my test it may delay 10-30 seconds, depending on indexed data size). I read solr 3.6 - 4.1 doc and we have entries in

Re: Merge Policy Recommendation for 3.6.1

2012-09-29 Thread Sujatha Arun
r 14 segments, and possibly more. > > Assuming some things, which lead to using the 13 segment figure: > simultaneous indexing to multiple cores at once, with termvectors turned > on. With these assumptions, a 200 core Solr installation using 4 segments > might potentially have nearly

Re: Merge Policy Recommendation for 3.6.1

2012-09-28 Thread Shawn Heisey
ts might potentially have nearly 37000 files open, but is more likely to have significantly less. If you increase your merge policy segment limit, the numbers will go up from there. I have configured my Linux servers with a soft file limit of 49152 and a hard limit of 65536. My segment limit is

Merge Policy Recommendation for 3.6.1

2012-09-27 Thread Sujatha Arun
Hello, In the case where there are over 200+ cores on a single node , is it recommended to go with Tiered MP with segment size of 4 ? Our Index size vary from a few MB to 4 GB . Will there be any issue with "Too many open files " and the number of indexes with respect to MP ? At the moment we ar

Re: Merge Policy

2009-07-21 Thread Jason Rutherglen
I am referring to setting properties on the *existing* policy available in Lucene such as LogByteSizeMergePolicy.setMaxMergeMB On Tue, Jul 21, 2009 at 5:11 PM, Chris Hostetter wrote: > > : SolrIndexConfig accepts a mergePolicy class name, however how does one > : inject properties into it? > > At

Re: Merge Policy

2009-07-21 Thread Chris Hostetter
: SolrIndexConfig accepts a mergePolicy class name, however how does one : inject properties into it? At the moment you can't. If you look at the history of MergePolicy, users have never been encouraged to implement their own (the API actively discourages it, without going so far as to make

Merge Policy

2009-07-13 Thread Jason Rutherglen
SolrIndexConfig accepts a mergePolicy class name, however how does one inject properties into it?