Re: Solr 7.7 - Few Questions

2020-10-06 Thread Rahul Goswami
1. What tool they use to run Solr as a service on windows. >> Look into procrun. Afterall. Solr runs inside Jetty. So you should have a way to invoke Jetty’s Main class with required parameters and bundle that as a procrun service 2. How to set up the disaster recovery? >> You can back up your

Re: Reindexing major upgrades

2020-10-06 Thread Bram Van Dam
On 05/10/2020 16:02, Rafael Sousa wrote: > Having things reindexed from scratch is not > an option, so, is there a way of creating a 8.6.2 index from a pre-existing > 6.5 index or something like that? Sadly there is no such way. If all your fields are stored you might be able to whip up something

Re: Java GC issue investigation

2020-10-06 Thread matthew sporleder
You have a 12G heap for a 200MB index? Can you just try changing Xmx to, like, 1g ? On Tue, Oct 6, 2020 at 7:43 AM Karol Grzyb wrote: > > Hi, > > I'm involved in investigation of issue that involves huge GC overhead > that happens during performance tests on Solr Nodes. Solr version is > 6.1.

Re: Non Deterministic Results from /admin/luke

2020-10-06 Thread Andrzej Białecki
You may want to check the COLSTATUS collection command added in 8.1 (https://lucene.apache.org/solr/guide/8_6/collection-management.html#colstatus ). This reports much of the information returned by /admin/luke but

Java GC issue investigation

2020-10-06 Thread Karol Grzyb
Hi, I'm involved in investigation of issue that involves huge GC overhead that happens during performance tests on Solr Nodes. Solr version is 6.1. Last test were done on staging env, and we run into problems for <100 requests/second. The size of the index itself is ~200MB ~ 50K docs Index has

Re: Java GC issue investigation

2020-10-06 Thread Karol Grzyb
Hi Matthew, Thank you for the answer, I cannot reproduce the setup locally I'll try to convince them to reduce Xmx, I guess they will rather not agree to 1GB but something less than 12G for sure. And have some proper dev setup because for now we could only test prod or stage which are difficult

RE: Master/Slave

2020-10-06 Thread Oakley, Craig (NIH/NLM/NCBI) [C]
> it better not ever be depreciated. it has been the most reliable mechanism > for its purpose I would like to know whether that is the consensus of Solr developers. We had been scrambling to move from Master/Slave to CDCR based on the assertion that CDCR support would last far longer than

Re: Java GC issue investigation

2020-10-06 Thread Erick Erickson
12G is not that huge, it’s surprising that you’re seeing this problem. However, there are a couple of things to look at: 1> If you’re saying that you have 16G total physical memory and are allocating 12G to Solr, that’s an anti-pattern. See:

Re: Java GC issue investigation

2020-10-06 Thread matthew sporleder
Your index is so small that it should easily get cached into OS memory as it is accessed. Having a too-big heap is a known problem situation. https://cwiki.apache.org/confluence/display/SOLR/SolrPerformanceProblems#SolrPerformanceProblems-HowmuchheapspacedoIneed? On Tue, Oct 6, 2020 at 9:44 AM

Re: Order of applying tokens/filter

2020-10-06 Thread Walter Underwood
Synonyms only need to be done once. Generally, expand synonyms at index time only. Also, consider the StandardTokeniizer. It is a bit smarter and that can be useful. wunder Walter Underwood wun...@wunderwood.org http://observer.wunderwood.org/ (my blog) > On Oct 5, 2020, at 9:08 PM,

timeAllowed default value

2020-10-06 Thread Steven White
Hi everyone, What is the default value for timeAllowed to make it behave as if it is not set? Is it "-1" or some other number? Rather than writing my code to include or not include timeAllowed in the query parameter, I rather have it be part of my query all the time and only change the value if

Re: Using streaming expressions with shards filter

2020-10-06 Thread Joel Bernstein
There is a parameter in streaming expressions for this but it is not available for use in every stream source. The search expression should honor it though. If you pass the .shard=shard1,shard2,shard3... The search stream will honor this. This work was originally done for supporting

RE: Solr 7.7 - Few Questions

2020-10-06 Thread Manisha Rahatadkar
Hi All First of all thanks to Shawn, Rahul and Charlie for taking time to reply my questions and valuable information. I was very concerned about the size of the each document and on several follow ups got more information that the documents which have 0.5GB size are mp4 documents and these

Re: Using streaming expressions with shards filter

2020-10-06 Thread Joel Bernstein
Actually it's: .shards=shard1,shard2,shard3... Joel Bernstein http://joelsolr.blogspot.com/ On Tue, Oct 6, 2020 at 2:38 PM Joel Bernstein wrote: > > There is a parameter in streaming expressions for this but it is not > available for use in every stream source. The search expression should