Re: Question on metric values

2020-10-26 Thread Andrzej Białecki
” you mean rate of requests over a time period then the 1-, 5- and 15-min rates are available from “QUERY./select.requestTimes” — Andrzej Białecki > On 26 Oct 2020, at 17:25, yaswanth kumar wrote: > > I am new to metrics api in solr , when I try to do > solr/admin/metrics?

Re: Is metrics api enabled by default in solr 8.2

2020-10-14 Thread Andrzej Białecki
SOLR-14914 (scheduled for 8.7) adds a boolean property (either by modifying solr.xml:/metrics element or via “metricsEnabled” system property) to almost completely turn off the metrics collection and processing. The “almost” part means that the instrumentation still remains in place, but the

Re: Non Deterministic Results from /admin/luke

2020-10-06 Thread Andrzej Białecki
You may want to check the COLSTATUS collection command added in 8.1 (https://lucene.apache.org/solr/guide/8_6/collection-management.html#colstatus ). This reports much of the information returned by /admin/luke but

Re: Solr waitForMerges() causing leaderless shard during shutdown

2020-09-28 Thread Andrzej Białecki
Hi Ramsey, This is an interesting scenario, I vaguely remember someone (Cao Manh Dat?) on a similar issue - I’m not sure if newer versions of Solr already fixed that but it would be helpful to create a Jira issue to investigate it and verify that it’s indeed fixed in a more recent Solr

Re: SegmentsInfoRequestHandler does not release IndexWriter

2020-04-23 Thread Andrzej Białecki
Hi Tiziano, Indeed, this looks like a bug - good catch! Please file a Jira issue, I’ll get to it soon. > On 23 Apr 2020, at 00:19, Tiziano Degaetano > wrote: > > Hello, > > I’m digging in an issue getting timeouts doing a managed schema change using > the schema api. > The call hangs

Re: How to compute index size

2020-02-04 Thread Andrzej Białecki
If you’re using Solr 8.2 or newer there’s a built-in index analysis tool that gives you a better understanding of what kind of data in your index occupies the most disk space, so that you can tweak your schema accordingly:

Re: [EXTERNAL] Autoscaling simulation error

2019-12-19 Thread Andrzej Białecki
Hi, Thanks for the data. I see the problem now - it’s a bug in the simulator. I filed a Jira issue to track and fix it: SOLR-14122. > On 16 Dec 2019, at 19:13, Cao, Li wrote: > >> I am using solr 8.3.0 in cloud mode. I have collection level autoscaling >> policy and the collection name is

Re: "No value present" when set cluster policy for autoscaling in solr cloud mode

2019-12-19 Thread Andrzej Białecki
Hi, For some strange reason global tags (such as “cores”) don’t support the “nodeset” syntax. For “cores” the only supported attribute is “node”, and then you’re only allowed to use #ANY or a single specific node name (with optional “!" NOT operand), or a JSON array containing node names to

Re: Autoscaling simulation error

2019-12-15 Thread Andrzej Białecki
Could you please provide the exact command-line? It would also help if you could provide an autoscaling snapshot of the cluster (bin/solr autoscaling -save ) or at least the autoscaling diagnostic info. (Please note that the mailing list removes all attachments, so just provide a link to the

Re: Icelandic support in Solr

2019-11-27 Thread Andrzej Białecki
If I’m not mistaken Hunspell supports Icelandic (see here: https://cgit.freedesktop.org/libreoffice/dictionaries/tree/is ) and Lucene HunspellStemFilter should be able to use these dictionaries. > On 27 Nov 2019, at 10:10, Charlie

Re: Possible bug in cluster status - > solr 8.3

2019-11-21 Thread Andrzej Białecki
AFAIK these collection properties are not tracked that faithfully and can get out of sync, mostly because they are used only during collection CREATE and BACKUP / RESTORE and not during other collection operations or during searching / indexing. SPLITSHARD doesn’t trust them, instead it checks

Re: Metrics avgRequestsPerSecond and avgRequestsPerSecond from documentation gone?

2019-11-20 Thread Andrzej Białecki
Hi, Yes, the documentation needs to be fixed, these attributes have been removed or replaced. * avgRequestsPerSecond -> requestTimes:meanRate. Please note that this is a non-decaying simple average based on the total wall clock time elapsed since the handler was started until NOW, and the

Re: daily SolrCloud collection wipes

2019-11-18 Thread Andrzej Białecki
This default autoscaling config helps to keep some aspects of SolrCloud clean - specifically: * Inactive shard plan: it periodically checks whether there are old shards in INACTIVE state that can be removed. Shards in this state are left-over parent shards remaining after a *successful*

Re: Metrics API - Documentation

2019-10-15 Thread Andrzej Białecki
n the apache-lucene repository? > > Thanks, > > > On Tue, 8 Oct 2019 at 11:04, Andrzej Białecki wrote: > >> Hi, >> >> Starting with Solr 7.0 all JMX metrics are actually internally driven by >> the metrics API - JMX (or Prometheus) is just a way of exp

Re: Metrics API - Documentation

2019-10-08 Thread Andrzej Białecki
/management/GarbageCollectorMXBean.html?is-external=true <https://docs.oracle.com/javase/7/docs/api/java/lang/management/GarbageCollectorMXBean.html?is-external=true> * "A latency of 1mil” - no idea what that is, I don’t think Solr API uses this abbreviation anywhere. Hope this helps.

Re: HDFS Shard Split

2019-09-17 Thread Andrzej Białecki
SplitShardCmd assumes that its main phase (when the Lucene index is being split) always executes on the local file system of the shard leader, and indeed the ShardSplitCmd.checkDiskSpace() checks the local file system’s free disk space - even though in reality in your case the actual data is

Re: Is shard split operation multithreaded?

2019-09-17 Thread Andrzej Białecki
If I understand your question correctly .. it’s single-threaded with regard to a specific shard - but you can run multiple shard splitting operations in parallel IFF they affect different shards (or different collections). See SplitShardCmd for the details of locking and how the new sub-shards

Re: Solr 7.7.2 - Autoscaling in new cluster ignoring sysprop rules, possibly all rules

2019-06-28 Thread Andrzej Białecki
Andrew, please create a JIRA issue - in my opinion this is a bug not a feature, or at least something that needs clarification. > On 27 Jun 2019, at 23:56, Andrew Kettmann > wrote: > > I found the issue. Autoscaling seems to silently ignore rules (at least > sysprop rules). Example rule: >

Re: Solr 7.7.2 - SolrCloud - SPLITSHARD - Using LINK method fails on disk usage checks

2019-06-19 Thread Andrzej Białecki
Hi Andrew, Please create a JIRA issue and attach this patch, I’ll look into fixing this. Thanks! > On 18 Jun 2019, at 23:19, Andrew Kettmann > wrote: > > Attached the patch, but that isn't sent out on the mailing list, my mistake. > Patch below: > > > > ### START > > diff --git >

Re: Solr 8.1.1, JMX and VisualVM

2019-05-30 Thread Andrzej Białecki
Hi, This has to do with the new JVM flags that optimise performance, they were added roughly at the same time when Solr switched to G1GC. In ‘bin/solr’ please comment out this flag: '-XX:+PerfDisableSharedMem'. > On 30 May 2019, at 14:59, Markus Jelsma wrote: > > Hello, > > Slight

[ANNOUNCE] Apache Solr 8.1.1 released

2019-05-28 Thread Andrzej Białecki
## 28 May 2019, Apache Solr™ 8.1.1 available The Lucene PMC is pleased to announce the release of Apache Solr 8.1.1 Solr is the popular, blazing fast, open source NoSQL search platform from the Apache Lucene project. Its major features include powerful full-text search, hit highlighting, faceted

Re: Distributed IDF in Alias

2019-05-18 Thread Andrzej Białecki
Yes, the IDFs will be different. You could probably implement a custom component that would take term statistics from the previous collections to pre-populate the stats of the current collection, but this is an uncharted area, there’s a lot that could go wrong. Eg. if there’s a genuine shift in

Re: Distributed IDF in Alias

2019-05-17 Thread Andrzej Białecki
Both descriptions are correct, but in their context. The description in the Ref Guide in the section about ExactStatsCache is correct in the sense that it uses collection-wide IDF values for terms when calculating scores for different SHARDS (and merging partial per-shard lists). This means

Re: Solr 8.1 issue with collection aliases

2019-05-16 Thread Andrzej Białecki
Yes, I can work on 8.1.1 release - I’ll announce this shortly. > On 16 May 2019, at 13:51, Ishan Chattopadhyaya > wrote: > > Absolutely. This is a critical feature. > Andrzej, would you have time to do a 8.1.1 release? We also need to > coordinate with Jan, since he's doing a 7.7.2 release

Re: The parent shard will never be delete/clean?

2019-01-23 Thread Andrzej Białecki
Solr 7.4.0 added a periodic maintenance task that cleans up old inactive parent shards left after the split. “Old” means 2 days by default. > On 22 Jan 2019, at 15:31, Jason Gerlowski wrote: > > Hi, > > You might want to check out the documentation, which goes over > split-shard in a bit more

Re: SPLITSHARD throwing OutOfMemory Error

2018-10-04 Thread Andrzej Białecki
gt; >>> This threw java.lang.OutOfMemoryError: Java heap space >>> >>> 2. >>> >>> >> http://localhost:8983/solr/admin/collections?collection=testcollection=shard1=SPLITSHARD=1000 >>> >>> Then I ran with async=1000 and checked the status. Every time It's >> creating >>> the sub shards, but not splitting the index. >>> >>> Is there something that I am not doing correctly? >>> >>> Please guide. >>> >>> Thanks, >>> Atita >>> >> — Andrzej Białecki

Re: Solr 7.4.0 - bug in JMX cache stats?

2018-09-18 Thread Andrzej Białecki
size=0, hitratio=0.0, cumulative_lookups=0, cumulative_hitratio=0.0, > warmupTime=0, inserts=0} > > > So the question is - was this intentional change or a bug? > > Thanks, > >Bojan — Andrzej Białecki

Re: Autoscaling and inactive shards

2018-06-18 Thread Andrzej Białecki
>> 18. jun. 2018 kl. 12:12 skrev Andrzej Białecki >> : >> >> If I’m not mistaken the weird accounting of “inactive” shard cores is caused >> also by the fact that individual cores that constitute replicas in the >> inactive shard are still loaded, so they st

Re: Autoscaling and inactive shards

2018-06-18 Thread Andrzej Białecki
If I’m not mistaken the weird accounting of “inactive” shard cores is caused also by the fact that individual cores that constitute replicas in the inactive shard are still loaded, so they still affect the number of active cores. If that’s the case then we should probably fix this to prevent

Re: Expose a metric for percentage-recovered during full recoveries

2018-03-15 Thread Andrzej Białecki
Hi S G, This looks useful, and it should be easy to add to the existing metrics in ReplicationHandler, probably somewhere around ReplicationHandler:856 . > On 14 Mar 2018, at 20:16, S G wrote: > > Hi, > > Solr does full recoveries very frequently - sometimes even

Re: Heads up: SOLR-10130, Performance issue in Solr 6.4.1

2017-02-13 Thread Andrzej Białecki
> On 13 Feb 2017, at 13:46, Ere Maijala wrote: > > Hi all, > > this is just a quick heads-up that we've stumbled on serious performance > issues after upgrading to Solr 6.4.1 apparently due to the new metrics > collection causing a major slowdown. I've filed an issue