Unable to set preferred leader

2020-06-23 Thread Karl Stoney
Hey, We have a SolrCloud collection with 8 replicas, and one of those replicas has the `property.preferredleader: true` set. However when we perform a `REBALANCELEADERS` we get: ``` { "responseHeader": { "status": 0, "QTime": 62268 }, "Summary": { "Failure": "Not all active
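For anyone following along, here is a rough sketch of the two Collections API calls involved: `ADDREPLICAPROP` to mark a replica as `preferredLeader`, then `REBALANCELEADERS` to ask Solr to act on it. The base URL, collection, shard and replica names are placeholders, not values from this thread, and the helper function names are made up for illustration. It only builds the URLs, so nothing is sent to a cluster.

```python
from urllib.parse import urlencode


def collections_api_url(base_url, params):
    """Build a Solr Collections API URL from a dict of query parameters."""
    return f"{base_url.rstrip('/')}/admin/collections?{urlencode(params)}"


def set_preferred_leader_url(base_url, collection, shard, replica):
    """URL that sets preferredLeader=true on one replica (ADDREPLICAPROP)."""
    return collections_api_url(base_url, {
        "action": "ADDREPLICAPROP",
        "collection": collection,
        "shard": shard,
        "replica": replica,  # the core_node name, e.g. core_node2 (placeholder)
        "property": "preferredLeader",
        "property.value": "true",
    })


def rebalance_leaders_url(base_url, collection, max_wait_seconds=60):
    """URL that asks Solr to make preferredLeader replicas the actual leaders."""
    return collections_api_url(base_url, {
        "action": "REBALANCELEADERS",
        "collection": collection,
        "maxWaitSeconds": max_wait_seconds,
    })


if __name__ == "__main__":
    base = "http://localhost:8983/solr"  # placeholder, not the cluster in this thread
    print(set_preferred_leader_url(base, "mycollection", "shard1", "core_node2"))
    print(rebalance_leaders_url(base, "mycollection"))
```

`maxWaitSeconds` defaults to 60 as far as I know, so a `QTime` of roughly 62 seconds may simply mean the call waited out the full window before reporting the failure. That is a guess from the numbers, not something confirmed in the thread.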

Re: collection reload causes a big latency spike

2020-03-03 Thread Karl Stoney
often? Sounds like an XY problem... Best, Erick On Tue, Mar 3, 2020, 07:38 Karl Stoney wrote: > Hi Everyone, > When we use the solr collections API to reload a collection, we get a > large latency spike in requests. I'm surprised by this because when we do > new soft commits, our w

collection reload causes a big latency spike

2020-03-03 Thread Karl Stoney
Hi Everyone, When we use the solr collections API to reload a collection, we get a large latency spike in requests. I'm surprised by this because when we do new soft commits, our warming means they're near enough undetectable. Could anyone confirm if solr collection reload not use the

Re: Async RELOADCOLLECTION never completes

2020-02-15 Thread Karl Stoney
I actually narrowed this down to changing the schema version from 1.5 to 1.6 and then doing a RELOADCOLLECTION; it consistently hangs. Several of our nodes briefly go into a recovering state too. From: Karl Stoney Sent: 13 February 2020 09:49 To: solr-user

Timeout occurred while waiting response from server

2020-02-15 Thread Karl Stoney
Hi Folks, Solr 8.4.1 - we've started doing an expungeDeletes daily at 5am for various reasons but I've also started having these log messages appear in our master from some of the followers. There's no disk io/cpu contention, everything's running fast - so not sure: 1. Why we're getting

Re: Async RELOADCOLLECTION never completes

2020-02-13 Thread Karl Stoney
expired for /overseer/collection-map-failure Which I find interesting, everything (resources wise) is very healthy. On 13/02/2020, 09:34, "Karl Stoney" wrote: Hi, We’re periodically seeing an ASYNC task to RELOADCOLLECTION never complete, it’s just permanently “running”:

Async RELOADCOLLECTION never completes

2020-02-13 Thread Karl Stoney
Hi, We’re periodically seeing an ASYNC task to RELOADCOLLECTION never complete, it’s just permanently “running”: ❯ curl -s http://solr.search-solr.prod.k8.atcloud.io/solr/admin/collections\?action\=REQUESTSTATUS\&requestid\=1581585716 | jq . { "responseHeader": { "status": 0, "QTime": 2 },
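A minimal sketch of polling `REQUESTSTATUS` with a client-side deadline, so that a task stuck in "running" surfaces as an error instead of being waited on forever. The function names, timeout and poll interval are illustrative assumptions. The `fetch`, `sleep` and `clock` arguments are only there so the loop can be exercised without a live cluster.

```python
import json
import time
from urllib.parse import urlencode
from urllib.request import urlopen

# States after which polling can stop; "running" and "submitted" keep polling.
TERMINAL_STATES = ("completed", "failed", "notfound")


def request_status_url(base_url, request_id):
    """Build the REQUESTSTATUS URL for an async Collections API request id."""
    query = urlencode({"action": "REQUESTSTATUS", "requestid": request_id})
    return f"{base_url.rstrip('/')}/admin/collections?{query}"


def fetch_state(url):
    """Fetch the status document and return its state string."""
    with urlopen(url, timeout=10) as resp:
        body = json.load(resp)
    return body["status"]["state"]


def wait_for_async(base_url, request_id, timeout_s=300, poll_s=5,
                   fetch=fetch_state, sleep=time.sleep, clock=time.monotonic):
    """Poll until the task reaches a terminal state or the deadline passes."""
    url = request_status_url(base_url, request_id)
    deadline = clock() + timeout_s
    while True:
        state = fetch(url)
        if state in TERMINAL_STATES:
            return state
        if clock() >= deadline:
            raise TimeoutError(f"request {request_id} still '{state}' after {timeout_s}s")
        sleep(poll_s)
```

This only makes the hang visible to the caller. It does not explain why the task never completes on the Solr side.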

Would changing the schema version from 1.5 to 1.6 require a reindex

2020-02-13 Thread Karl Stoney
Hey, I’m going to bump our schema version from 1.5 to 1.6 to get the implicit useDocValuesAsStored=true, would this require a reindex? Thanks Karl This e-mail is sent on behalf of Auto Trader Group Plc, Registered Office: 1 Tony Wilson Place, Manchester, Lancashire, M15 4FN (Registered in

Re: REINDEXCOLLECTION fatal error in DaemonStream

2020-02-12 Thread Karl Stoney
Hmm, interestingly this happened when I set an `fq` (*,old_version:_version_,old_lmake:L_MAKE,old_lmodel:L_MODEL) which I pulled from our old dataimporthandler. After removing that, it worked fine. From: Karl Stoney Sent: 12 February 2020 17:20 To: solr-user

REINDEXCOLLECTION fatal error in DaemonStream

2020-02-12 Thread Karl Stoney
Hey folks, Trying out the REINDEXCOLLECTION but getting the following error: Anyone seen it before? 17:14:09.610 [DaemonStream-at-uk-002-88-thread-1-processing-n:solr-0.search-solr.dev.k8.atcloud.io:80_solr x:at-uk-001_shard1_replica_n1 c:at-uk-001 s:shard1 r:core_node2] ERROR

Re: Storage/Volume type for Kubernetes Solr POD?

2020-02-11 Thread Karl Stoney
. Is that correct? On Fri, Feb 7, 2020 at 5:22 AM Nicolas PARIS wrote: > hi all > > what about cephfs or lustre distributed filesystem for such purpose ? > > > Karl Stoney writes: > > > we personally run solr on google cloud kubernetes engine and each node > has a

Re: Storage/Volume type for Kubernetes Solr POD?

2020-02-07 Thread Karl Stoney
we personally run solr on google cloud kubernetes engine and each node has a 512 GB persistent SSD (network attached) storage which gives roughly this performance (read / write): sustained random IOPS limit 15,360 / 15,360; sustained throughput limit 245.76 / 245.76 MB/s; and we get very good

Re: DataImportHandler SolrEntityProcessor configuration for local copy

2020-02-06 Thread Karl Stoney
Spoke too soon, looks like it leaks memory. After about 1.3m the old gc times went through the roof and solr was almost unresponsive, had to abort. We're going to write our own implementation to copy data from one core to another that runs outside of solr. On 06/02/2020, 09:57, "Karl S
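For what it's worth, an external copy like this is usually built on cursorMark paging. Below is a minimal sketch of that idea, not the implementation that was actually written. `search_source` and `add_to_target` are hypothetical callables that would wrap the source collection's `/select` handler and the target's `/update` handler, and the uniqueKey name `id` is an assumption.

```python
def iter_all_docs(search, unique_key="id", rows=1000):
    """Yield every document using cursorMark deep paging.

    `search` takes a dict of query params and returns the parsed JSON
    response. cursorMark needs a sort that includes the uniqueKey field.
    """
    cursor = "*"
    while True:
        response = search({
            "q": "*:*",
            "sort": f"{unique_key} asc",
            "rows": rows,
            "cursorMark": cursor,
        })
        yield from response["response"]["docs"]
        next_cursor = response["nextCursorMark"]
        if next_cursor == cursor:  # unchanged cursor means no more results
            return
        cursor = next_cursor


def copy_collection(search_source, add_to_target, unique_key="id",
                    rows=1000, batch_size=1000):
    """Stream docs from the source and send them to the target in batches."""
    batch, copied = [], 0
    for doc in iter_all_docs(search_source, unique_key, rows):
        # Drop _version_ so the target does not apply optimistic-concurrency
        # checks against a version it has never seen.
        doc.pop("_version_", None)
        batch.append(doc)
        copied += 1
        if len(batch) >= batch_size:
            add_to_target(batch)
            batch = []
    if batch:
        add_to_target(batch)
    return copied
```

Only stored fields (or docValues returned as stored) come back from `/select`, so this kind of copy is only faithful if the schema stores everything that needs reindexing.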

Re: DataImportHandler SolrEntityProcessor configuration for local copy

2020-02-06 Thread Karl Stoney
cursorMark="true" Good luck. On Wed, Feb 5, 2020 at 10:06 PM Karl Stoney wrote: > He

DataImportHandler SolrEntityProcessor configuration for local copy

2020-02-05 Thread Karl Stoney
Hey All, I'm trying to implement a simplistic reindex strategy to copy all of the data out of one collection, into another, on a single node (no distributed queries). It's approx 4 million documents, with an index size of 26gig. Based on your experience, I'm wondering what people feel sensible

Re: Solr Cloud on Docker?

2020-02-05 Thread Karl Stoney
Nothing much to add to the below apart from we also successfully run solr on kubernetes. It took some implementation effort but we're now at a point where we can do `kubectl scale --replicas=x statefulset/solr` and increase capacity in minutes with solr's autoscaling taking care of the new

ID is a required field in SolrSchema . But not found in DataConfig

2020-02-04 Thread Karl Stoney
Hey all, I'm trying to use the DIH to copy from one collection to another, it appears to work (data gets copied) however I've noticed this in the logs: 17:39:58.167 [qtp1472216456-87] INFO org.apache.solr.handler.dataimport.config.DIHConfiguration - ID is a required field in SolrSchema . But

Re: NRT Real time Get with documentCache

2020-02-03 Thread Karl Stoney
Great stuff thank you Erick On 04/02/2020, 00:17, "Erick Erickson" wrote: The documentCache shouldn’t matter at all. RTG should return the latest doc by maintaining a pointer into the tlogs and returning that version. > On Feb 3, 2020, at 6:43 PM, Karl Stoney wrote:

NRT Real time Get with documentCache

2020-02-03 Thread Karl Stoney
Hi, Could anyone let me know if a real time get would return a cached, up to date version of a document if we enabled documentCache? Thanks Karl

Connection spike when slight solr latency spike

2020-02-03 Thread Karl Stoney
Hey all, When our searcher refreshes on a soft-commit, we get a slight latency spike (p99 response times can jump to about 200ms from 100ms), however what we see in the upstream clients using org.apache.solr.client.solrj SolrClient is a big spike in outbound connections (70-80 per client,

Re: G1GC Pauses (Young Gen)

2020-02-02 Thread Karl Stoney
? From: Karl Stoney Sent: 01 February 2020 16:13 To: solr-user@lucene.apache.org Subject: G1GC Pauses (Young Gen) Hey all, me again. I'm still investigating the pauses that I get when a soft commit happens. I'm now convinced they're coming from G1GC pauses that happen

G1GC Pauses (Young Gen)

2020-02-01 Thread Karl Stoney
Hey all, me again. I'm still investigating the pauses that I get when a soft commit happens. I'm now convinced they're coming from G1GC pauses that happen when the soft commit happens and wondering if anyone can see what's up. Caveat: I'm no JVM expert. I've uploaded a small time window to

Re: Shards.preference to current leader

2020-01-31 Thread Karl Stoney
This is one of those things I’d recommend you prove is something users actually could notice before spending any time trying to implement, it strikes me as a red herring. Best, Erick > On Jan 30, 2020, at 6:16 PM, Karl Stoney wrote: > > Hey all, > Is it pos

Shards.preference to current leader

2020-01-30 Thread Karl Stoney
Hey all, Is it possible to perform a get request which favours the current leader, therefore guaranteeing the most up to date record? I was looking at shards.preference but couldn’t see a way to prefer the leader.

Re: Replica type affinity

2020-01-30 Thread Karl Stoney
Other commands that can be useful are REPLACENODE, MOVEREPLICA. Edward On Thu, Jan 30, 2020 at 1:00 PM Karl Stoney wrote: > Hey everyone, > Does anyone know of a way to have solr replicas assigned to specific node

Re: Performance Issue since Solr 7.7 with wt=javabin

2020-01-30 Thread Karl Stoney
From: Karl Stoney Sent: Thursday, January 30, 2020 3:56:31 PM To: solr-user@lucene.apache.org Subject: Re: Performance Issue since Solr 7.7 with wt=javabin I don’t have confidence in my ability to do that, I was hoping someone could help out as moving to 8.4 is to

Replica type affinity

2020-01-30 Thread Karl Stoney
Hey everyone, Does anyone know of a way to have solr replicas assigned to specific nodes by some sort of identifying value (in solrcloud)? In summary I’m trying to have some read-only replicas only ever be assigned to nodes named “solr-ephemeral-x” and my nrt and masters assigned to

Re: Performance Issue since Solr 7.7 with wt=javabin

2020-01-30 Thread Karl Stoney
ary 30, 2020 2:23:40 PM To: solr-user Subject: Re: Performance Issue since Solr 7.7 with wt=javabin No further releases are planned for 7.x, so your best bet is to patch branch_7_7 yourself and build a custom Solr version. Jan > 29. jan. 2020 kl. 20:54 skrev Karl Stoney > : > > Co

Re: Solr Searcher 100% Latency Spike

2020-01-30 Thread Karl Stoney
d of spike worth? You certainly see it with monitoring tools, but do users notice at all? If not, I wouldn’t spend all that much effort pursuing it… Best, Erick > On Jan 29, 2020, at 4:48 PM, Karl Stoney wrote: > > So interestingly tweaking my filter cache i've got the

Re: Solr Searcher 100% Latency Spike

2020-01-29 Thread Karl Stoney
) > On Jan 29, 2020, at 1:21 PM, Karl Stoney > wrote: > > Out of curiosity, could you define "fast"? > I'm wondering what sort of figures people target their searcher warm time at > > From: Walter Underwood > Sent: 29 January 2020 21:13

Re: Solr Searcher 100% Latency Spike

2020-01-29 Thread Karl Stoney
(my blog) > On Jan 29, 2020, at 1:01 PM, Shawn Heisey wrote: > > On 1/29/2020 12:44 PM, Karl Stoney wrote: >> Looking for a bit of support here. When we soft

Re: Solr Searcher 100% Latency Spike

2020-01-29 Thread Karl Stoney
above 90, which should reduce some memory pressure at least. Thanks Karl From: Shawn Heisey Sent: 29 January 2020 21:01 To: solr-user@lucene.apache.org Subject: Re: Solr Searcher 100% Latency Spike On 1/29/2020 12:44 PM, Karl Stoney wrote: > Looking for a

Re: Performance Issue since Solr 7.7 with wt=javabin

2020-01-29 Thread Karl Stoney
Could anyone produce a patch for 7.7 please? From: Florent Sithi Sent: 29 January 2020 14:34 To: solr-user@lucene.apache.org Subject: Re: Performance Issue since Solr 7.7 with wt=javabin yes thanks so much, fixed in 8.4.0 -- Sent from:

Solr Searcher 100% Latency Spike

2020-01-29 Thread Karl Stoney
Hi All, Looking for a bit of support here. When we soft commit (every 10 minutes), we get a latency spike that means response times for solr roughly double, as you can see in this screenshot: [cid:ed9fa791-0776-43fc-8f22-d8a568f5c084] These do correlate to GC spikes (albeit not

Unable to start solr-exporter on branch_7x

2019-04-10 Thread Karl Stoney
Hi, I’m getting the following error when trying to start `solr-exporter` on branch `7_x`. INFO - 2019-04-10 23:36:10.872; org.apache.solr.core.SolrResourceLoader; solr home defaulted to 'solr/' (could not find system property or JNDI) Exception in thread "main" java.lang.NoClassDefFoundError:

Errors during solrcloud replication (7.7.x)

2019-03-01 Thread Karl Stoney
Hey all, I’m looking for some support with replication errors we’re seeing in SolrCloud 7.7.x (tried both .0 and .1). I’ve created a StackOverflow issue: We have errors in SolrCloud (7.7.1) during replication, which we can't understand. We thought it may be