Re: Question about "No registered leader" error

2019-09-19 Thread Hongxu Ma
@Shawn @Erick Thanks for your kind help! There is no OOM log, and I confirm that no OOM happened. My ZK tickTime is set to 5000, so 5000*20 = 100s > 60s. I also checked the Solr code: the leader waiting time (4000ms) is a constant and is not configurable. (Why isn't it a configurable param?) My
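The arithmetic in the message above can be sketched as follows. This is only an illustration, assuming ZooKeeper's default bounds (minimum session timeout of 2 * tickTime, maximum of 20 * tickTime) and assuming the 60s figure is the session timeout requested by the client; the function name is hypothetical.

```python
def negotiated_timeout_ms(requested_ms, tick_time_ms):
    """Clamp a requested session timeout to ZooKeeper's default bounds.

    Assumption: the server uses the defaults, i.e. the minimum is
    2 * tickTime and the maximum is 20 * tickTime.
    """
    min_ms = 2 * tick_time_ms
    max_ms = 20 * tick_time_ms
    return max(min_ms, min(requested_ms, max_ms))


# With tickTime = 5000 the ceiling is 5000 * 20 = 100000 ms (100s),
# so a 60s request fits under it and is not reduced.
timeout = negotiated_timeout_ms(60_000, 5000)
```

With a smaller tickTime such as 2000, the same 60s request would be capped at 40s, which is why the 5000*20 check matters.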

Re: Unsubscribe please

2019-09-19 Thread Gora Mohanty
Hi, Please see https://lucene.apache.org/solr/community.html#mailing-lists-irc . In order to unsubscribe, please send mail to solr-user-unsubscribe Regards, Gora >

fq * vs [* TO *]

2019-09-19 Thread Vincenzo D'Amore
Hi all, talking about how to write Solr queries, I was investigating whether there is a performance difference between these two filter queries: field:[* TO *] or field:* In other words: q=*:*&fq=field:[* TO *]&rows=0 q=*:*&fq=field:*&rows=0 Could someone enlighten me? -- Vincenzo D'Amore
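For anyone who wants to time the two variants, here is a minimal sketch that builds both request URLs. The host, collection name and helper names are hypothetical, and the parameter names fq and rows are an assumption about what the archive stripped from the queries quoted above.

```python
from urllib.parse import parse_qs, urlencode, urlsplit

# Hypothetical endpoint; replace with a real host and collection.
BASE = "http://localhost:8983/solr/mycollection/select"


def filter_query_url(base_url, fq):
    """Build a match-all query with one filter query and no returned rows."""
    params = {"q": "*:*", "fq": fq, "rows": 0}
    return base_url + "?" + urlencode(params)


def query_params(url):
    """Decode the query string again, handy for checking the escaping."""
    return parse_qs(urlsplit(url).query)


range_url = filter_query_url(BASE, "field:[* TO *]")
wildcard_url = filter_query_url(BASE, "field:*")
```

Each URL can then be fetched repeatedly with the filter cache disabled or cleared, otherwise the second and later runs mostly measure the cache.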

Re: DIH: Create Child Documents in ScriptTransformer

2019-09-19 Thread Mikhail Khludnev
Hello, Jörn. Have you tried to find a parent doc in the context which is passed as a second argument into ScriptTransformer? On Wed, Sep 18, 2019 at 9:56 PM Jörn Franke wrote: > > Hi, > > I load a set of documents. Based on these documents some logic needs to be > applied to split them into

Unsubscribe please

2019-09-19 Thread Smida Mahdi
Unsubscribe please : m.sm...@brgm.fr Mahdi SMIDA DISN / ISE Tél : 02.38.64.35.38

Re: fq * vs [* TO *]

2019-09-19 Thread Mikhail Khludnev
Hello, Vincenzo. I traced both code paths; they are different. It's hard to predict the difference between them. A thorough microbenchmark could probably show a twofold difference or so, but I don't think it's significant for practical usage. On Thu, Sep 19, 2019 at 10:23 AM Vincenzo D'Amore wrote: >

Re: DIH: Create Child Documents in ScriptTransformer

2019-09-19 Thread Jörn Franke
I fully agree. However, I am just curious to see the limits. > On 18.09.2019 at 23:33, Erick Erickson wrote: > > When it starts getting complex, I usually move to SolrJ. You say > you're loading documents, so I assume Tika is in the mix too. > > Here's a blog on the topic so you can see how to

Re: Solr query fetching different results

2019-09-19 Thread Erick Erickson
Multiple replicas of the same shard will execute their autocommits at different wall clock times. Thus there may be a _temporary_ period when a newly-indexed document is found by a query that happens to get served by replica1 but not by replica2. If you have a timestamp in the doc, and a soft commit

Re: Custom update processor not kicking in

2019-09-19 Thread Erick Erickson
_Why_ is reindexing not an option? 200M docs isn't that many. Since you have Atomic updates working, you could easily write a little program that pulled the docs from your existing collection and pushed them to a new one with the new schema. Do use CursorMark if you try that. You have to be
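A rough sketch of the CursorMark loop suggested above. It relies only on the documented protocol: start with cursorMark=*, sort on something that ends with the uniqueKey field, and stop once nextCursorMark equals the cursor that was sent. The names iter_all_docs and make_fake_fetch are hypothetical, and the fake fetcher just stands in for a real HTTP call so the loop can be tried without a Solr instance.

```python
def iter_all_docs(fetch_page):
    """Yield every document by following the cursorMark protocol.

    fetch_page(cursor_mark) is a caller-supplied function that sends the
    query with the given cursorMark (and a sort ending on the uniqueKey
    field) and returns the parsed JSON response as a dict.
    """
    cursor = "*"  # the first request always uses the literal "*"
    while True:
        page = fetch_page(cursor)
        yield from page["response"]["docs"]
        next_cursor = page["nextCursorMark"]
        if next_cursor == cursor:  # unchanged cursor: nothing left to read
            break
        cursor = next_cursor


def make_fake_fetch(docs, rows):
    """In-memory stand-in for Solr that uses list offsets as cursor marks."""
    def fetch_page(cursor):
        start = 0 if cursor == "*" else int(cursor)
        chunk = docs[start:start + rows]
        # Like Solr, hand back the same cursor when the page is empty.
        next_cursor = str(start + len(chunk)) if chunk else cursor
        return {"response": {"docs": chunk}, "nextCursorMark": next_cursor}
    return fetch_page


source_docs = [{"id": str(i)} for i in range(5)]
copied = list(iter_all_docs(make_fake_fetch(source_docs, rows=2)))
```

In a real reindexing program each yielded document would be sent to the new collection in batches rather than collected into a list.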

Re: fq * vs [* TO *]

2019-09-19 Thread Vincenzo D'Amore
Thanks Mikhail, could you please share the code paths you found? On Thu, Sep 19, 2019 at 10:57 AM Mikhail Khludnev wrote: > Hello, Vincenzo. > I traced both code paths; they are different. It's hard to predict the > difference between them. Probably some thorough microbenchmark can show two >

Re: Solr query fetching different results

2019-09-19 Thread Ramsey Haddad (BLOOMBERG/ LONDON)
Your query seems simple enough that this may not be your issue, but just mentioning it: Your collection has 1 shard. Depending on how the query is sent, queries to 1 shard collections can sometimes get interpreted as a "distributed query" and sometimes as a "non-distributed query". These have

Custom auth plugin for SolrCloud

2019-09-19 Thread Zubovich Yauheni
Hi, I have a very simple task: we need to deny access to Solr if a request doesn't have a specific header. Solr 7.3 is running in cloud mode. I implemented a custom auth plugin: package com.custom.solr.core; import org.apache.solr.security.AuthenticationPlugin; public class

Re: fq * vs [* TO *]

2019-09-19 Thread Mikhail Khludnev
https://github.com/apache/lucene-solr/blob/master/solr/core/src/java/org/apache/solr/parser/SolrQueryParserBase.java#L1234 https://github.com/apache/lucene-solr/blob/master/solr/core/src/java/org/apache/solr/parser/SolrQueryParserBase.java#L1184 On Thu, Sep 19, 2019 at 1:10 PM Vincenzo D'Amore

Re: Custom auth plugin for SolrCloud

2019-09-19 Thread Shawn Heisey
On 9/19/2019 6:18 AM, Zubovich Yauheni wrote: This class is wrapped into jar. Jar added to server lib directory and defined at solrconfig.xml: Where exactly is this "server lib" directory that you describe? If it's one of the locations that gets loaded automatically, you should NOT be

Reloading after creating a collection

2019-09-19 Thread Arnold Bronley
Hi, Why is it that I need to reload collection after I created it? CDCR runs into issues if I do not do this.

Re: Solr behaves wonky when zookeeper quorum is messed up.

2019-09-19 Thread Shawn Heisey
On 9/19/2019 9:22 AM, harjagsbby wrote: In our PROD Solr cluster (7.6, ZK 3.4.9), when the ZooKeeper leader fails, ZooKeeper enters an infinite leader election loop, which makes Solr unstable. Solr fails to index (as expected, with error "Remote error message: Cannot talk to ZooKeeper - Updates are

Re: Reloading after creating a collection

2019-09-19 Thread Shawn Heisey
On 9/19/2019 9:36 AM, Arnold Bronley wrote: Why is it that I need to reload collection after I created it? CDCR runs into issues if I do not do this. If the config doesn't change after creation, I would not expect that to be required. If you do change the config to enable CDCR after the

Solr behaves wonky when zookeeper quorum is messed up.

2019-09-19 Thread harjagsbby
In our PROD Solr cluster (7.6, ZK 3.4.9), when the ZooKeeper leader fails, ZooKeeper enters an infinite leader election loop, which makes Solr unstable. Solr fails to index (as expected, with error "Remote error message: Cannot talk to ZooKeeper - Updates are disabled") and CPU spikes. This is a know

Re: Solr behaves wonky when zookeeper quorum is messed up.

2019-09-19 Thread harjagsbby
If ZK loses quorum, Solr goes read-only. That's how it's designed to work. I don't know of any workaround for that. That makes sense. Is the CPU spike in Solr caused by Solr's indexing calls holding threads because, from Solr's point of view, ZooKeeper is down? -- Sent from:

Re: Custom update processor not kicking in

2019-09-19 Thread Rahul Goswami
Erick, the 200 million docs are all large because their content is indexed. Also, it would be hard to convince the customer to rebuild their index. But more than that, I also want to clarify my understanding of this topic and know whether it's expected behaviour for a distributed update processor to not call

Re: Reloading after creating a collection

2019-09-19 Thread Arnold Bronley
Hi, I am not changing the config to enable CDCR. I am just using the CDCR API to start it. Does that count as changing configuration? On Thu, Sep 19, 2019 at 12:20 PM Shawn Heisey wrote: > On 9/19/2019 9:36 AM, Arnold Bronley wrote: > > Why is it that I need to reload collection after I

Re: Reloading after creating a collection

2019-09-19 Thread Shawn Heisey
On 9/19/2019 12:09 PM, Arnold Bronley wrote: I am not changing the config to enable CDCR. I am just using the CDCR API to start it. Does that count as changing configuration? I would guess that there are no changes to the config from using the API, but I also admit that I have never tried

Re: DIH: Create Child Documents in ScriptTransformer

2019-09-19 Thread Jörn Franke
Hi, thanks for all the feedback. The context parameter in the ScriptTransformer is new to me - thanks for this insight. I could not find it in any docs. So just for people that also did not know it: you can have the ScriptTransformer with 2 parameters, e.g. function mytransformer(row,context){

Re: Sample JWT Solr configuration

2019-09-19 Thread Tyrone Tse
I finally got JWT Authentication working on Solr 8.1.1. This is my security.json file contents { "authentication":{ "class":"solr.JWTAuthPlugin", "jwk":{ "kty":"oct", "use":"sig", "kid":"k1",

Re: Sample JWT Solr configuration

2019-09-19 Thread Dave
I know this has nothing to do with the issue at hand but if you have a public facing solr instance you have much bigger issues. > On Sep 19, 2019, at 10:16 PM, Tyrone Tse wrote: > > I finally got JWT Authentication working on Solr 8.1.1. > This is my security.json file contents > { >

How do I index PDF and Word Doc

2019-09-19 Thread PasLe Choix
I am on Solr 7.7. According to the official documentation, https://lucene.apache.org/solr/guide/7_7/solr-tutorial.html, the Post Tool can index a directory of files and can handle HTML, PDF, and Office formats like Word; however, no working example command is given. ./bin/post -c

Re: fq * vs [* TO *]

2019-09-19 Thread Shawn Heisey
On 9/19/2019 1:23 AM, Vincenzo D'Amore wrote: talking about how to write solr queries I was investigating if there is a difference of performance in these two filter queries: field:[* TO *] or field:* In other words: q=*:*&fq=field:[* TO *]&rows=0 q=*:*&fq=field:*&rows=0 The first one is a range query,

Re: Sample JWT Solr configuration

2019-09-19 Thread Tyrone
Of course the secret key is just for my local development solr instance  > On Sep 19, 2019, at 10:35 PM, Dave wrote: > > I know this has nothing to do with the issue at hand but if you have a public > facing solr instance you have much bigger issues. > >> On Sep 19, 2019, at 10:16 PM,

Re: fq * vs [* TO *]

2019-09-19 Thread Vincenzo D'Amore
Hi Shawn, Mikhail, thanks for the feedback. Really appreciate it. Best regards, Vincenzo On Thu, Sep 19, 2019 at 3:55 PM Shawn Heisey wrote: > On 9/19/2019 1:23 AM, Vincenzo D'Amore wrote: > > talking about how to write solr queries I was investigating if there is a > > difference of