Re: Solr Cloud existing shards down after enabling SSL

2019-02-11 Thread Zheng Lin Edwin Yeo
When you generate the keystore, did you include the IP address of both servers? Regards, Edwin On Mon, 11 Feb 2019 at 21:06, Rakesh Enjala wrote: > Please help > *Regards,* > *Rakesh Enjala* > > > On Wed, Feb 6, 2019 at 2:59 PM Rakesh Enjala > wrote: > > > Hi, > > > > We have a solr cloud

[search > edismax] compound words different result issue

2019-02-11 Thread 유정인
Hi I use 'edismax'. Our main language uses compound words. There is an issue here. For example, assume that 'ab' => 'a' and 'b' are analyzed. The results are different when searching with 'ab' and 'a b'. I want to get the same result as searching 'a b' when searching 'ab'. Is there a

Re: Document Score seen in debug section and in main results section dont match

2019-02-11 Thread Baloo
Thanks Erick to answer your question "What is "Y"?" Score that we see in debug section actually looks correct and if we order documents by that score we can get similar ranking of results that we were getting for solr 6.4.2. But With the score field that we get with each record it looks like

Re: Relevancy Score Calculation

2019-02-11 Thread Ashish Bisht
Thanks.I Agree. Regards Ashish -- Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html

Re: Document Score seen in debug section and in main results section dont match

2019-02-11 Thread Erick Erickson
No workarounds that I know of, but I have to ask: "Why do you care?". This feels like an XY problem. You are saying that "X" doesn't work, in this case the scores are different in the debug section. But this implies that there is a problem "Y" that you're having. What is "Y"? What is the problem

Re: Full index replication upon service restart

2019-02-11 Thread Erick Erickson
bq. To answer your question about index size on disk, it is 3 TB on every node. As mentioned it's a 32 GB machine and I allocated 24GB to Java heap. This is massively undersized in terms of RAM in my experience. You're trying to cram 3TB of index into 32GB of memory. Frankly, I don't think

Re: Load balance writes

2019-02-11 Thread Boban Acimovic
OK, thank you guys :) Regards, Boban

Re: Full index replication upon service restart

2019-02-11 Thread Rahul Goswami
Thanks for the response Eric. To answer your question about index size on disk, it is 3 TB on every node. As mentioned it's a 32 GB machine and I allocated 24GB to Java heap. Further monitoring the recovery, I see that when the follower node is recovering, the leader node (which is NOT

Delete by id

2019-02-11 Thread Dwane Hall
Hey Solr community, I’m having an issue deleting documents from my Solr index and am seeking some community advice when somebody gets a spare minute. It seems really like a really simple problem …a requirement to delete a document by its id. Here’s how my documents are mapped in solr DOC_ID

Re: Load balance writes

2019-02-11 Thread Jason Gerlowski
> On the other hand, the CloudSolrClient ignores errors from Solr, which makes > it unacceptable for production use. Did you mean "ConcurrentUpdateSolrClient"? I don't think CloudSolrClient does this, though I've been surprised before and possible I just missed something. Just wondering.

Re: Load balance writes

2019-02-11 Thread Walter Underwood
The update router would also need to look for failures indexing at each leader, then re-read the cluster state to see if the leader had changed. Also re-send any failed updates, and so on. wunder Walter Underwood wun...@wunderwood.org http://observer.wunderwood.org/ (my blog) > On Feb 11,

Re: Load balance writes

2019-02-11 Thread lstusr 5u93n4
Hi Boban, First of all: I agree with Walter here. Because the bottleneck is during indexing on the leader, a basic round robin load balancer will perform just as well as a custom solution. With far less headache. A custom solution will be far more work than it's worth. But, should you really

Re: Load balance writes

2019-02-11 Thread Walter Underwood
For the fourth time, ignore the shard leaders until you have measurements that prove the complexity is worth it. We can index a million documents per minute by sending batched updates to a dumb load balancer. wunder Walter Underwood wun...@wunderwood.org http://observer.wunderwood.org/ (my

Re: Load balance writes

2019-02-11 Thread Boban Acimovic
Like I said before, nginx is not a load balancer or at least not a clever load balancer. It does not talk to ZK. Please give me advanced solutions. > On 11. Feb 2019, at 18:32, Walter Underwood wrote: > > I haven’t used Kubernetes, but a web search for “helm nginx” seems to give > some

Re: Load balance writes

2019-02-11 Thread Boban Acimovic
But like I said in the previous message, nginx is not aware of the status of Solr nodes. I can easily write Go load balancer but not considering the shards. The only problem I have here is how to figure out which shard master is responsible of a document I want to insert to the index. How does

Re: Load balance writes

2019-02-11 Thread Boban Acimovic
This is naive load balancing because it is not aware of ZK. > On 11. Feb 2019, at 18:05, Walter Underwood wrote: > > nginx > > http://nginx.org/en/docs/http/load_balancing.html > https://hub.docker.com/_/nginx > > We run in Amazon AWS, so we use their Application Load Balaner (ALB). We do

Re: Load balance writes

2019-02-11 Thread Walter Underwood
I haven’t used Kubernetes, but a web search for “helm nginx” seems to give some useful pages. wunder Walter Underwood wun...@wunderwood.org http://observer.wunderwood.org/ (my blog) > On Feb 11, 2019, at 9:13 AM, Davis, Daniel (NIH/NLM) [C] > wrote: > > I think that the container

RE: Load balance writes

2019-02-11 Thread Davis, Daniel (NIH/NLM) [C]
I think that the container orchestration framework takes care of that for you, but I am not an expert. In Kubernetes, NGINX is often the Ingress controller, and as long as the services are running within the Kubernetes cluster, it can also serve as a load balancer, AFAICT. In Kubernetes, a

Re: Load balance writes

2019-02-11 Thread Walter Underwood
nginx http://nginx.org/en/docs/http/load_balancing.html https://hub.docker.com/_/nginx We run in Amazon AWS, so we use their Application Load Balaner (ALB). We do use nginx for other things. wunder Walter Underwood wun...@wunderwood.org http://observer.wunderwood.org/ (my blog) > On Feb 11,

Re: Load balance writes

2019-02-11 Thread Boban Acimovic
Can you mention one dockerized load balancer? Or even better one with Helm chart? Like I said, I send all updates at the moment just to one out of 12 nodes. > On 11. Feb 2019, at 17:52, Walter Underwood wrote: > > Why would you want to write a load balancer when there are so many that are

Re: Load balance writes

2019-02-11 Thread Walter Underwood
Why would you want to write a load balancer when there are so many that are free and very fast? For update traffic, there is very little benefit in sending updates directly to the shard leader. Forwarding an update to the leader is fast. Indexing is slow. So the bottleneck is always at the

Createsnapshot null pointer exception

2019-02-11 Thread SOLR4189
Hi all, I use SOLR-6.5.1. When I run this command: *http://my_server_name:8983/solr/admin/collections?action=CREATESNAPSHOT=collection_name=MYCommit* I got this exception: Collection: collection_name operation: createsnapshot failed: java.lang.NullPointerException at

Re: Load balance writes

2019-02-11 Thread Boban Acimovic
I would actually like to write a load balancer itself, but I want it to be able to send the data as efficiently as possible. I know how to read ZK data, but I don’t know how can I figure out which shard is responsible upon data that I have in a document that I want to index. > On 11. Feb

Re: Load balance writes

2019-02-11 Thread Boban Acimovic
Thank you again Emir. I can make my code ZK aware, that is no problem, but I can’t make it shard leader aware. Can you point me to a document how are Solr shards created? I already use ZK to get stuff, but I don’ t understand how to distinguish between shards from information I can get from a

Re: Load balance writes

2019-02-11 Thread Walter Underwood
We send all updates to the load balancer, so they’ll end up on the wrong shard, not on the leader, etc. Indexing speed is still limited by the CPU available on each leader. I don’t think that sending the update to the right leader makes any improvement in throughput. On the other hand, the

Re: Load balance writes

2019-02-11 Thread Emir Arnautović
Hi Boban, Not sure if there is Solrj port to Go, but you can take that as model to build your ZK aware client that groups and sends updates to shard leaders. I see that there are couple of Solr Go clients, so you might first check if some already supports it or if it makes sense that you

Re: RegexReplaceProcessorFactory pattern to detect multiple \n

2019-02-11 Thread Zheng Lin Edwin Yeo
Hi, Should we report this as a bug in Solr? Regards, Edwin On Fri, 8 Feb 2019 at 22:18, Zheng Lin Edwin Yeo wrote: > Hi Paul, > > Regarding the regex (\n\s*){2,} that we are using, when we try in on > https://regex101.com/, it is able to give us the correct result for all > the examples (ie:

Re: Load balance writes

2019-02-11 Thread Boban Acimovic
Thank you Emir for quick reply. I use home brewed Go client and write just to one of 12 available nodes. I believe I should find out this smart way to handle this :) > On 11. Feb 2019, at 15:21, Emir Arnautović > wrote: > > Hi Boban, > If you use SolrCloud Solrj client and initialise it

[ANNOUNCE] Luke 7.7.0 released

2019-02-11 Thread Tomoko Uchida
Hi, Luke 7.7.0 is out. Zip archive can be downloaded at here: https://github.com/DmitryKey/luke/releases/tag/luke-swing-7.7.0 In this release, - Lucene version was upgraded to 7.7.0. - Some trivial UI bugs were fixed. Regards, Tomoko

RE: Re: Enable SSL for the existing SOLR Cloud Cluster

2019-02-11 Thread rakesh.enjala
Hi, I am facing the same issue. If u get any solution please post -- Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html

Re: Load balance writes

2019-02-11 Thread Emir Arnautović
Hi Boban, If you use SolrCloud Solrj client and initialise it with ZK, it should be aware of masters and send documents in a smart way. HTH, Emir -- Monitoring - Log Management - Alerting - Anomaly Detection Solr & Elasticsearch Consulting Support Training - http://sematext.com/ > On 11 Feb

Document Score seen in debug section and in main results section dont match

2019-02-11 Thread Baloo
HI All, Currently I am migrating Solr 6.4.2 to Solr 7.4. We pass multiple boost queries (multiplicative boost queries, Solr's 'boost' parameter) to Solr with each query. We have migrated all our custom components and solr configurations to Solr 7.4.2 but during verification we have seen

Re: Solr Cloud existing shards down after enabling SSL

2019-02-11 Thread Rakesh Enjala
Please help *Regards,* *Rakesh Enjala* On Wed, Feb 6, 2019 at 2:59 PM Rakesh Enjala wrote: > Hi, > > We have a solr cloud with 4 nodes installed in two different servers( 1 > on on server and 3 on other server)and a collection with data in 4 shards. > We have enabled SSL for solrcloud by

Load balance writes

2019-02-11 Thread Boban Acimovic
I am wondering would I get performance benefits if I distribute writes to Solr nodes by sending documents exactly to the master of collection where the document belongs? My idea is that this would save some load between the cluster nodes and improve performances. How to do writes in the best

Re: Ignore accent in a request

2019-02-11 Thread elisabeth benoit
Thanks for the hint. We've been using the char filter for full unidecode normalization. Is the ICUFoldingFilter supposed to be faster? Or just simpler to use? Le lun. 11 févr. 2019 à 09:58, Ere Maijala a écrit : > Please note that mapping characters works well for a small set of > characters,

[ANNOUNCE] Apache Solr 7.7.0 released

2019-02-11 Thread jim ferenczi
11 February 2019, Apache Solr™ 7.7.0 available The Lucene PMC is pleased to announce the release of Apache Solr 7.7.0 Solr is the popular, blazing fast, open source NoSQL search platform from the Apache Lucene project. Its major features include powerful full-text search, hit highlighting,

Re: Ignore accent in a request

2019-02-11 Thread Ere Maijala
Please note that mapping characters works well for a small set of characters, but if you want full UNICODE normalization, take a look at the ICUFoldingFilter: https://lucene.apache.org/solr/guide/6_6/filter-descriptions.html#FilterDescriptions-ICUFoldingFilter --Ere elisabeth benoit kirjoitti

Re: Unable to create collection with custom queryParser Plugin

2019-02-11 Thread Aroop Ganguly
Thanks Erick, Jörn danke sehr. tldr; gradle trickery and thriftiness helped here. detail: To make things easier, for our deployment systems, I created a plugin gradle task which is economical and yet brings in the right number of jars. in my case scala-lang jars were required, everything else