Re: SolrCloud Replication Failure

2018-10-31 Thread Kevin Risden
I haven't dug into why this is happening but it definitely reproduces. I removed the local requirements (port mapping and such) from the gist you posted (very helpful). I confirmed this fails locally and on Travis CI. https://github.com/risdenk/test-solr-start-stop-replica-consistency I don't

Re: SolrCloud Replication Failure

2018-10-31 Thread Jeremy Smith
Thanks Erick, this is 7.5.0. From: Erick Erickson Sent: Wednesday, October 31, 2018 8:20:18 PM To: solr-user Subject: Re: SolrCloud Replication Failure What version of solr? This code was pretty much rewriten in 7.3 IIRC On Wed, Oct 31, 2018, 10:47 Jeremy Smith

Re: hdfs - documents missing after hard poweroff

2018-10-31 Thread Kevin Risden
Also do you have auto add replicas turned on for these collections over HDFS? Kevin Risden On Wed, Oct 31, 2018 at 8:20 PM Kevin Risden wrote: > So I'm definitely curious what is going on here. > > Are you still able to reproduce this? Can you check if files have been > modified on HDFS? I'd

Re: hdfs - documents missing after hard poweroff

2018-10-31 Thread Kevin Risden
So I'm definitely curious what is going on here. Are you still able to reproduce this? Can you check if files have been modified on HDFS? I'd be curious if tlogs or the index is changing underneath for the different restarts. Since there is no new indexing I would guess not but something to

Re: SolrCloud Replication Failure

2018-10-31 Thread Erick Erickson
What version of solr? This code was pretty much rewriten in 7.3 IIRC On Wed, Oct 31, 2018, 10:47 Jeremy Smith Hi all, > > We are currently running a moderately large instance of standalone > solr and are preparing to switch to solr cloud to help us scale up. I have > been running a number

RE: Overseer could not get tags

2018-10-31 Thread Vadim Ivanov
Hi, Chris I had the same messages in solr log while testing 7.4 and 7.5 The only remedy I've found - increasing header size: /opt/solr/server/etc/jetty.xml After solr restart - no more annoying messages > -Original Message- > From: Chris Ulicny [mailto:culicny@iq.media] > Sent:

RE: Odd Scoring behavior

2018-10-31 Thread Webster Homer
The KeywordRepeat and RemoveDuplicates were added to support better wildcard matching. Removing the duplicates just removes those terms that weren't stemmed. This seems like a subtle bug to me -Original Message- From: Markus Jelsma [mailto:markus.jel...@openindex.io] Sent: Tuesday,

SolrCloud Replication Failure

2018-10-31 Thread Jeremy Smith
Hi all, We are currently running a moderately large instance of standalone solr and are preparing to switch to solr cloud to help us scale up. I have been running a number of tests using docker locally and ran into an issue where replication is consistently failing. I have pared down

Re: Overseer could not get tags

2018-10-31 Thread Chris Ulicny
I've managed to replicate this issue with the 7.5.0 release as well by starting up a single instance of solr in cloud mode (on windows) and uploading the security.json file below to it. After a short while, the "could not get tags from node..." messages start coming through every 60 seconds. The

Re: streaming expressions substring-evaluator

2018-10-31 Thread Aroop Ganguly
Thanks for the note Joel. > On Oct 31, 2018, at 5:55 AM, Joel Bernstein wrote: > > The replace operator is going to be "replaced" :) > > Let's create an umbrella ticket for string operations and list out what > would be nice to have. They can probably be added very quickly. > > > Joel

Re: Solr cloud - poweroff procedure

2018-10-31 Thread Walter Underwood
“Take backups” is whatever you need for your environment. In AWS, we snapshot the EBS volumes, and so on. Backing up the Solr install and home directories would be good. There are some core.properties files in there that seem to be useful. Honestly, I don’t have a complete handle on the

RE: Merging data from different sources

2018-10-31 Thread Martin Frank Hansen (MHQ)
Hi Markus, Thanks for your reply! I hope I can make it work as well  -Original Message- From: Markus Jelsma Sent: 30. oktober 2018 22:02 To: solr-user@lucene.apache.org Subject: RE: Merging data from different sources Hello Martin, We also use an URP for this in some cases. We index

Re: streaming expressions substring-evaluator

2018-10-31 Thread Joel Bernstein
The replace operator is going to be "replaced" :) Let's create an umbrella ticket for string operations and list out what would be nice to have. They can probably be added very quickly. Joel Bernstein http://joelsolr.blogspot.com/ On Wed, Oct 31, 2018 at 8:49 AM Gus Heck wrote: > Probably

Re: streaming expressions substring-evaluator

2018-10-31 Thread Gus Heck
Probably ReplaceWithSubstringOperation (similar to ReplaceWithFieldOperation thought that would probably add another class be subject to https://issues.apache.org/jira/browse/SOLR-9661) On Wed, Oct 31, 2018 at 8:32 AM Joel Bernstein wrote: > I don't think there is a substring or similar

Re: streaming expressions substring-evaluator

2018-10-31 Thread Joel Bernstein
I don't think there is a substring or similar function. This would be quite nice to add along with other string manipulations. Joel Bernstein http://joelsolr.blogspot.com/ On Wed, Oct 31, 2018 at 2:37 AM Aroop Ganguly wrote: > Hey Team > > > Is there a way to extract a part of a string field

Re: Solr cloud - poweroff procedure

2018-10-31 Thread lstusr 5u93n4
Hi, Yes, zookeeper is external, and yes, we'll definitely wait until after solr has stopped to bring it down. Thanks for the tip about disabling `autoAddReplicas`, we definitely don't want the shards moving around during the process. Wunder, your point 3 mentions "take backups". Given that our

Re: Synonyms relationships

2018-10-31 Thread Doug Turnbull
Synonyms in Solr are really a kind of "programmers" tool, useful for mapping terms to other terms. This need not correspond to linguistic notions of a synonym or hypernomy/hyponomy. That being said, there's probably half a dozen approaches for doing these kinds of taxonomical relationships in

Re: Re: SolrCloud scaling/optimization for high request rate

2018-10-31 Thread Sofiya Strochyk
The logfiles on your servers should be verbose enough to indicate what machines are handling which parts of the request. Yes, generally i see the following entries in logs: 1. df=_text_=false=_id=score=4=0=true=fq===24=2==1540984948280=true=javabin 2.

Synonyms relationships

2018-10-31 Thread Nicolas Paris
Hi Does SolR provide a way to describe synonyms relationships such "equivalent to" ,"narrower thant", "broader than" ? It turns out both postgres and oracle do, but I can't find any related information in the documentation. This is useful to allow generalizing the terms of the research or not.

Re: Solr cloud - poweroff procedure

2018-10-31 Thread Shalin Shekhar Mangar
In case you are using a recent Solr 7.x version with collections that have autoAddReplicas=true, you should disable the auto add replicas feature before powering off so that Solr does not decide to move replicas around because nodes have been lost. See

streaming expressions substring-evaluator

2018-10-31 Thread Aroop Ganguly
Hey Team Is there a way to extract a part of a string field and group by on it and obtain a histogram ? for example the filed value is DateTime of the form: 20180911T00 and I want to do a substring like substring(field1,0,7), and then do a streaming expression of the form : rollup(