Re: URGENT Documents automatically getting deleted in SOLR 6.6.0

2019-09-26 Thread Alexandre Rafalovitch
Your system is under attack, something trying to hack into it via Solr. Possibly a cryptominer or similar. And it is using DIH endpoint for it. Shawn explain the most likely cause for Solr actually deleting the records. I would also suggest: 1) Figure out where the request is coming from and

Re: [Apache Solr ReRanking] Sort Clauses Bug

2019-09-26 Thread Alessandro Benedetti
Personally I was expecting the sort request parameter to be applied on the final search results: 1) run original query, get top K based on score 2) run re rank query on the top K, recalculate the scores 3) finally apply the sort But when you mentioned "you expect the sort specified to be applied

Re: auto scaling question - solr 8.2.0

2019-09-26 Thread Joe Obernberger
Just as another data point.  I just tried again, and this time, I got an error from one of the remaining 3 nodes: Error while trying to recover. core=UNCLASS_2019_6_8_36_shard2_replica_n21:java.util.concurrent.ExecutionException: org.apache.solr.client.solrj.SolrServerException: IOException

auto scaling question - solr 8.2.0

2019-09-26 Thread Joe Obernberger
Hi all - I have a 4 node cluster for test, and created several solr collections with 2 shards and 2 replicas each. I'd like the global policy to be to not place more than one replica of the same shard on the same node.  I did this with this curl command: curl -X POST -H

RE: How to split a shard?

2019-09-26 Thread Gael Jourdan-Weil
Thanks for your answer Shawn, let's use the Collections API only then :) Any idea what could cause the "missing index size information for parent shard leader" error message? Regards, Gaël De : Shawn Heisey Envoyé : jeudi 26 septembre 2019 16:58 À :

Re: [Apache Solr ReRanking] Sort Clauses Bug

2019-09-26 Thread Erick Erickson
OK so to restate, you expect the sort specified to be applied to both the “outer” and “inner” queries. Makes sense, seems like a good enhancement. Hmm, I wonder if you can put the sort parameter in with the rerank specification, like: q={!rerank reRankQuery=$rqq reRankDocs=1200 reRankWeight=3

Re: [Apache Solr ReRanking] Sort Clauses Bug

2019-09-26 Thread Alessandro Benedetti
In the first OK scenario, the search results are sorted with score desc, and when the score is identical, the secondary sort field is applied. In the KO scenario, only score desc is taken into consideration(the reranked score) , the secondary sort by the sort field is ignored. I suspect an

Re: How to split a shard?

2019-09-26 Thread Shawn Heisey
On 9/26/2019 8:50 AM, Gael Jourdan-Weil wrote: We are trying to split a single shard into two but we are encountering some issues we don't understand. A) Create a new core "col_core2", then run the SPLIT (https://lucene.apache.org/solr/guide/7_6/coreadmin-api.html#coreadmin-split) If

How to split a shard?

2019-09-26 Thread Gael Jourdan-Weil
Hi, We are trying to split a single shard into two but we are encountering some issues we don't understand. Our current setup: - 1 collection "col" - 1 shard "shard1" - 2 nodes, each having the whole collection (SolrCloud) - 1 core on each node "col_core" What we would like to have is: - 1

Re: How to resolve a single domain name to multiple zookeeper IP in Solr

2019-09-26 Thread LEE Ween Jiann
Thank you, this is what I needed to know. On 26/9/19, 9:08 PM, "Shawn Heisey" wrote: On 9/26/2019 4:12 AM, LEE Ween Jiann wrote: > I'm trying to modify the helm chart for solr such that it works for kubernetes (k8s) deployment correctly. There needs to be a particular change in the

Re: how to configure AWS S3 bucket to index data

2019-09-26 Thread nenzius
did you solved this problem? Thanks -- Sent from: https://lucene.472066.n3.nabble.com/Solr-User-f472068.html

Re: URGENT Documents automatically getting deleted in SOLR 6.6.0

2019-09-26 Thread Shawn Heisey
On 9/26/2019 6:42 AM, Neha wrote: Today i have noticed that in my SOLR instance 6.6.0 documents are getting automatically deleted. In SOLR traces i found below lines and seems it is because of this. 2019-09-26 09:01:21.599 INFO  (qtp225493257-14) [   x:Ecotron] Also the

[JOB] remote job at Help Scout

2019-09-26 Thread Leah Knobler
Hey all! Help Scout , a 100 person remote company that builds helpful customer messaging tools, is looking for a Java Data Engineer

Re: How to resolve a single domain name to multiple zookeeper IP in Solr

2019-09-26 Thread Shawn Heisey
On 9/26/2019 4:12 AM, LEE Ween Jiann wrote: I'm trying to modify the helm chart for solr such that it works for kubernetes (k8s) deployment correctly. There needs to be a particular change in the way solr resolves zookeepers hostname in order for this to happen. This is the solr-user mailing

URGENT Documents automatically getting deleted in SOLR 6.6.0

2019-09-26 Thread Neha
Hello SOLR Users, Today i have noticed that in my SOLR instance 6.6.0 documents are getting automatically deleted. In SOLR traces i found below lines and seems it is because of this. 2019-09-26 09:01:21.599 INFO  (qtp225493257-14) [   x:Ecotron] o.a.s.c.S.Request [xyz]  webapp=/solr

Re: [Apache Solr ReRanking] Sort Clauses Bug

2019-09-26 Thread Erick Erickson
Hmmm, can we see a bit of sample output? I always have to read this backwards, the outer query results are sent to the inner query, so my _guess_ is that the sort is applied to the “q=*:*” and then the top 1,200 are sorted by score by the rerank. But then I’m often confused about this. Erick

Re: Undefined field - solr 7.2.1 cloud

2019-09-26 Thread Erick Erickson
BTW, my purpose in suggesting you remove managed schema is just to insure that you’re really using classic. Solr will blow up because it’s unable to find managed-schema if, for some strange reason, you’re really using managed. The two can exist perfectly well together, one should be used and

RE: How to resolve a single domain name to multiple zookeeper IP in Solr

2019-09-26 Thread LEE Ween Jiann
SMU Classification: Restricted Yes zookeeper supports dynamic change from 3.5.x. I am referring to Solr here. You would need to specify the list of zookeeper servers in solr.in.sh or solr.in.cmd or as -z param.

Re: How to resolve a single domain name to multiple zookeeper IP in Solr

2019-09-26 Thread Jörn Franke
The newest zk version supports dynamic change of the zk instances: https://zookeeper.apache.org/doc/r3.5.3-beta/zookeeperReconfig.html However, for that to work properly in case of a Solr restart you always need a minimal set of servers that do not change and just increase/decrease additional

How to resolve a single domain name to multiple zookeeper IP in Solr

2019-09-26 Thread LEE Ween Jiann
I'm trying to modify the helm chart for solr such that it works for kubernetes (k8s) deployment correctly. There needs to be a particular change in the way solr resolves zookeepers hostname in order for this to happen. Let me explain... The standard way to configure solr is by listing all the

Re: Need more info on MLT (More Like This) feature

2019-09-26 Thread Alessandro Benedetti
In addition to all the valuable information already shared I am curious to understand why you think the results are unreliable. Most of the times is the parameters that cause to ignore some of the terms of the original document/corpus (as simple of the min/max document frequency to consider or min