Solrcloud admin not asking for login

2018-09-21 Thread Sushant Vengurlekar
I followed the guide over here to setup basic auth for solrcloud http://lucene.apache.org/solr/guide/7_4/basic-authentication-plugin.html Our solrcloud version is 7.4. We have a 3 ensemble zookeeper with 3 solr nodes. I restarted the zookeepers and the solr nodes but I still don't get prompted

Re: Rule-based replication or sharing

2018-09-21 Thread Chuck Reynolds
Thanks Steve, I saw this option but it will mean a bit of re-working our automation to implement. The documentation talks about an EC2Snitch. I wish it worked like Cassandra where you just say that's the one you're using and it figures out how to replicate the data On 9/21/18, 2:40 PM,

Re: Rule-based replication or sharing

2018-09-21 Thread Shawn Heisey
On 9/21/2018 2:07 PM, Chuck Reynolds wrote: I'm using Solr 6.6 and I want to create a 90 node cluster with a replication factor of three. I'm using AWS EC2 instances and I have a requirement to replicate the data into 3 AWS availability zones. So 30 servers in each zone and I don't see a

Re: Rule-based replication or sharing

2018-09-21 Thread Steve Rowe
Hi Chuck, One way to do it is to set a system property on the JVM running each Solr node, corresponding to the the AWS availability zone on which the node is hosted. For example, you could use sysprop “AWSAZ”, then use rules like: replica:<2,sysprop.AWSAZ:us-east-1

Re: TolerantUpdateProcessorFactory maxErrors=-1 issue

2018-09-21 Thread Tomás Fernández Löbbe
Hi Derek, I suspect you need to move the TolerantUpdateProcessorFactory to the beginning of the chain On Thu, Sep 20, 2018 at 6:17 PM Derek Poh wrote: > Does any one have any idea whatcould be the causeof this? > > On 19/9/2018 11:40 AM, Derek Poh wrote: > > In addition, I tried withmaxErrors=3

Rule-based replication or sharing

2018-09-21 Thread Chuck Reynolds
I'm using Solr 6.6 and I want to create a 90 node cluster with a replication factor of three. I'm using AWS EC2 instances and I have a requirement to replicate the data into 3 AWS availability zones. So 30 servers in each zone and I don't see a create collection rule that will put one replica

Nested entity unrolled record

2018-09-21 Thread Tanya Bompi
Hi, I am wondering if with a nested entity using the DataImportHandler, if its possible to unroll the parent document with each of the child document as a separate entry in the index. I am using Solr 7.4.0 version. For eg. So what i would desire the index to

Re: Solr 7.2.1 Collection Backup Performance issue

2018-09-21 Thread Walter Underwood
I don’t know how well it worked, but for a while, I did this to warm up the file buffers. It should be OK if RAM is bigger than data. Though “cat” probably opens the files with the hint that it will never re-read the data. find /solr-data-dir -type f | xargs cat > /dev/null Basically, read

Re: Solr 7.2.1 Collection Backup Performance issue

2018-09-21 Thread Ganesh Sethuraman
We don't have all the index size fit in into memory, but we still have an acceptable performance as of now for reads/query. But with BACKUP we are seeing a increase in the OS memory usage. Given that, I am sure many of system might be running with less memory but good enough for their application.

Re: Solr 6.x and java 8

2018-09-21 Thread Shawn Heisey
On 9/21/2018 1:23 PM, tedsolr wrote: My application environment runs java 1.8. However I'm stuck building to 1.7 for now. I can still use SolrJ 6.1 in my app as long as I only deploy the SolrJ JAR and not build it from source. Right? If you can get your code to build while targeting 1.7, I

Re: Solr 6.x and java 8

2018-09-21 Thread tedsolr
Shawn, My application environment runs java 1.8. However I'm stuck building to 1.7 for now. I can still use SolrJ 6.1 in my app as long as I only deploy the SolrJ JAR and not build it from source. Right? -- Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html

Re: Query with exact number of tokens

2018-09-21 Thread Jan Høydahl
I have made a FieldType specially for this https://github.com/cominvent/exactmatch/ -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com > 21. sep. 2018 kl. 18:14 skrev Steve Rowe : > > Link correction - wrong fragment identifier

Re: [SolrJ Client] Error calling add: connection is still allocated

2018-09-21 Thread Shawn Heisey
On 9/21/2018 10:31 AM, Christopher Schultz wrote: For those interested, it looks like I was naïvely using BasicHttpClientConnectionManager, which is totally inappropriate in a multi-user threaded environment. I switched to PooledHttpClientConnectionManager and that seems to be working much

Re: [SolrJ Client] Error calling add: connection is still allocated

2018-09-21 Thread Christopher Schultz
-BEGIN PGP SIGNED MESSAGE- Hash: SHA256 All, On 9/18/18 11:10, Christopher Schultz wrote: > All, > > Our single-instance Solr server is just getting its first taste of > production load, and I'm seeing this periodically: > > java.lang.IllegalStateException: Connection is still

Re: Query with exact number of tokens

2018-09-21 Thread Steve Rowe
Link correction - wrong fragment identifier in ref #5 - should be: [5] https://lucene.apache.org/solr/guide/7_4/other-parsers.html#function-range-query-parser -- Steve www.lucidworks.com > On Sep 21, 2018, at 12:04 PM, Steve Rowe wrote: > > Hi Sergio, > > Chris “Hoss” Hostetter has a

Re: Query with exact number of tokens

2018-09-21 Thread Steve Rowe
Hi Sergio, Chris “Hoss” Hostetter has a solution to this kind of problem here: https://lists.apache.org/thread.html/6b0f0cb864aa55f0a9eadfd92d27d374ab8deb16e8131ed2b7234463@%3Csolr-user.lucene.apache.org%3E . See also the suggestions in comments on SOLR-12673[1], which include a version of

Re: Corrupted index in SolrCloud

2018-09-21 Thread Matt Pearce
Thanks for the explanation Erick, that makes sense! Matt On 21/09/2018 15:50, Erick Erickson wrote: The disk corruption is, of course, a red flag and likely the root cause. As for how it replicated let's assume a 2 replica shard (leader + follower). If the follower ever went into full

Re: Query with exact number of tokens

2018-09-21 Thread Walter Underwood
How about sorting the tokens in alphabetical order both for indexing and query, then using the sentinel trick. Source text: CENTURY BANCORP, INC Solr text: SENTINEL bancorp century inc SENTINEL wunder Walter Underwood wun...@wunderwood.org http://observer.wunderwood.org/ (my blog) > On Sep

Re: Query with exact number of tokens

2018-09-21 Thread Alexandre Rafalovitch
Hmm, I was suggesting to put TokenCountingFilter at the end of both indexing and query chains for the same (e.g. name_count) field. Then, the search would be something like (warning, major syntax errors): .../select? queryname=CENTURY BANCORP, INC& q=*:* fq={!eDisMax v=queryname mm=100%}name&

Re: Corrupted index in SolrCloud

2018-09-21 Thread Erick Erickson
The disk corruption is, of course, a red flag and likely the root cause. As for how it replicated let's assume a 2 replica shard (leader + follower). If the follower ever went into full recovery it would use old-style replication to copy down the entire index, corrupted index and all, from the

Re: Query with exact number of tokens

2018-09-21 Thread Erick Erickson
A variant on Alexandre's approach is: at index time, count the tokens that will be produced yourself (this may be a little tricky, you shouldn't have WordDelimiterFilterFactory in your analysis for instance). Put the number of tokens in a separate field At query time, you'd search

Re: Query with exact number of tokens

2018-09-21 Thread Alexandre Rafalovitch
I think you can match everything in the query to the field using either 1) disMax/eDisMax with mm=100% https://lucene.apache.org/solr/guide/7_4/the-dismax-query-parser.html#mm-minimum-should-match-parameter 2) Complex Phrase Query Parser with inOrder=false:

admin auth

2018-09-21 Thread John Blythe
hi everyone! we had authentication setup for our cloud deploy that for some reason or another disappeared after some updates. we didn't realize it immediately so aren't sure what triggered the change. curl requests still require auth but our admin panel is accessible. further, our local setup

Corrupted index in SolrCloud

2018-09-21 Thread Matt Pearce
Hi, We've just been working with a client who had a corruption issue with their SolrCloud install. They're running Solr 5.3.1, with a collection spread across 12 shards. Each shard has a single replica. They were seeing "Index Corruption" errors when running certain queries. We

Re: Query with exact number of tokens

2018-09-21 Thread Michael Kuhlmann
Hi Sergio, alas that's not possible that way. If you search for CENTURY BANCORP, INC., then Solr will be totally happy to find all these terms in "NEW CENTURY BANCORP, INC." and return it with a high score. But you can prepare your data at index time. Make it a multivalued field of type string

Re: Query with exact number of tokens

2018-09-21 Thread Andrea Gazzarini
Oops, sorry...too much rush in reading, I didn't read the second part. Please forget my answer ;) Andrea On 21/09/18 15:52, Andrea Gazzarini wrote: Hi Sergio, assuming that you don't want to disable tokenisation (otherwise you can define the indexed field as a string and search it as a

Re: Query with exact number of tokens

2018-09-21 Thread Andrea Gazzarini
Hi Sergio, assuming that you don't want to disable tokenisation (otherwise you can define the indexed field as a string and search it as a whole), in "Relevant Search" the authors describe a cool approach using the so called "Sentinel Tokens", which are symbolic tokens representing the

Query with exact number of tokens

2018-09-21 Thread marotosg
Hi, I have to search for company names where my first requirement is to find only exact matches on the company name. For instance if I search for "CENTURY BANCORP, INC." I shouldn't find "NEW CENTURY BANCORP, INC." because the result company has the extra keyword "NEW". I can't use exact match

Alias Not Showing Up in Solr Manager

2018-09-21 Thread THADC
Hi, When we create an alias from browser, like: *http://blah/solr/admin/collections?action=CREATEALIAS=blah-alias=blah-collection* , the alias is successfully created, but I would expect it to appear on the solr admin on the collections tab (where an alias can be created and deleted). thanks

Re: Solr 7.X negative filter not working

2018-09-21 Thread damian.pawski
Hi Yonik Seeley This is what I am getting, for JobTitle: (NOT resources ) I am getting: /parsed_filter_queries: parsed_filter_queries:*+(-*JobTitle:resources)/ for -JobTitle:(resources ) /parsed_filter_queries:

RE: Grammatical tenses Stemming in SOLR

2018-09-21 Thread Markus Jelsma
Hello Aishwarya, KStem does a really bad job with the examples you have given, it won't remove the -s and -ing suffixes in some strange cases. Porter/Snowball work just fine for this example. What won't work, of course, are irregular verbs and nouns (plural forms). They always need to be

Grammatical tenses Stemming in SOLR

2018-09-21 Thread aishwarya
1 down vote favorite i want to know which stemming filter factory can be used to fetch all the possible tenses of a stem word. example : if "run" is the search word -> it has to fetch results for all files involving run , running , runs , ran. also the vice-versa --> whichever tense of a word