Re: solr core replication

2017-10-19 Thread Hendrik Haddorp
Hi Erick, that is actually the call I'm using :-) If you invoke http://solr_target_machine:port/solr/core/replication?command=details after that you can see the replication status. But even after a Solr restart the call still shows the replication relation and I would like to remove this so t

Re: Concern on solr commit

2017-10-19 Thread Leo Prince
Thank you Yonik. Since we are using SoftCommits, the docs written will be in RAM until a AutoCommit to reflect onto Disk, I just wanted to know what happens when Solr restarts. Being said, I am using 4.10 and tomcat is handling the Solr, when we restart the tomcat service just before an AutoCommit

Re: Solr nodes going into recovery mode and eventually failing

2017-10-19 Thread Erick Erickson
Once you hit an OOM, the behavior of Java is indeterminate. There's no expectation that things will just pick up where they left off when memory his freed up. Lots of production systems have OOM killer scripts that automatically kill/restart Java apps that OOM for just that reason. Yes, each repli

Re: solr core replication

2017-10-19 Thread Erick Erickson
Little known trick: The fetchIndex replication API call can take any parameter you specify in your config. So you don't have to configure replication at all on your target collection, just issue the replication API command with masterUrl, something like: http://solr_target_machine:port/solr/core/

LTR feature and proximity search with Block Join Parent query Parser

2017-10-19 Thread Dariusz Wojtas
Hi, I am working on features and my main document ('type:entity') has child documents, some of them contain addresses ('type:entityAddress'). My feature definition: { "store": "store_myStore", "name": "scoreAddressCity", "class": "org.apache.solr.ltr.feature.SolrFeature", "params":{ "q": "

Re: Solr nodes going into recovery mode and eventually failing

2017-10-19 Thread shamik
Thanks Emir. The index is equally split between the two shards, each having approx 35gb. The total number of documents is around 11 million which should be distributed equally among the two shards. So, each core should take 3gb of the heap for a full cache. Not sure I get the "multiply it by number

solr core replication

2017-10-19 Thread Hendrik Haddorp
Hi, I want to transfer a Solr collection from one SolrCloud to another one. For that I create a collection in the target cloud using the same config set as on the source cloud but with a replication factor of one. After that I'm using the Solr core API with a "replication?command=fetchindex"

Solr boost property through request handler in solrconfig.xml

2017-10-19 Thread ruby
If I'm not using edismax or dismax, is there a way to boost a specific property through solrconfig.xml? I'm avoiding hard-coding boost in query. Following is my the request handler in solronfig.xml right now explicit 10 myFiled OR fc

Solr boost property through request handler in solrconfig.xml

2017-10-19 Thread ruby
If I'm not using edismax or dismax, is there a way to boost a specific property through solrconfig.xml? I'm avoiding hard-coding boost in query. Following is my the request handler in solronfig.xml right now explicit 10 myFiled OR fc

Re: spell-check does not return collations when using search query with filter

2017-10-19 Thread Arnold Bronley
Let me know if I should open a JIRA issue for this. Thanks. On Tue, Oct 17, 2017 at 10:40 AM, Arnold Bronley wrote: > I tried spellcheck.q=polt and q=tag:polt. I get collations, but they are > only for polt and not tag:polt. Because of that, the hits that I get back > are for frequency of plot a

RE: Certificate issue ERR_SSL_VERSION_OR_CIPHER_MISMATCH

2017-10-19 Thread Younge, Kent A - Norman, OK - Contractor
Resolved the Cipher Mismatch error. Thank you, Kent Younge Systems Engineer USPS MTSC IT Support 600 W. Rock Creek Rd, Norman, OK  73069-8357 O:405 573 2273 -Original Message- From: Younge, Kent A - Norman, OK - Contractor [mailto:kent.a.you...@usps.gov.INVALID] Sent: Thursday,

Re: Measuring time spent in analysis and writing to index

2017-10-19 Thread Zisis T.
I've worked in the past for a Solr 5.x custom plugin using AspectJ to track the # of calls as well as the time spent inside /incrementToken()/ of all Tokenizers and Filters used during indexing. I could get stats per Solr indexing thread, not per indexing request though. In any case you could spot

Measuring time spent in analysis and writing to index

2017-10-19 Thread Nawab Zada Asad Iqbal
Hi, I want to analyze the time spent in different stages during add/update document request. E.g., I want to compare time spend in analysis vs writing to Lucene index. Does Solr provide any such thing? I have looked at [core/admin/mbeans?stats=true&wt=json&indent=true] which provides overall stat

Re: 3 color jvm memory usage bar

2017-10-19 Thread Nawab Zada Asad Iqbal
Thanks Erik I see three colors in the JVM usage bar. Dark Gray, light Gray, white. (left to right). Only one dark and one light color made sense to me (as i could interpret them as used vs available memory), but there is light gray between dark gray and white parts. Thanks Nawab On Thu, Oct 19

Re: Schemaless detecting multivalued fields

2017-10-19 Thread Erick Erickson
Also, if you _know_ certain fields should be defined you can define them explicitly and let schemaless figure out all the others. That said, eventually you're going to have to control your schema, schemaless is _not_ recommended for production systems unless you can absolutely guarantee the input

Re: Deploy Solr to production: best practices

2017-10-19 Thread Walter Underwood
I recommend the “Taking Solr to Production” chapter in the official Solr reference guide. That was my first hit for “solr production” in Google. https://lucene.apache.org/solr/guide/6_6/taking-solr-to-production.html I re

Re: TermsQuery Result Ordering

2017-10-19 Thread Erick Erickson
If it's worth the effort to you, you could write a custom scorer that "somehow" pulled these terms out and did what you require. I suppose some kind of clever function query might work, but again probably custom. Frankly, though, I wouldn't go there until I'd exhausted either my resources or my us

Re: 3 color jvm memory usage bar

2017-10-19 Thread Erick Erickson
Nawab: Images are stripped aggressively by the Apache mail servers, your attachment didn't come through. You'll have to put it somewhere and provide a link. Generally the lighter color in each bar is the available resource and the darker shade is used. Best, Erick On Thu, Oct 19, 2017 at 7:27 A

Re: Deploy Solr to Production: guides, best practices

2017-10-19 Thread Erick Erickson
https://wiki.apache.org/solr/WhyNoWar Also, recent versions just don't build a war _for_ you. If you insist you can build your own war file by bundling up "the right stuff". However, there's no guarantee that you'll be able to do that going forward. I have to confess that I can't guarantee you can

Re: ClassicAnalyzer Behavior on accent character

2017-10-19 Thread Erick Erickson
Have you looked at the specification to see how it's _supposed_ to work? >From the javadocs: "implements Unicode text segmentation, * as specified by UAX#29." See http://unicode.org/reports/tr29/#Word_Boundaries If you look at the spec and feel that ClassicAnalyzer incorrectly implements the wor

Re: Deploy Solr to Production: guides, best practices

2017-10-19 Thread GW
Not a Windows user but you should be able to just install it and surf port 8983. Once installed it should show in services https://www.norconex.com/how-to-run-solr5-as-a-service-on-windows/ On 19 October 2017 at 07:18, maximka19 wrote: > Rick Leir-2 wrote > > Maximka > > The app server is bundl

3 color jvm memory usage bar

2017-10-19 Thread Nawab Zada Asad Iqbal
Good morning, What do the 3 colors mean in this bar on Solr dashboard page? (please see attached) : Regards Nawab

Re: TermsQuery Result Ordering

2017-10-19 Thread Webster Homer
Thank you, Erick. That is exactly what I thought. Indeed, we don't care about solr's scoring, as I said we do care about the order of the terms be maintained, hence the requirement for boosting the term values. On Wed, Oct 18, 2017 at 4:23 PM, Erick Erickson wrote: > bq: Can I boost the Terms

ClassicAnalyzer Behavior on accent character

2017-10-19 Thread Chitra
Hi, I indexed a term 'ⒶeŘꝋꝒɫⱯŋɇ' (aeroplane) and the term was indexed as "er l n", some characters were trimmed while indexing. Here is my code protected Analyzer.TokenStreamComponents createComponents(final String fieldName, final Reader reader) { final ClassicTokenizer

Certificate issue ERR_SSL_VERSION_OR_CIPHER_MISMATCH

2017-10-19 Thread Younge, Kent A - Norman, OK - Contractor
Built a clean Solr server imported my certificates and when I go to the SSL/HTTPS page it tells me that I have ERR_SSL_VERSION_OR_CIPHER_MISMATCH in Chrome and in IE tells me that I need to TURN ON TLS 1.0, TLS 1.1, and TLS 1.2. TLS is turned on and if I browse to the server name instead of the

Re: Deploy Solr to Production: guides, best practices

2017-10-19 Thread maximka19
Rick Leir-2 wrote > Maximka > The app server is bundled in Solr, so you do not install Tomcat or JEtty > separately. > Cheers -- Rick Hi! So, what should I do to host it in Windows Server as service? In production. Thanks -- Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html

RE: SolrJ 6.6.0 Connection pool shutdown now with stack trace

2017-10-19 Thread Markus Jelsma
By the way, we also see a generous amount of warnings in Zookeeper's logs. Are these related? An indication of what? Thanks, Markus 2017-10-19 08:57:35,583 [myid:2] - WARN [NIOServerCxn.Factory:0.0.0.0/0.0.0.0:2181:NIOServerCnxn@368] - caught end of stream exception EndOfStreamException: Unab

RE: SolrJ 6.6.0 Connection pool shutdown now with stack trace

2017-10-19 Thread Markus Jelsma
Hello, We are having this problem again, now it affects the front-end too, the logs are littered with Zookeeper connection log lines at WARN level. Is it expected that i have to deal with this problem myself? Isn't SolrJ or HTTPClient even going to guarantee me that they will handle underlying

Re: Deploy Solr to Production: guides, best practices

2017-10-19 Thread Rick Leir
Maximka The app server is bundled in Solr, so you do not install Tomcat or JEtty separately. Cheers -- Rick On October 19, 2017 2:01:30 AM EDT, maximka19 wrote: >Hi everyone! > >I was looking for full-text search engine and chosen Solr. Quickly >introduced with Solr. Now I'm having troubles wit

[ANNOUNCE] [SECURITY] CVE-2017-12629: Several critical vulnerabilities discovered in Apache Solr (XXE & RCE)

2017-10-19 Thread Shalin Shekhar Mangar
CVE-2017-12629: Several critical vulnerabilities discovered in Apache Solr (XXE & RCE) Severity: Critical Vendor: The Apache Software Foundation Versions Affected: Solr 5.5.0 to 5.5.4 Solr 6.0.0 to 6.6.1 Solr 7.0.0 to 7.0.1 Description: The details of this vulnerability were reported on public

Deploy Solr to production: best practices

2017-10-19 Thread maximka19
Hi everyone! I started learning full-text search engines and chosen Solr. I'm introduced with Solr, but now I'v having troubles to move Solr to production. *1.* Container: from Solr 5 there is now .WAR-file provided in package. I couldn't deploy Solr 7.1 to Tomcat 9. None of existing tutorials

Re: Solr nodes going into recovery mode and eventually failing

2017-10-19 Thread Emir Arnautović
Hi Shamik, I am pleased to see you find SPM useful! I think that your problems might be related to caches exhausting your memory. You mentioned that your index is 70GB, but how many documents it has? Remember that filter caches can take up to 1bit/doc. With 4096 filter cache size it means that f

Re: Schemaless detecting multivalued fields

2017-10-19 Thread Emir Arnautović
Hi John, You should be able to do that with custom update request processor chain and https://lucene.apache.org/solr/6_6_0//solr-core/org/apache/solr/update/processor/AddSchemaFieldsUpdateProcessorFactory.html

No in-place updates with router.field set

2017-10-19 Thread James
Steps to reproduce: Use Solr in SolrCloud mode. Create collection with implicit routing and router.field set to some field, e.g. "routerfield". Index very small document. Stop time -> X Index very large document. Stop time -> Y Apply update to large document. Note that update command has at least