Re: Performance & CPU Usage of 6.2.1 vs 6.5.1 & above

2018-04-18 Thread mganeshs
Hello Deepak, We are not querying when indexing is going on. Whatever CPU graph I shared for 6.2.1 and 6.5.1 was only while we do batch indexing. During that time we don't query and no queries are getting executed. We index in a batch with a rate of around 100 documents / sec. And it's not so

Re: How to ptotect middile initials during search

2018-04-18 Thread Walter Underwood
Or even better, don’t remove stopwords. Stopwords are a technique invented for 16-bit machines, where common words made posting lists too long to to handle. wunder Walter Underwood wun...@wunderwood.org http://observer.wunderwood.org/ (my blog) > On Apr 18, 2018, at 2:20 PM, Jay Potharaju

Re: need help on search on last name + middile initial

2018-04-18 Thread Shawn Heisey
On 4/18/2018 1:12 PM, Wendy2 wrote: > "debug":{ > "debugQuery mode indicates that Solr dropped the ""A."" when parsing the > query: > ""debug"":{ > ""rawquerystring"":""\""Ellington, A.\, > ""querystring"":""\""Ellington, A.\, > >

Re: How to ptotect middile initials during search

2018-04-18 Thread Jay Potharaju
A is part of stopwords ...that is why it got dropped. Protected words will only stop it from stemming https://lucene.apache.org/solr/guide/6_6/language-analysis.html Thanks Jay Potharaju On Wed, Apr 18, 2018 at 11:35 AM, Wendy2 wrote: > Hi fellow Users, > > Why did Solr

How to ptotect middile initials during search

2018-04-18 Thread Wendy2
Hi fellow Users, Why did Solr return "Ellington, W.R." when I did a name search for "Ellington, A."? I even added "A." in the protwords.txt file. The debugQuery shows that the middle initial got dropped in the parsedquery. How can I make Solr NOT to drop the middle initial? Thanks for your

Re: CdcrReplicator Forwarder not working on some shards

2018-04-18 Thread Susheel Kumar
I was able to resolve this issue by start/stop the cdcr process couple of times until all shards leaders started forwarding updates... Thnx On Tue, Apr 17, 2018 at 3:20 PM, Susheel Kumar wrote: > Hi Amrit, > > The cdcr?action=ERRORS is returning consecutiveErrors=1 on

PF, PF2, PF3 clauses missing in solr7 with query-time synonyms?

2018-04-18 Thread Elizabeth Haubert
I'm seeing pf and pf3 clauses fail to generate in long queries containing synonyms. Wondering if anyone else has run into this, or if it needs to be submitted as a bug in Jira. It is a showstopper problem for the current project, as the pf and pf3 were pretty heavily tuned. Using Solr 7.1; all

need help on search on last name + middile initial

2018-04-18 Thread Wendy2
Hi Solr experts: How can I make sure Solr doesn't drop middle initial when I do a name search? I did a search with double quotes for "Ellington, A.", but Solr parser dropped the middle initial, so I got both back: I even tried keeping A. in the protwords.txt file, but didn't work. Any work

RE: Specialized Solr Application

2018-04-18 Thread Allison, Timothy B.
To be Waldorf to Erick's Statler (if I may), lots of things can go wrong during content extraction.[1] I had two big concerns when I heard of your task: 1) image only pdfs, which can parse without problem, but which might yield 0 content. 2) emails (see, e.g. SOLR-12048) It sounds like

Re: NER question

2018-04-18 Thread Steve Rowe
Hi Alexey, Did you see my response to your “Solr OpenNLP named entity extraction” thread? I think I’ve answered your questions. -- Steve www.lucidworks.com > On Apr 18, 2018, at 4:28 AM, Alexey Ponomarenko > wrote: > > Hi, I have a question regarding NER > >

Re: Specialized Solr Application

2018-04-18 Thread Erick Erickson
Terry: If your process works, then it works and there's no real reason to change. I was commingling the structure of the content with the metadata. You're right that the content doesn't really have any useful structure. Sometimes you can get some useful information out of the metadata,

Re: Writing config directly to zookeeper

2018-04-18 Thread Walter Underwood
I didn’t want to install Solr just so Jenkins could use one script. The Python is standalone. I was using the zkCli tools, which were just not all that well documented. I never could find a description of exactly which files were copied where. The solr.xml directory structure had /conf/, but

Re: Writing config directly to zookeeper

2018-04-18 Thread Erick Erickson
There are some perhaps easier ways to manipulate ZK in the "bin/solr" script if you haven't seen it bin/solr zk -help Best, Erick On Wed, Apr 18, 2018 at 8:30 AM, Arturas Mazeika wrote: > Hi Walter, > > Thanks for the message. Would you care to share the tool with us? I

NER question

2018-04-18 Thread Alexey Ponomarenko
Hi, I have a question regarding NER https://stackoverflow.com/questions/49894727/using-named-entity-extraction-in-solr-7-3 can you help me?

Run solr server using Java program

2018-04-18 Thread rameshkjes
Hi guys, I am able to run the solr instance, add the core and import the data manually. But I want to do everything with the help of Java program, I searched a lot but did not find any relevant answer. In order to run the solr server, i execute following command inside directory:

Re: SolrCloud [subquery] with join on multiple terms

2018-04-18 Thread gallex2000
Thanks, it's working. Regards, Alex G -- Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html

Re: solr 5.4.1 - updates/inserts suddenly very slow. Search still fast

2018-04-18 Thread Shalin Shekhar Mangar
You can get a thread dump by calling http://localhost:8983/solr/admin/threads or by using the Admin UI. On Wed, Apr 18, 2018 at 9:11 PM, Felix XY wrote: > Thank you Emir, but I'm not able to make a thread dump while doing updates > because the updates are very fast

Re: solr 5.4.1 - updates/inserts suddenly very slow. Search still fast

2018-04-18 Thread Felix XY
Thank you Emir, but I'm not able to make a thread dump while doing updates because the updates are very fast again: While I wrote this email my colleague was googling around. He found this http://lucene.472066.n3.nabble.com/HttpSolrServer-commit-is-taking-more-time-td4330954.html and my

Re: Writing config directly to zookeeper

2018-04-18 Thread Arturas Mazeika
Hi Walter, Thanks for the message. Would you care to share the tool with us? I would be interested.. Or have you shared it already? Cheers, Arturas On Wed, Apr 18, 2018 at 5:09 PM, Walter Underwood wrote: > I wrote a Python tool to do this. I use the kazoo package to

Re: Howto change log level with Solr Admin UI ?

2018-04-18 Thread Shalin Shekhar Mangar
The changes made using the admin logging UI are local to the node. It will not change logging settings on other nodes and these changes do not persist between restarts. On Wed, Apr 18, 2018 at 7:33 PM, Bernd Fehling < bernd.fehl...@uni-bielefeld.de> wrote: > I just tried to change the log level

Re: schema-api: modifying schema in xml format

2018-04-18 Thread Arturas Mazeika
Hi Steve, it is reasonable that schema api understand the commands only JSON. Great that you'll update the ref guide. Thanks for taking care of it. Nice of you :-) Cheers, Arturas On Wed, Apr 18, 2018 at 3:27 PM, Steve Rowe wrote: > Hi Arturas, > > The Schema API only

Re: Howto change log level with Solr Admin UI ?

2018-04-18 Thread Emir Arnautović
Hi, It is not exposed in the admin console (would be nice if it is!), but there is a way to set threshold for admin UI logs. You can simply execute following: http://localhost:8983/solr/admin/info/logging?since=0=INFO and INFO logs

Re: solr 5.4.1 - updates/inserts suddenly very slow. Search still fast

2018-04-18 Thread Emir Arnautović
Hi Felix, Did you try to do thread dump while doing update. Did it show anything? Emir -- Monitoring - Log Management - Alerting - Anomaly Detection Solr & Elasticsearch Consulting Support Training - http://sematext.com/ > On 18 Apr 2018, at 17:06, Felix XY wrote: >

Re: Specialized Solr Application

2018-04-18 Thread Terry Steichen
Thanks, Erick.  What I don't understand that "rich text documents" (aka, PDF and DOC) lack any internal structure (unlike JSON, XML, etc.), so there's not much potential in trying to get really precise in parsing them.  Or am I overlooking something here? And, as you say, the metadata of such

Re: Writing config directly to zookeeper

2018-04-18 Thread Walter Underwood
I wrote a Python tool to do this. I use the kazoo package to talk to Zookeeper. It starts with the load balancer URL to Solr. 1. Get cluster status. 2. Parse out the Zookeeper config string including chroot. 3. Connect to Zookeeper. 4. Copy the config to the location described in Shawn’s

solr 5.4.1 - updates/inserts suddenly very slow. Search still fast

2018-04-18 Thread Felix XY
Hello group, since two days we have huge problems with our solr 5.4.1 installation. ( yes, we have to update it. But this will not be a solution right now ) All path=/select requests are still very fast. But all /update Requests take >30sec up to 3 minutes. The index is not very big (1.000.000

Re: Howto change log level with Solr Admin UI ?

2018-04-18 Thread Shawn Heisey
On 4/18/2018 8:03 AM, Bernd Fehling wrote: I just tried to change the log level with Solr Admin UI but it does not change any logging on my running SolrCloud. It just shows the changes in the Admin UI and the commands in the request log, but no changes in the level of logging. Do I have to

Howto change log level with Solr Admin UI ?

2018-04-18 Thread Bernd Fehling
I just tried to change the log level with Solr Admin UI but it does not change any logging on my running SolrCloud. It just shows the changes in the Admin UI and the commands in the request log, but no changes in the level of logging. Do I have to RELOAD the collection after changing log level?

Re: Issue with Solr Case Insensitive Issue

2018-04-18 Thread Kapil Bhardwaj
Thanks Shwan, I guess i will get in touch with my DB support team for the full re-index.Even i was in doubt whether the re-index via Core Admin is really serving the purpose. Regards, Kapil Bhardwaj On Wed, Apr 18, 2018 at 6:58 PM Shawn Heisey wrote: > On 4/18/2018 3:45

Re: Issue with Solr Case Insensitive Issue

2018-04-18 Thread Shawn Heisey
On 4/18/2018 3:45 AM, Kapil Bhardwaj wrote: After making changes i RELOADED the schema via terminal command and tried to re-index the schema using solr core admin button. You can't reindex by clicking a button.  Unless it's the same button you used to do the indexing the first time.

Re: schema-api: modifying schema in xml format

2018-04-18 Thread Steve Rowe
Hi Arturas, The Schema API only understands commands in JSON. I looked through the ref guide page, and I’m surprised that this isn’t stated directly; I’ll try to fix that. -- Steve www.lucidworks.com > On Apr 18, 2018, at 4:12 AM, Arturas Mazeika wrote: > > Hi

Re: SolrCloud [subquery] with join on multiple terms

2018-04-18 Thread Mikhail Khludnev
Could it be like article.q=+{!terms f=articleid v=$row.articleid} +{!terms f=variantid v=$row.variantid} +{!terms f=language v=$row.language} ? On Wed, Apr 18, 2018 at 12:33 PM, gallex2000 wrote: > Hi, > > I have two Aliases in SolrCloud, > > 1. *Article *with columns

Re: custom response writer which extends RawResponseWriter fails when shards > 1

2018-04-18 Thread Mikhail Khludnev
Injecting headers might require deeper customisation up to establishing own filter or so. Speaking regarding your own WT, there might be some issues because usually it's not a big deal to use one wt for responding user query like (wt=csv) and wt=javabin in internal communication between aggregator

SolrCloud [subquery] with join on multiple terms

2018-04-18 Thread gallex2000
Hi, I have two Aliases in SolrCloud, 1. *Article *with columns id, articleid, variantid, language, content_type, description. 2. *ArticleAttributes *with columns for PARENT id, articleid, variantid, language, description (multivalued field with all values of attributes) and CHILDS (detailed

Re: custom response writer which extends RawResponseWriter fails when shards > 1

2018-04-18 Thread Lee Carroll
Inventive. I need to control content-type of the response from the document field value. I have the actual content field and the content-type field to use configured in the response writer. I've just noticed that the xslt transformer allows you to do this but not controlled by document values. I

Re: custom response writer which extends RawResponseWriter fails when shards > 1

2018-04-18 Thread Mikhail Khludnev
well .. what if http://localhost:8983/solr/images/select?fl=content=id:1=1=csv; csv.separator==null ? On Wed, Apr 18, 2018 at 1:18 PM, Lee Carroll wrote: > sorry cut n paste error i'd get > > { > "responseHeader":{ > "zkConnected":true, > "status":0, >

Re: custom response writer which extends RawResponseWriter fails when shards > 1

2018-04-18 Thread Lee Carroll
sorry cut n paste error i'd get { "responseHeader":{ "zkConnected":true, "status":0, "QTime":0, "params":{ "q":"*:*", "fl":"content", "rows":"1"}}, "response":{"numFound":1,"start":0,"docs":[ { "content":"my-content-value"}] }} but you get

Re: Learning to Rank (LTR) with grouping

2018-04-18 Thread Diego Ceccarelli (BLOOMBERG/ LONDON)
I just updated the PR to upstream - I still have to fix some things in distribute mode, but unit tests in non distribute mode works. Hope this helps, Diego From: solr-user@lucene.apache.org At: 04/15/18 03:37:54To: solr-user@lucene.apache.org Subject: Re: Learning to Rank (LTR) with

Re: custom response writer which extends RawResponseWriter fails when shards > 1

2018-04-18 Thread Lee Carroll
for http://localhost:8983/solr/images/select?fl=content=id:1=1 I'd get { "responseHeader":{ "zkConnected":true, "status":0, "QTime":1, "params":{ "q":"*:*", "_":"1524046333220"}}, "response":{"numFound":1,"start":0,"docs":[ { "id":"1",

Re: custom response writer which extends RawResponseWriter fails when shards > 1

2018-04-18 Thread Mikhail Khludnev
Lee, from this description I don see why it can't be addressed by fl,rows params. What makes it different form the typical Solr usage? On Wed, Apr 18, 2018 at 12:31 PM, Lee Carroll wrote: > Sure, we want to return a single field's value for the top matching >

Issue with Solr Case Insensitive Issue

2018-04-18 Thread Kapil Bhardwaj
Hi Team, Warm Greeting to all, I have started using Solr lately.Currently i am facing issue with case insensitive sorting for a field. We are using Solr on top of Cassandra v5 for index based searching.We have a field layoutpath which we want to make it case insensitive because currently it

Re: custom response writer which extends RawResponseWriter fails when shards > 1

2018-04-18 Thread Lee Carroll
Sure, we want to return a single field's value for the top matching document for a given query. Bare content rather than a full search result listing. To be concrete: For a schema of fields id [unique key], content[stored],content-type[stored] For a request: 1. Request URL:

schema-api: modifying schema in xml format

2018-04-18 Thread Arturas Mazeika
Hi solr-users, is it possible to modify the managed schema using schema api and submit the commands in XML format? I am able to add a data type using: curl -X POST -H 'Content-type:application/json' --data-binary '{ "add-field-type": { "name":"text_de_ph", "class":"solr.TextField",

Re: solr 6.6.3 intermittent group faceting errors(Lucene54DocValuesProducer)

2018-04-18 Thread Jay Potharaju
Thanks Eric & Shawn for chiming in ! In my solrconfig the lucene version is set to 6.6.3. I do see that the index has lucene54 files. With respect to the error regarding group faceting error it is similar to what is being reported in SOLR-7867

Re: Infostream question

2018-04-18 Thread Bernd Fehling
You have to check your log4j.properties, usually located server/resources/log4j.properties There is a line about infostream logging, change it from OFF to ON. # set to INFO to enable infostream log messages log4j.logger.org.apache.solr.update.LoggingInfoStream=OFF Regards Bernd Am 17.04.2018