Re: Solr 7.2.1 DELETEREPLICA automatically NRT replica appears

2018-03-07 Thread Greg Roodt
I'll check the logs when I'm back at my computer. Mostly errors about failing to find the core spamming the logs if I recall correctly. Node never becomes active. Just spams the logs. Only way to remove it is to stop solr in the node and delete the replica via API on another node. On Thu, 8 Mar

Re: Solr 7.2.1 DELETEREPLICA automatically NRT replica appears

2018-03-07 Thread Tomas Fernandez Lobbe
This shouldn’t be happening. Did you see anything related in the logs? Does the new NRT replica ever becomes active? Is there a new core created or do you just see the replica in the clusterstate? Tomas Sent from my iPhone > On Mar 7, 2018, at 8:18 PM, Greg Roodt wrote: >

Solr 7.2.1 DELETEREPLICA automatically NRT replica appears

2018-03-07 Thread Greg Roodt
Hi I am running a cluster of TLOG and PULL replicas. When I call the DELETEREPLICA api to remove a replica, the replica is removed, however, a new NRT replica pops up in a down state in the cluster. Any ideas why? Greg

Re: LTR not picking up modified features

2018-03-07 Thread Roopa ML
Thank you, I Reloaded and collection and see that the change picked up. I had not seen a need to do this in my local environment which is on non cloud mode. Regards Roopa Sent from my iPhone > On Mar 7, 2018, at 7:09 PM, Shawn Heisey wrote: > >> On 3/6/2018 12:57 PM,

Re: Solr Read-Only?

2018-03-07 Thread Shawn Heisey
On 3/6/2018 2:08 PM, Terry Steichen wrote: > Is it possible to run solr in a read-only directory? Solr can be installed as a service on most operating systems other than Windows.  A service installer script comes with the download.  It is installed to run as an unprivileged user, "solr" by

Re: LTR not picking up modified features

2018-03-07 Thread Shawn Heisey
On 3/6/2018 12:57 PM, Roopa Rao wrote: > There was an error in one of the feature definition in Solr LTR > features.json file and I modified and uploaded it to Solr. I can see that > the definition change is uploaded correctly using the feature store url such > as > >

Re: Replicate configoverlay.json

2018-03-07 Thread Shawn Heisey
On 3/6/2018 10:50 AM, Sundaram, Dinesh wrote: > Can you please share the steps to replicate configoverlay.json from > Master to Slave… in other words, how do we replicate from Master to > Slave if any configuration updated via API. If that file is in the same place as solrconfig.xml, then you

Re: Solr Warming Doubts

2018-03-07 Thread Shawn Heisey
On 3/7/2018 12:10 PM, Bsr wrote: I guess i should increase autowarmCount count. whats should be the ideal no. Also is there any way by which i can know that autowarm is completed? There are no generic answers.  You want autowarmCount to be large enough to be effective, but small enough that

LTR not able to upload org.apache.solr.ltr.model.MultipleAdditiveTreesModel

2018-03-07 Thread Roopa Rao
Trying to upload a simple MultipleAdditiveTreesModel, however I am getting an error "msg":"org.apache.solr.ltr.model.ModelException: Model type does not exist org.apache.solr.ltr.model.MultipleAdditiveTreesModel" Root cause seems to be a syntax error in the model file? I did copy this from the

CDCR performance issues

2018-03-07 Thread Tom Peters
I'm having issues with the target collection staying up-to-date with indexing from the source collection using CDCR. This is what I'm getting back in terms of OPS: curl -s 'solr2-a:8080/solr/mycollection/cdcr?action=OPS' | jq . { "responseHeader": { "status": 0,

RE: Solr Read-Only?

2018-03-07 Thread Phil Scadden
I would also second the proxy approach. Beside keeping your solr instance behind a firewall and not directly exposed, you can do a lot in a proxy. Per-user control over which index they are access, filtering of queries, etc. -Original Message- From: Emir Arnautović

Re: What is creating certain fields?

2018-03-07 Thread Cassandra Targett
I'll guess you're using Solr 7.x and those fields in your schema were created automatically? As of Solr 7.0, the schemaless mode field guessing added a copyField rule for any field that's guessed to be text to copy the first 256 characters to a multivalued string field. The way it works is a

Re: Solr Warming Doubts

2018-03-07 Thread Bsr
I guess i should increase autowarmCount count. whats should be the ideal no. Also is there any way by which i can know that autowarm is completed? -- Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html

Re: CDCR Invalid Number on deletes

2018-03-07 Thread Chris Troullis
Hey Amrit, thanks for the reply! I checked out SOLR-12036, but it doesn't look like it has to do with CDCR, and the patch that is attached doesn't look CDCR related. Are you sure that's the correct JIRA number? Thanks, Chris On Wed, Mar 7, 2018 at 11:21 AM, Amrit Sarkar

Re: Solr Warming Doubts

2018-03-07 Thread Bsr
Hi Its Just after 1-3 seconds when full import completed. Can you eloborate more on newSearcher and cache i.e what should we set. Also there is no resource crunch with the solr resources. -- Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html

Re: Solr Warming Doubts

2018-03-07 Thread Shawn Heisey
On 3/7/2018 8:37 AM, Bsr wrote: Whenever i am running the full-import, my response time for some request increases from 80ms to 3000ms. I'll start with the same question Emir asked: Is this *during* the import, or *after* it's done? If it's during the import, then the machine is doing a

Re: Solr 7.2.0 CDCR Issue with TLOG collections

2018-03-07 Thread Amrit Sarkar
Webster, I updated the JIRA: *SOLR-12057 , **CdcrUpdateProcessor* has a hack, it enable *PEER_SYNC* to bypass the leader logic in *DistributedUpdateProcessor.versionAdd,* which eventually ends up in segments not getting created. I wrote a very

Re: Solr Warming Doubts

2018-03-07 Thread Emir Arnautović
Hi, Is it during full import or after full import? If it is during, then it might mean that you don’t have enough resources or maybe GC is more active. You should monitor your system to see if there is some resource starvation. What sort of queries are slower? Does it include faceting? Maybe you

Re: CDCR Invalid Number on deletes

2018-03-07 Thread Amrit Sarkar
Hey Chris, I figured a separate issue while working on CDCR which may relate to your problem. Please see jira: *SOLR-12063* . This is a bug got introduced when we supported the bidirectional approach where an extra flag in tlog entry

Solr Warming Doubts

2018-03-07 Thread Bsr
Whenever i am running the full-import, my response time for some request increases from 80ms to 3000ms. This must be indicating my poor choice of warming up. 1. FirstSearcher I have added some 10 frequent used query but all my autowarmCount are set to 0. I have also added facet for warming. So

Solr Warming Doubts

2018-03-07 Thread Bsr
Whenever i am running the full-import, my response time for some request increases from 80ms to 3000ms. This must be indicating my poor choice of warming up. *1. FirstSearcher* I have added some 10 frequent used query but all my autowarmCount are set to 0. I have also added facet for warming. So

Solr Warming Up Doubts

2018-03-07 Thread Birender Rawat
henever i am running the full-import, my response time for some request increases from 80ms to 3000ms. This must be indicating my poor choice of warming up. *1. FirstSearcher* I have added some 2 frequent used query but all my autowarmCount are set to 0. I have also added facet for warming. So

Re: What is creating certain fields?

2018-03-07 Thread Erick Erickson
Maybe a copyField is realizing the dynamic fields? On Wed, Mar 7, 2018 at 7:43 AM, David Hastings wrote: > those are dynamic fields. > >indexed="false" stored="false"/> > > > On Wed, Mar 7, 2018 at 12:43 AM, Keith Dopson wrote: > >> My

Re: Solr dih extract text from inline images in pdf

2018-03-07 Thread Charlie Hull
On 07/03/2018 13:29, lala wrote: Thanks Charlie... It's just confusing for me, In the DIH configuration file, the inner entity that takes "TikaEntityProcessor" as its processor, I can easily specify a tikaConfig attribute to an xml file, located inside the config folder in the core, and where in

Re: What is creating certain fields?

2018-03-07 Thread David Hastings
those are dynamic fields. On Wed, Mar 7, 2018 at 12:43 AM, Keith Dopson wrote: > My default query produces this: > > | { > "id":"44419", > "date":["11/13/17 13:18"], > "url":["http://www.someurl.com;], > "title":["some title"], >

Re: Solr dih extract text from inline images in pdf

2018-03-07 Thread Erick Erickson
You're missing Charlie's point, and if you read the blog I pointed you to that point is reiterated. DIH does the Tika processing on the Solr node that is _also_ indexing documents and satisfying queries. Parsing a semi-structured document (PDF in this case) consumes CPU cycles and memory, all

What is creating certain fields?

2018-03-07 Thread Keith Dopson
My default query produces this: | { "id":"44419", "date":["11/13/17 13:18"], "url":["http://www.someurl.com;], "title":["some title"], "content":["some indexed content..."], "date_str":["11/13/17 13:18"],

[ANNOUNCE] Apache Solr 6.6.3 released

2018-03-07 Thread Steve Rowe
7 March 2018, Apache Solr™ 6.6.3 available The Lucene PMC is pleased to announce the release of Apache Solr 6.6.3. Solr is the popular, blazing fast, open source NoSQL search platform from the Apache Lucene project. Its major features include powerful full-text search, hit highlighting,

Re: Solr dih extract text from inline images in pdf

2018-03-07 Thread lala
I dont' know what is the problem, when posting the message, the xml format inside the is not correct, it should contain ["<"param name="extractInlineImages" type="bool">true] AND ["<"param name="sortByPosition" type="bool">true]... -- Sent from:

Re: Solr dih extract text from inline images in pdf

2018-03-07 Thread lala
Thanks Charlie... It's just confusing for me, In the DIH configuration file, the inner entity that takes "TikaEntityProcessor" as its processor, I can easily specify a tikaConfig attribute to an xml file, located inside the config folder in the core, and where in this file I should be able to

Re: Solr Collection Loosing Leader

2018-03-07 Thread Aaryan Reddy
Folks any suggestion here ? On Thu, Mar 1, 2018 at 12:28 PM, Aaryan Reddy wrote: > Hello All, > > I am running into frequent issue where the leader shard in solr cloud > stays active but does not acknowledge as "leader" . This brings down the > other replicas as they

Secure way to backup Solrcloud

2018-03-07 Thread Daniel Carrasco
Hello, My question is if there is any way to backup a Solr cluster even when all replicas are "not synced"... I'm using the api to create the dumps: http://localhost:8983/solr/admin/collections?action=BACKUP=myBackupName=myCollectionName=/path/to/my/shared/drive But is a lottery where the most

Re: Solr dih extract text from inline images in pdf

2018-03-07 Thread Charlie Hull
On 07/03/2018 09:32, lala wrote: Thanks for your reply Erick, Actually I am using Solrj to index files among other operations with Solr, but to index a large amount of differesnt kinds of file, I'm sending a DIH request to Solr using Solrj API : FileListEntityProcessor with TikaEntityParser...

Re: Solr dih extract text from inline images in pdf

2018-03-07 Thread lala
Thanks for your reply Erick, Actually I am using Solrj to index files among other operations with Solr, but to index a large amount of differesnt kinds of file, I'm sending a DIH request to Solr using Solrj API : FileListEntityProcessor with TikaEntityParser... Why not benefit from this

Re: Solr Read-Only?

2018-03-07 Thread Emir Arnautović
Hi Terry, Maybe you can try alternative approaches like putting some proxy in front of Solr and configure it to let only certain URLs. Other option is to define custom update request processor chain that will not include RunUpdateProcessorFactory - that will prevent accidental index updates.