Re: API call for optimising a collection

2016-05-17 Thread John Bickerstaff
Having run the optimize from the admin UI on one of my three cores in a Solr Cloud collection, I find that when I got to try to run it on one of the other cores, it is already "optimized" I realize that's not the same thing as an API call, but thought it might help. On Tue, May 17, 2016 at 11:22

Re: [scottchu] How to specify multiple zk nodes using solr startcommand under Windows

2016-05-17 Thread John Bickerstaff
I think those zk server warning messages are expected. Until you have 3 running instances you don't have a "Quorum" and the Zookeeper instances complain. Once the third one comes up they are "happy" and don't complain any more. You'd get similar messages if one of the Zookeeper nodes ever went

Why error #400 bad request somtimes where I add doc to Solrcloud collection? [scottchu]

2016-05-17 Thread scott.chu
I add doc by running post.jar with xml files but sometimes get error #400(Bad Request). At first, I doubt the xml files has problem but after I stop nodes and restart them. I add same doc again and it's successful. According post.jar output below, though NMLBOym_a_UN2004_07_20160511_1018.xml

Re: API call for optimising a collection

2016-05-17 Thread Nick Vasilyev
As far as I know, you have to run it on each core. On May 18, 2016 1:04 AM, "Binoy Dalal" wrote: > Is there no api call that can optimize an entire collection? > > I tried the collections api page on the confluence wiki but couldn't find > anything, and a Google search

API call for optimising a collection

2016-05-17 Thread Binoy Dalal
Is there no api call that can optimize an entire collection? I tried the collections api page on the confluence wiki but couldn't find anything, and a Google search also yielded no meaningful results. -- Regards, Binoy Dalal

A quick way to open stay-opened DOS window on any folder under Windows [scottchu]

2016-05-17 Thread scott.chu
I got many helps from this maillist. So this time I'd like to share some useful thing with guys. First, I'm not a cygwin guy under Windows. [Symptom] When we test some open source projects under Windows. We from time to time need: open Windows explorer, change to specific folder, click once

Re: [scottchu] How to specify multiple zk nodes using solr startcommand under Windows

2016-05-17 Thread scott.chu
I tested yesterday and it proves my theory. I'll share what I do under Windows on 1 PC here with you experienced guys and further newbies: 1>Download zookeeper 3.4.8. I unzip it and copy to 3 other different folders: zk_1, zk_2, zk_3. 2>For each zk_n folder, I do these things (Note: {n} means

回复:Re: How to encrypte the password for basic authentication of Solr

2016-05-17 Thread tjlp
Hi, Shawn, Thanks. So, we can not add some credentials at first in the security.json, right? To simplify, I just want to hardcode some users before the Solr starts. Googling the topic about "Basic authentication for Solr Admin panel", we can call jetty utility for generating encrypted password

Re: SolrCloud multiple collections each with unique schema via SolrJ?

2016-05-17 Thread Boman
Got it! I now use uploadConfig to load the default config for each new collection I create, and then modify the schema. Thanks! -- View this message in context: http://lucene.472066.n3.nabble.com/SolrCloud-multiple-collections-each-with-unique-schema-via-SolrJ-tp4277397p4277406.html Sent from

Re: SolrCloud multiple collections each with unique schema via SolrJ?

2016-05-17 Thread Shawn Heisey
On 5/17/2016 7:00 PM, Boman wrote: > I load the defaul config using scripts/cloud-scripts/zkcli.sh -cmd upconfig > after which collections are created programmatically and the schema modified > as per each collection's requirements. > > I now notice that it is the SAME "default" original schema

SolrCloud multiple collections each with unique schema via SolrJ?

2016-05-17 Thread Boman
I load the defaul config using scripts/cloud-scripts/zkcli.sh -cmd upconfig after which collections are created programmatically and the schema modified as per each collection's requirements. I now notice that it is the SAME "default" original schema that holds ALL the modifications (new fields).

Re: SolrCloud replicas consistently out of sync

2016-05-17 Thread Stephen Weiss
Gotcha - well that's nice. Still, we seem to be permanently out of sync. I see this thread with someone having a similar issue: https://mail-archives.apache.org/mod_mbox/lucene-solr-user/201601.mbox/%3c09fdab82-7600-49e0-b639-9cb9db937...@yahoo.com%3E It seems like this is not really fixed in

RE: SolrCloud replicas consistently out of sync

2016-05-17 Thread Markus Jelsma
Hi, thats a known issue and unrelated: https://issues.apache.org/jira/browse/SOLR-9120 M. -Original message- > From:Stephen Weiss > Sent: Tuesday 17th May 2016 23:10 > To: solr-user@lucene.apache.org; Aleksey Mezhva ; > Hans Zhou

Re: SolrCloud replicas consistently out of sync

2016-05-17 Thread Stephen Weiss
I should add - looking back through the logs, we're seeing frequent errors like this now: 78819692 WARN (qtp110456297-1145) [ ] o.a.s.h.a.LukeRequestHandler Error getting file length for [segments_4o] java.nio.file.NoSuchFileException:

Re: SolrCloud replicas consistently out of sync

2016-05-17 Thread Stephen Weiss
OK, so we did as you suggest, read through that article, and we reconfigured the autocommit to: ${solr.autoCommit.maxTime:3} false ${solr.autoSoftCommit.maxTime:60} However, we see no change, aside from the fact that it's clearly committing more frequently. I will say on our end,

Solrj Basic Authentication randomly failing - request has come without principal

2016-05-17 Thread Shamik Bandopadhyay
Hi, I'm facing this issue where SolrJ calls are randomly failing on basic authentication. Here's exception: ERROR923629[qtp466002798-20] - org.apache.solr.security.PKIAuthenticationPlugin.doAuthenticate(PKIAuthenticationPlugin.java:125) - Invalid key INFO923630[qtp466002798-20] -

Re: [scottchu] How to specify multiple zk nodes using solr start command under Windows

2016-05-17 Thread John Bickerstaff
it's roundabout, but this might work -- ask for the healthcheck status (from the solr box) and hit each zkNode separately. I'm on Linux so you'll have to translate to Windows... using the solr.cmd file I assume... ./solr healthcheck -z 192.168.56.5:2181/solr5_4 -c collectionName ./solr

Re: [scottchu] How to specify multiple zk nodes using solr start command under Windows

2016-05-17 Thread Abdel Belkasri
The repetition is just a cut and paste from Scott's post. How can I check if I am getting the ensemble or just a single zk? Also if this is not the way to specify an ensemble, what is the right way? Because the comma delimited list does not work, I concur with Scott. On Tue, May 17, 2016 at

Re: Specifying dynamic field type without polluting actual field names with type indicators

2016-05-17 Thread Shawn Heisey
On 5/17/2016 9:05 AM, Horváth Péter Gergely wrote: > By default Solr allows you to define the type of a dynamic field by > appending a post-fix to the name itself. E.g. creating a color_s field > instructs Solr to create a string field. My understanding is that if we do > this, all queries must

Re: [scottchu] How to specify multiple zk nodes using solr start command under Windows

2016-05-17 Thread Erick Erickson
Are you absolutely sure you're getting an _ensemble_ and not just connecting to a single node? My suspicion (without proof) is that you're just getting one -z option. It'll work as long as that ZK instance stays up, but it won't be fault-tolerant. And again you repeated the port (2181) twice.

Re: Updating error while add doc to Solrcloud

2016-05-17 Thread Erick Erickson
I _think_ you are using "schemaless" mode and the issue is that Solr guesses the type of the field based on the first doc it encounters. Thereafter, if any incoming doc has a different field (say the "guess" is an int type and later something that's not an int is in that field) then it is

Re: SolrCloud replicas consistently out of sync

2016-05-17 Thread Erick Erickson
OK, these autocommit settings need revisiting. First off, I'd remove the maxDocs entirely although with the setting you're using it probably doesn't matter. The maxTime of 1,200,000 is 20 minutes. Which means if you evern un-gracefully kill your shards you'll have up to 20 minutes worth of data

can custom search component access data in custom post filter component?

2016-05-17 Thread ruby
Is there any way a custom search component can access data created in custom post filter query component so that the data can be added to the response? -- View this message in context:

Re: SolrCloud replicas consistently out of sync

2016-05-17 Thread Stephen Weiss
Yes, after startup there was a recovery process, you are right. It's just that this process doesn't seem to happen unless we do a full restart. These are our autocommit settings - to be honest, we did not really use autocommit until we switched up to SolrCloud so it's totally possible they are

Re: Specifying dynamic field type without polluting actual field names with type indicators

2016-05-17 Thread Steve Rowe
Hi Peter, Are you familiar with the Schema API?: You can use it to create fields, field types, etc. prior to ingesting your data. -- Steve www.lucidworks.com > On May 17, 2016, at 11:05 AM, Horváth Péter Gergely >

Re: Adding information to Solr response in custom filter query code?

2016-05-17 Thread ruby
Thanks for your reply. Is there any way a custom search component can access data created in custom post filter query so that the data can be added to the response? -- View this message in context:

Specifying dynamic field type without polluting actual field names with type indicators

2016-05-17 Thread Horváth Péter Gergely
Hi All, By default Solr allows you to define the type of a dynamic field by appending a post-fix to the name itself. E.g. creating a color_s field instructs Solr to create a string field. My understanding is that if we do this, all queries must refer the post-fixed field name as well. So instead

Specifying dynamic field type without polluting actual field names with type indicators

2016-05-17 Thread Horváth Péter Gergely
Hi All, By default Solr allows you to define the type of a dynamic field by appending a post-fix to the name itself. E.g. creating a color_s field instructs Solr to create a string field. My understanding is that if we do this, all queries must refer the post-fixed field name as well. So instead

Re: [scottchu] How to specify multiple zk nodes using solr start command under Windows

2016-05-17 Thread Abdel Belkasri
Hi Scott, what worked for me in Windows is this (no ",") bin\Solr start -c -s mynodes\node1 -z localhost:2181 -z localhost:2181 -z localhost:2183 -- Hope this helps Abdel. On Tue, May 17, 2016 at 3:35 AM, scott.chu wrote: > I start 3 zk nodes at port 2181,2182, and 2183

Re: Dynamically change solr suggest field

2016-05-17 Thread Lasitha Wattaladeniya
Hi Alessandro, Yes, using suggester is the correct way of doing. But in our scenario we thought of going with the spellchecker component since we had some legacy setup. Our plan is to move to suggester later on. So far spellchecker component also does the needed work for us. Regards, Lasitha

Conditional atomic update

2016-05-17 Thread Chris Yee
I'm looking for a way to do an atomic update, but if a certain condition exists on the existing document, abort the update. Each document has the fields id, count, and value. The source data has just id and value. When the source data is indexed, I use atomic updates to: - Increment the count

Re: Sorting for MLT results

2016-05-17 Thread Alessandro Benedetti
using the more like this query parser should solve your problem ! Just use that query parser and than sort as usual. Cheers On Wed, May 11, 2016 at 4:53 AM, Zheng Lin Edwin Yeo wrote: > Hi, > > Would like to check, is there a function to do the sorting for MLT results >

Re: json.facet streaming

2016-05-17 Thread Nick Vasilyev
Got it. Thanks for clarifying. On Tue, May 17, 2016 at 9:58 AM, Yonik Seeley wrote: > On Tue, May 17, 2016 at 9:41 AM, Nick Vasilyev > wrote: > > Hi Yonik, I do see them in the response, but the JSON format is like > > standard facet output. I am

Re: json.facet streaming

2016-05-17 Thread Yonik Seeley
On Tue, May 17, 2016 at 9:41 AM, Nick Vasilyev wrote: > Hi Yonik, I do see them in the response, but the JSON format is like > standard facet output. I am not sure what streaming facet response would > look like, but I expected it to be similar to the streaming API. Is

Re: json.facet streaming

2016-05-17 Thread Nick Vasilyev
Hi Yonik, I do see them in the response, but the JSON format is like standard facet output. I am not sure what streaming facet response would look like, but I expected it to be similar to the streaming API. Is this the case? On Tue, May 17, 2016 at 9:35 AM, Yonik Seeley wrote:

Re: [scottchu] How to specify multiple zk nodes using solr start commandunder Windows

2016-05-17 Thread John Bickerstaff
In your original command, you listed the same port twice. That may have been at least part of the difficulty. It's probably fine to just use one zk node - as the zookeeper instances should be aware of each other. I also assume that if your solr.in.sh (or windows equavalent) has the properly

Re: json.facet streaming

2016-05-17 Thread Yonik Seeley
So it looks like facets are being computed... do you not see them in the response? -Yonik On Tue, May 17, 2016 at 9:12 AM, Nick Vasilyev wrote: > I enabled query debugging, here is the facet-trace snippet. > > "facet-trace":{ >

Re: How to encrypte the password for basic authentication of Solr

2016-05-17 Thread Shawn Heisey
On 5/17/2016 7:23 AM, Shawn Heisey wrote: > On 5/17/2016 2:23 AM, t...@sina.com wrote: >> How to get the encrypted password for the user solr? > You can't get the password from the encrypted version. That's the > entire point of encrypting the password. It occurred to me after I sent this that

Re: How to encrypte the password for basic authentication of Solr

2016-05-17 Thread Shawn Heisey
On 5/17/2016 2:23 AM, t...@sina.com wrote: > In the Wiki > https://cwiki.apache.org/confluence/display/solr/Basic+Authentication+Plugin, > the basic authentication is introduced. With basic authentication, the > sample security.json has the line >

Re: state.json being downloaded every 10 seconds

2016-05-17 Thread Shawn Heisey
On 5/16/2016 10:28 PM, Jeff Wartes wrote: > One thing that still feels a bit odd though is that the health check query > was referencing a collection that no longer existed in the cluster. So it > seems like it was downloading the state for ALL non-hosted collections, not a > requested one. > >

Re: json.facet streaming

2016-05-17 Thread Nick Vasilyev
I enabled query debugging, here is the facet-trace snippet. "facet-trace":{ "processor":"FacetQueryProcessor", "elapse":0, "query":null, "domainSize":43046041, "sub-facet":[{ "processor":"FacetFieldProcessorStream", "elapse":0,

Re: json.facet streaming

2016-05-17 Thread Yonik Seeley
Perhaps try turning on request debugging and see what is actually being received by Solr? -Yonik On Tue, May 17, 2016 at 8:33 AM, Nick Vasilyev wrote: > I am on the nightly build of 6.1 and I am experimenting with json.facet > streaming, however the response I am

json.facet streaming

2016-05-17 Thread Nick Vasilyev
I am on the nightly build of 6.1 and I am experimenting with json.facet streaming, however the response I am getting back looks like regular query response. I was expecting something like the streaming api. Is this right or am I missing something? Hhere is the json.facet string.

Re: SolrCloud replicas consistently out of sync

2016-05-17 Thread Daniel Collins
Terminology question: by nodes I assume you mean machines? So "8 nodes, with 4 shards a piece, all running one collection with about 900M documents", is 1 collection split into 32 shards, with 4 shards located on each machine? Is each shard in its own JVM, or do you have 1 JVM on each machine

RE: http request to MiniSolrCloudCluster

2016-05-17 Thread Rohana Rajapakse
Thanks Chris for your reply. I am aware of the requirement for "baseDir" to be empty. In my test code, I delete the "baseDir" completely (in Junit test setup) and re-create it to make sure there is nothing in it. So, there shouldn't be a problem with that. What I am puzzled is that I have no

Re: [scottchu] How to specify multiple zk nodes using solr start commandunder Windows

2016-05-17 Thread scott.chu
I issue '-z localhost:2181 -z localhost:2182 -z localhost:2183' for each node's start command and later when I create collection, all 3 zk nodes has registered my configset. Never try but I think maybe only use -z localhost:2181, then all 3 nodes in zk ensemble will synchronize themselves.

Re: Creating a collection with 1 shard gives a weird range

2016-05-17 Thread Tom Evans
On Tue, May 17, 2016 at 9:40 AM, John Smith wrote: > I'm trying to create a collection starting with only one shard > (numShards=1) using a compositeID router. The purpose is to start small > and begin splitting shards when the index grows larger. The shard > created gets

Sub faceting on string field using json facet runs extremly slow

2016-05-17 Thread Vijay Tiwary
Hello all, I have an index of 8 shards having 1 replica each distubuted across 8 node solr cloud . Size of index is 300 gb having 30 million documents. Solr json facet runs extremly slow if I am sub faceting on string field even if tnumfound is only around 2 (also I am not returning any rows

Creating a collection with 1 shard gives a weird range

2016-05-17 Thread John Smith
I'm trying to create a collection starting with only one shard (numShards=1) using a compositeID router. The purpose is to start small and begin splitting shards when the index grows larger. The shard created gets a weird range value: 8000-7fff, which doesn't look effective. Indeed, if a

How to encrypte the password for basic authentication of Solr

2016-05-17 Thread tjlp
Hi, In the Wiki https://cwiki.apache.org/confluence/display/solr/Basic+Authentication+Plugin, the basic authentication is introduced. With basic authentication, the sample security.json has the line "credentials":{"solr":"IV0EHq1OnNrj6gvRCwvFwTrZ1+z1oBbnQdiVC3otuq0=

[scottchu] How to specify multiple zk nodes using solr start command under Windows

2016-05-17 Thread scott.chu
I start 3 zk nodes at port 2181,2182, and 2183 on my local machine. Go into Solr 5.4.1 root folder and issue and issue the command in article 'Setting Up an External ZooKeeper Ensemble' in reference guide bin\Solr start -c -s mynodes\node1 -z localhost:2181,localhost:2181,localhost:2183 but

Re: Updating error while add doc to Solrcloud

2016-05-17 Thread scott.chu
I clear the cugna collection data (by renaming 'data' folder to 'xdata')and restart Solrcloud. I add previous possible-error xml doc, it succeeds. So I'm sure doc data has no problem. Is it because the index file size is too large? If the zk nodes fails during adding doc, could it cause this

Updating error while add doc to Solrcloud

2016-05-17 Thread scott.chu
I build Solrcloud with 2 nodes, 1 shard, 2 replica. I add doc in xml format using post.jar up to 2.85M+ no. of docs and 10gb index size. When I add more docs. the solr.log shows: -- 2016-05-17 14:01:09,024 WARN (main) [ ] o.e.j.s.h.RequestLogHandler