Re: Poll: Master-Slave or SolrCloud?

2017-04-24 Thread Erick Erickson
Otis:

bq: But it doesn't really matter so much whether people are the same or not

I'm going to gently disagree here. I regularly see questions on the
user's list about upgrading from 4.x or 3.x (!). So if the sample of
users responding to your poll are substantially the same users as
responded in 2013, there's no guarantee that they've even upgraded
Solr, much less thought it worthwhile to change their paradigm.

I suppose an interesting bit of additional data would be "when did you
start using Solr?". Would there be a greater percentage of responders
using SolrCloud in 2014 .vs. 2013? 2015 .vs. 2014? and so on.

Mind you I have zero data to support any of this, it's speculation and
I haven't looked at the poll so maybe I'm off base

Erick

On Mon, Apr 24, 2017 at 7:29 PM, Otis Gospodnetić
 wrote:
> Hi,
>
> I think it's roughly the same profile of people.  The poll from 2013 was on
> Sematext blog and the new one is on Sematext Twitter account.  But it
> doesn't really matter so much whether people are the same or not.  What
> amazes me that in 2017 we don't see a lot more SolrCloud users!
>
> Otis
> --
> Monitoring - Log Management - Alerting - Anomaly Detection
> Solr & Elasticsearch Consulting Support Training - http://sematext.com/
>
>
> On Mon, Apr 24, 2017 at 8:04 PM, Erick Erickson 
> wrote:
>
>> Yeah, this is kind of counter to my expectations too. I guess my
>> question is whether the same people are responding to the new survey
>> as the old one. "If it ain't broke" and all that.
>>
>> Erick
>>
>> On Mon, Apr 24, 2017 at 7:58 AM, Otis Gospodnetić
>>  wrote:
>> > Hi,
>> >
>> > I'm really really surprised here.  Back in 2013 we did a poll to see how
>> > people were running Master-Slave (4.x back then) and SolrCloud was a bit
>> > more popular than Master-Slave:
>> > https://sematext.com/blog/2013/02/25/poll-solr-cloud-or-not/
>> >
>> > Here is a fresh new poll with pretty much the same question - How do you
>> > run your Solr? 
>> -
>> > and guess what?  SolrCloud is *not* at all a lot more prevalent than
>> > Master-Slave.
>> >
>> > We definitely see a lot more SolrCloud used by Sematext Solr
>> > consulting/support customers, so I'm a bit surprised by the results of
>> this
>> > poll so far.
>> >
>> > Is anyone else surprised by this?  See https://twitter.com/sematext/
>> > status/854927627748036608
>> >
>> > Thanks,
>> > Otis
>> > --
>> > Monitoring - Log Management - Alerting - Anomaly Detection
>> > Solr & Elasticsearch Consulting Support Training - http://sematext.com/
>>


Re: Poll: Master-Slave or SolrCloud?

2017-04-24 Thread Otis Gospodnetić
Hi,

I think it's roughly the same profile of people.  The poll from 2013 was on
Sematext blog and the new one is on Sematext Twitter account.  But it
doesn't really matter so much whether people are the same or not.  What
amazes me that in 2017 we don't see a lot more SolrCloud users!

Otis
--
Monitoring - Log Management - Alerting - Anomaly Detection
Solr & Elasticsearch Consulting Support Training - http://sematext.com/


On Mon, Apr 24, 2017 at 8:04 PM, Erick Erickson 
wrote:

> Yeah, this is kind of counter to my expectations too. I guess my
> question is whether the same people are responding to the new survey
> as the old one. "If it ain't broke" and all that.
>
> Erick
>
> On Mon, Apr 24, 2017 at 7:58 AM, Otis Gospodnetić
>  wrote:
> > Hi,
> >
> > I'm really really surprised here.  Back in 2013 we did a poll to see how
> > people were running Master-Slave (4.x back then) and SolrCloud was a bit
> > more popular than Master-Slave:
> > https://sematext.com/blog/2013/02/25/poll-solr-cloud-or-not/
> >
> > Here is a fresh new poll with pretty much the same question - How do you
> > run your Solr? 
> -
> > and guess what?  SolrCloud is *not* at all a lot more prevalent than
> > Master-Slave.
> >
> > We definitely see a lot more SolrCloud used by Sematext Solr
> > consulting/support customers, so I'm a bit surprised by the results of
> this
> > poll so far.
> >
> > Is anyone else surprised by this?  See https://twitter.com/sematext/
> > status/854927627748036608
> >
> > Thanks,
> > Otis
> > --
> > Monitoring - Log Management - Alerting - Anomaly Detection
> > Solr & Elasticsearch Consulting Support Training - http://sematext.com/
>


Re: Troubleshooting solr errors

2017-04-24 Thread Rick Leir
Daniel,
Would it be too much trouble to get some text out of that particular email 
message, and try it in the Solr Admin Analysis tool? 

By the way, I also have my email in Dovecot. Would you be able to describe how 
you index it and how you query to find an email? Perhaps with scripts in a 
github project?
Thanks -- Rick

On April 24, 2017 5:55:29 PM EDT, Daniel Miller  wrote:
>I'm running Solr 6.4.2 to index my mail server (Dovecot). Searching is 
>great - but periodically I have Solr errors. Previously, when an error 
>would occur Solr would terminate.  I now have it running as a systemd 
>service so it would auto-restart - but it seems like that doesn't solve
>it.
>
>Some of the log lines include:
>
>2017-04-24 18:18:31.101 ERROR (qtp594427726-30) [   x:dovecot] 
>o.a.s.h.RequestHandlerBase org.apache.solr.common.SolrException: 
>Exception writing document id 
>17697/7db132200dd2df4d2f7b3bc41c5f/dmil...@amfes.com to the index; 
>possible analysis error.
>
>2017-04-24 18:18:31.125 ERROR (qtp594427726-32) [   x:dovecot] 
>o.a.s.h.RequestHandlerBase org.apache.solr.common.SolrException: Error 
>opening new searcher
>
>I don't know what else to provide to try to troubleshoot this.
>
>-- 
>Daniel

-- 
Sorry for being brief. Alternate email is rickleir at yahoo dot com 

Re: Poll: Master-Slave or SolrCloud?

2017-04-24 Thread Erick Erickson
Yeah, this is kind of counter to my expectations too. I guess my
question is whether the same people are responding to the new survey
as the old one. "If it ain't broke" and all that.

Erick

On Mon, Apr 24, 2017 at 7:58 AM, Otis Gospodnetić
 wrote:
> Hi,
>
> I'm really really surprised here.  Back in 2013 we did a poll to see how
> people were running Master-Slave (4.x back then) and SolrCloud was a bit
> more popular than Master-Slave:
> https://sematext.com/blog/2013/02/25/poll-solr-cloud-or-not/
>
> Here is a fresh new poll with pretty much the same question - How do you
> run your Solr?  -
> and guess what?  SolrCloud is *not* at all a lot more prevalent than
> Master-Slave.
>
> We definitely see a lot more SolrCloud used by Sematext Solr
> consulting/support customers, so I'm a bit surprised by the results of this
> poll so far.
>
> Is anyone else surprised by this?  See https://twitter.com/sematext/
> status/854927627748036608
>
> Thanks,
> Otis
> --
> Monitoring - Log Management - Alerting - Anomaly Detection
> Solr & Elasticsearch Consulting Support Training - http://sematext.com/


Re: SOLR Indexed object delete automatically.

2017-04-24 Thread Erick Erickson
Are you using Data Import Handler (DIH?)? The "clean" option there
removes all the docs before running.

If not, then _someone_ is sending that command, Solr isn't generating
it by itself.

Best,
Erick

On Mon, Apr 24, 2017 at 12:35 PM, Saurav Maulick  wrote:
> No David. this is not in public domain. I have asked server team to check
> incoming traffic IP also but they didn't find any thing.
>
> On Mon, Apr 24, 2017 at 3:28 PM, David Hastings <
> hastings.recurs...@gmail.com> wrote:
>
>> you dont have to log in to the server to send the delete command.  is the
>> computers ip address public?
>>
>>
>> On Mon, Apr 24, 2017 at 3:13 PM, Saurav Maulick 
>> wrote:
>>
>> > thanks Erick for quick replay.
>> >
>> > I the log file we have found that deleteByQuery entry. but i have checked
>> > with windows team they have confirmed no one login the server on that
>> time.
>> >
>> > see the log entry
>> >
>> > 2017-04-22 03:56:24.452 INFO
>> >  (coreZkRegister-1-thread-1-processing-n:XX.XX.XX.XX:1988_solr
>> > x:ocdsolr_shard2_replica1 s:shard2 c:ocdsolr r:core_node1) [c:ocdsolr
>> > s:shard2 r:core_node1 x:ocdsolr_shard2_replica1]
>> > o.a.s.u.p.LogUpdateProcessorFactory [ocdsolr_shard2_replica1]
>> > {add=[10697.45745.12329.21333 (1565267805698260992),
>> > 10697.45745.11554.26776 (1565275671771480064), 10697.45745.13844.32189
>> > (1565282977827520512)],deleteByQuery=*:* (-1565280896874971136)} 0 2353
>> >
>> > On Mon, Apr 24, 2017 at 2:35 PM, Erick Erickson > >
>> > wrote:
>> >
>> > > Solr does not delete files/documents by itself with the exception of
>> > > TTL (Time To Live, since Solr 4.8. See:
>> > > https://lucidworks.com/2014/05/07/document-expiration/).
>> > >
>> > > so unless you've configured TTL, which I doubt you have if you're
>> > > asking this question then I'd suspect several possibilities:
>> > > > You have DIH set to clean every time it's run and it didn't run
>> > > successfully. So the clean step (which deletes all docs) finished but
>> the
>> > > index didn't get repopulated.
>> > >
>> > > > Someone issued a delete-by-query on *:*
>> > >
>> > > > Someone just deleted the index files from your sever accidentally.
>> > >
>> > > Best,
>> > > Erick
>> > >
>> > > On Mon, Apr 24, 2017 at 8:55 AM, Saurav Maulick 
>> > > wrote:
>> > > > Hi Team,
>> > > >
>> > > > We are using Solr 5.4.1 (in Windows with Zookeper). Our problem is
>> Solr
>> > > > automatically delete all indexed files.from last couple of days.
>> > > >
>> > > > Is their any Maxout setting which can cause this issue?
>> > > >
>> > > > This is our production issue and any quick help is highly
>> appreciated.
>> > > >
>> > > > --
>> > > > Thanks and Regards,
>> > > > Saurav Maulick
>> > >
>> >
>> >
>> >
>> > --
>> > Thanks and Regards,
>> > Saurav Maulick
>> >
>>
>
>
>
> --
> Thanks and Regards,
> Saurav Maulick


Re: How to add extra server to Cloud instance?

2017-04-24 Thread Erick Erickson
Just point the new Solr instance at the same Zookeeper ensemble, it'll
be added automatically.

However, you say: "..add one more server to this cloud to have extra
documents indexed and better performance"

What kind of "performance" are we talking here? If you want to add
another _shard_ because you have too many docs for adequate
performance even under light load, you'll have to either define a new
collection or use the SPLITSHARD command.

If you have adequate performance under light load but want to serve
more queries, then just ADDREPLICA.

Best,
Erick

On Mon, Apr 24, 2017 at 3:51 PM, Nilesh Kamani  wrote:
> Hello All,
>
> I created solr cloud instance and collection on Google cloud (Windows
> Instance).
> I used below command.
>
> *solr create_collection -c booleansearch -shards 1 -replicationFactor 1*
>
>
> I would like to add one more server to this cloud to have extra documents
> indexed and better performance.
> Could you please suggest me the steps I need to perform ?
>
> Thanks,
> Nilesh Kamani


How to add extra server to Cloud instance?

2017-04-24 Thread Nilesh Kamani
Hello All,

I created solr cloud instance and collection on Google cloud (Windows
Instance).
I used below command.

*solr create_collection -c booleansearch -shards 1 -replicationFactor 1*


I would like to add one more server to this cloud to have extra documents
indexed and better performance.
Could you please suggest me the steps I need to perform ?

Thanks,
Nilesh Kamani


Troubleshooting solr errors

2017-04-24 Thread Daniel Miller
I'm running Solr 6.4.2 to index my mail server (Dovecot). Searching is 
great - but periodically I have Solr errors. Previously, when an error 
would occur Solr would terminate.  I now have it running as a systemd 
service so it would auto-restart - but it seems like that doesn't solve it.


Some of the log lines include:

2017-04-24 18:18:31.101 ERROR (qtp594427726-30) [   x:dovecot] 
o.a.s.h.RequestHandlerBase org.apache.solr.common.SolrException: 
Exception writing document id 
17697/7db132200dd2df4d2f7b3bc41c5f/dmil...@amfes.com to the index; 
possible analysis error.


2017-04-24 18:18:31.125 ERROR (qtp594427726-32) [   x:dovecot] 
o.a.s.h.RequestHandlerBase org.apache.solr.common.SolrException: Error 
opening new searcher


I don't know what else to provide to try to troubleshoot this.

--
Daniel



Re: SOLR Indexed object delete automatically.

2017-04-24 Thread Saurav Maulick
No David. this is not in public domain. I have asked server team to check
incoming traffic IP also but they didn't find any thing.

On Mon, Apr 24, 2017 at 3:28 PM, David Hastings <
hastings.recurs...@gmail.com> wrote:

> you dont have to log in to the server to send the delete command.  is the
> computers ip address public?
>
>
> On Mon, Apr 24, 2017 at 3:13 PM, Saurav Maulick 
> wrote:
>
> > thanks Erick for quick replay.
> >
> > I the log file we have found that deleteByQuery entry. but i have checked
> > with windows team they have confirmed no one login the server on that
> time.
> >
> > see the log entry
> >
> > 2017-04-22 03:56:24.452 INFO
> >  (coreZkRegister-1-thread-1-processing-n:XX.XX.XX.XX:1988_solr
> > x:ocdsolr_shard2_replica1 s:shard2 c:ocdsolr r:core_node1) [c:ocdsolr
> > s:shard2 r:core_node1 x:ocdsolr_shard2_replica1]
> > o.a.s.u.p.LogUpdateProcessorFactory [ocdsolr_shard2_replica1]
> > {add=[10697.45745.12329.21333 (1565267805698260992),
> > 10697.45745.11554.26776 (1565275671771480064), 10697.45745.13844.32189
> > (1565282977827520512)],deleteByQuery=*:* (-1565280896874971136)} 0 2353
> >
> > On Mon, Apr 24, 2017 at 2:35 PM, Erick Erickson  >
> > wrote:
> >
> > > Solr does not delete files/documents by itself with the exception of
> > > TTL (Time To Live, since Solr 4.8. See:
> > > https://lucidworks.com/2014/05/07/document-expiration/).
> > >
> > > so unless you've configured TTL, which I doubt you have if you're
> > > asking this question then I'd suspect several possibilities:
> > > > You have DIH set to clean every time it's run and it didn't run
> > > successfully. So the clean step (which deletes all docs) finished but
> the
> > > index didn't get repopulated.
> > >
> > > > Someone issued a delete-by-query on *:*
> > >
> > > > Someone just deleted the index files from your sever accidentally.
> > >
> > > Best,
> > > Erick
> > >
> > > On Mon, Apr 24, 2017 at 8:55 AM, Saurav Maulick 
> > > wrote:
> > > > Hi Team,
> > > >
> > > > We are using Solr 5.4.1 (in Windows with Zookeper). Our problem is
> Solr
> > > > automatically delete all indexed files.from last couple of days.
> > > >
> > > > Is their any Maxout setting which can cause this issue?
> > > >
> > > > This is our production issue and any quick help is highly
> appreciated.
> > > >
> > > > --
> > > > Thanks and Regards,
> > > > Saurav Maulick
> > >
> >
> >
> >
> > --
> > Thanks and Regards,
> > Saurav Maulick
> >
>



-- 
Thanks and Regards,
Saurav Maulick


Re: SOLR Indexed object delete automatically.

2017-04-24 Thread David Hastings
you dont have to log in to the server to send the delete command.  is the
computers ip address public?


On Mon, Apr 24, 2017 at 3:13 PM, Saurav Maulick  wrote:

> thanks Erick for quick replay.
>
> I the log file we have found that deleteByQuery entry. but i have checked
> with windows team they have confirmed no one login the server on that time.
>
> see the log entry
>
> 2017-04-22 03:56:24.452 INFO
>  (coreZkRegister-1-thread-1-processing-n:XX.XX.XX.XX:1988_solr
> x:ocdsolr_shard2_replica1 s:shard2 c:ocdsolr r:core_node1) [c:ocdsolr
> s:shard2 r:core_node1 x:ocdsolr_shard2_replica1]
> o.a.s.u.p.LogUpdateProcessorFactory [ocdsolr_shard2_replica1]
> {add=[10697.45745.12329.21333 (1565267805698260992),
> 10697.45745.11554.26776 (1565275671771480064), 10697.45745.13844.32189
> (1565282977827520512)],deleteByQuery=*:* (-1565280896874971136)} 0 2353
>
> On Mon, Apr 24, 2017 at 2:35 PM, Erick Erickson 
> wrote:
>
> > Solr does not delete files/documents by itself with the exception of
> > TTL (Time To Live, since Solr 4.8. See:
> > https://lucidworks.com/2014/05/07/document-expiration/).
> >
> > so unless you've configured TTL, which I doubt you have if you're
> > asking this question then I'd suspect several possibilities:
> > > You have DIH set to clean every time it's run and it didn't run
> > successfully. So the clean step (which deletes all docs) finished but the
> > index didn't get repopulated.
> >
> > > Someone issued a delete-by-query on *:*
> >
> > > Someone just deleted the index files from your sever accidentally.
> >
> > Best,
> > Erick
> >
> > On Mon, Apr 24, 2017 at 8:55 AM, Saurav Maulick 
> > wrote:
> > > Hi Team,
> > >
> > > We are using Solr 5.4.1 (in Windows with Zookeper). Our problem is Solr
> > > automatically delete all indexed files.from last couple of days.
> > >
> > > Is their any Maxout setting which can cause this issue?
> > >
> > > This is our production issue and any quick help is highly appreciated.
> > >
> > > --
> > > Thanks and Regards,
> > > Saurav Maulick
> >
>
>
>
> --
> Thanks and Regards,
> Saurav Maulick
>


Re: SOLR Indexed object delete automatically.

2017-04-24 Thread Saurav Maulick
thanks Erick for quick replay.

I the log file we have found that deleteByQuery entry. but i have checked
with windows team they have confirmed no one login the server on that time.

see the log entry

2017-04-22 03:56:24.452 INFO
 (coreZkRegister-1-thread-1-processing-n:XX.XX.XX.XX:1988_solr
x:ocdsolr_shard2_replica1 s:shard2 c:ocdsolr r:core_node1) [c:ocdsolr
s:shard2 r:core_node1 x:ocdsolr_shard2_replica1]
o.a.s.u.p.LogUpdateProcessorFactory [ocdsolr_shard2_replica1]
{add=[10697.45745.12329.21333 (1565267805698260992),
10697.45745.11554.26776 (1565275671771480064), 10697.45745.13844.32189
(1565282977827520512)],deleteByQuery=*:* (-1565280896874971136)} 0 2353

On Mon, Apr 24, 2017 at 2:35 PM, Erick Erickson 
wrote:

> Solr does not delete files/documents by itself with the exception of
> TTL (Time To Live, since Solr 4.8. See:
> https://lucidworks.com/2014/05/07/document-expiration/).
>
> so unless you've configured TTL, which I doubt you have if you're
> asking this question then I'd suspect several possibilities:
> > You have DIH set to clean every time it's run and it didn't run
> successfully. So the clean step (which deletes all docs) finished but the
> index didn't get repopulated.
>
> > Someone issued a delete-by-query on *:*
>
> > Someone just deleted the index files from your sever accidentally.
>
> Best,
> Erick
>
> On Mon, Apr 24, 2017 at 8:55 AM, Saurav Maulick 
> wrote:
> > Hi Team,
> >
> > We are using Solr 5.4.1 (in Windows with Zookeper). Our problem is Solr
> > automatically delete all indexed files.from last couple of days.
> >
> > Is their any Maxout setting which can cause this issue?
> >
> > This is our production issue and any quick help is highly appreciated.
> >
> > --
> > Thanks and Regards,
> > Saurav Maulick
>



-- 
Thanks and Regards,
Saurav Maulick


Re: SOLR Indexed object delete automatically.

2017-04-24 Thread Erick Erickson
Solr does not delete files/documents by itself with the exception of
TTL (Time To Live, since Solr 4.8. See:
https://lucidworks.com/2014/05/07/document-expiration/).

so unless you've configured TTL, which I doubt you have if you're
asking this question then I'd suspect several possibilities:
> You have DIH set to clean every time it's run and it didn't run successfully. 
> So the clean step (which deletes all docs) finished but the index didn't get 
> repopulated.

> Someone issued a delete-by-query on *:*

> Someone just deleted the index files from your sever accidentally.

Best,
Erick

On Mon, Apr 24, 2017 at 8:55 AM, Saurav Maulick  wrote:
> Hi Team,
>
> We are using Solr 5.4.1 (in Windows with Zookeper). Our problem is Solr
> automatically delete all indexed files.from last couple of days.
>
> Is their any Maxout setting which can cause this issue?
>
> This is our production issue and any quick help is highly appreciated.
>
> --
> Thanks and Regards,
> Saurav Maulick


Multiple tables data aggregation

2017-04-24 Thread solr2020
Hi,

We have set of mongodb collections which has one-many mapping/relation. So
we are trying to create single document in solr from rows of different
mongodb collections at the time of indexing. Can anyone suggest the best
approach to achieve this?

Thanks.



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Multiple-tables-data-aggregation-tp4331667.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Using multi valued field in solr cloud Graph Traversal Query

2017-04-24 Thread Ganesh M
Hi Joel,

Any idea from when multi value field is supported for gatherNodes ? I am
using version 6.5 ? Is it already there ?

Kindly update,
Ganesh

On Sat, Mar 11, 2017 at 7:51 AM Joel Bernstein  wrote:

> Currently gatherNodes only works on single value fields. You can seed a
> gatherNodes with a facet() expression which works with multi-value fields,
> but after that it only works with single value fields.
>
> So you would have to index the data as a graph like this:
>
> id, concept1, participant1
> id, concept1, participant2
> id, concept2, participant1
> id, concept2, participant3
> id, concept3, participant2
> 
>
> Then you walk the graph like this:
>
> gatherNodes(mydata,
>   gatheNodes(mydata, walk="concept1->conceptID",
> gather="participantID")
>   walk="node->particpantID",
>   gather="conceptID")
>
> This is a two step graph expression:
> 1) Gathers all the participantID's where concept1 is in the conceptID
> field.
> 2) Gathers all the conceptID's for the participantID's gathered in step 1.
>
> Let me know if you have other questions about how to structure the data or
> run the queries.
>
>
>
>
>
>
>
>
> Adding multi-value field support is a fairly high priority so I would
> expect this to be coming in a future release.
>
>
>
>
> Joel Bernstein
> http://joelsolr.blogspot.com/
>
> On Fri, Mar 10, 2017 at 5:15 PM, Pratik Patel  wrote:
>
> > I am trying to do a graph traversal query using gatherNode function. I am
> > seeding a streaming expression to get some documents and then I am trying
> > to map their ids(conceptid) to a multi valued field "participantIds" and
> > gather nodes.
> >
> > Here is the query I am doing.
> >
> >
> > gatherNodes(collection1,
> > > search(collection1,q="*:*",fl="conceptid",sort="conceptid
> > > asc",fq=storeid:"524efcfd505637004b1f6f24",fq=tags:"Project"),
> > > walk=conceptid->participantIds,
> > > gather="conceptid")
> >
> >
> > The field participantIds is a multi valued field. This is the field which
> > holds connections between the documents. When I execute this query, I get
> > exception as below.
> >
> >
> > { "result-set": { "docs": [ { "EXCEPTION":
> > "java.util.concurrent.ExecutionException: java.lang.RuntimeException:
> > java.io.IOException: java.util.concurrent.ExecutionException:
> > java.io.IOException: -->
> > http://169.254.40.158:8081/solr/collection1_shard1_replica1/:can not
> sort
> > on multivalued field: participantIds", "EOF": true, "RESPONSE_TIME": 15
> } ]
> > } }
> >
> >
> > Does this mean you can not look into multivalued fields in graph
> traversal
> > query? In our solr index, we have documents having "conceptid" field
> which
> > is id and we have participantIds which is a multivalued field storing
> > connections of that documents to other documents. I believe we need to
> have
> > one field in document which stores connections of that document so that
> > graph traversal is possible. If not, what is the other the way to index
> > graph data and use graph traversal. I am trying to explore graph
> traversal
> > and am new to it. Any help would be appreciated.
> >
> > Thanks,
> > Pratik
> >
>


Re: Using multi valued field in solr cloud Graph Traversal Query

2017-04-24 Thread mganeshs
Hi Joel,

Any idea from when multi value field is supported for gatherNodes ? I am
using version 6.5 ? Is it already there ? 

Kindly update,
Ganesh



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Using-multi-valued-field-in-solr-cloud-Graph-Traversal-Query-tp4324379p4331663.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Solr SQL Subquery Support

2017-04-24 Thread Joel Bernstein
The docs describe the supported features. So if it's not listed in the docs
it's not yet supported. The goal is to gradually increase SQL support.

Joel Bernstein
http://joelsolr.blogspot.com/

On Mon, Apr 24, 2017 at 1:08 PM, Joel Bernstein  wrote:

> No
>
> Joel Bernstein
> http://joelsolr.blogspot.com/
>
> On Mon, Apr 24, 2017 at 12:57 PM, Furkan KAMACI 
> wrote:
>
>> Hi,
>>
>> Does Solr SQL supports subqueries?
>>
>> Kind Regards,
>> Furkan KAMACI
>>
>
>


Re: Solr SQL Subquery Support

2017-04-24 Thread Joel Bernstein
No

Joel Bernstein
http://joelsolr.blogspot.com/

On Mon, Apr 24, 2017 at 12:57 PM, Furkan KAMACI 
wrote:

> Hi,
>
> Does Solr SQL supports subqueries?
>
> Kind Regards,
> Furkan KAMACI
>


Solr SQL Subquery Support

2017-04-24 Thread Furkan KAMACI
Hi,

Does Solr SQL supports subqueries?

Kind Regards,
Furkan KAMACI


Re: Mixing AND OR conditions with query parameters

2017-04-24 Thread VJ
Thanks MIchael and Erick.

With your valuable comments I finally managed to get the expected result
out of the query.

Regards,
VJ



On Mon, Apr 24, 2017 at 8:17 PM, Erick Erickson 
wrote:

> Michael's comments are spot on, and the deeper thing you should be
> aware of is that Solr query parsing does not implement strict boolean
> logic, see Chris Hostetter's excellent blog here:
> https://lucidworks.com/2011/12/28/why-not-and-or-and-not/
>
> As Michael suggests, parenthesizing carefully can make the parsing
> behave like boolean logic.
>
> Best,
> Erick
>
> On Mon, Apr 24, 2017 at 4:05 AM, Michael Kuhlmann  wrote:
> > Make sure to have a whitespace are the OR operator.
> >
> > The parenthesises should be around the OR query, not including the "fq:"
> > -- this should be outside the parenthesises (which are not necessary at
> > all).
> >
> > What exactly are you expecting?
> >
> > -Michael
> >
> > Am 24.04.2017 um 12:59 schrieb VJ:
> >> Hi All,
> >>
> >> I am facing issues with OR/AND conditions with query parameters:
> >>
> >> fq=cioname:"XYZ" & (fq=attr1:trueORattr2:true)
> >>
> >> The queries are not returning expected results.
> >>
> >> I have tried various permutation and combinations but couldn't get it
> >> working. Any pointers on this?
> >>
> >>
> >>
> >> Regards,
> >> VJ
> >>
> >
>


Re: Numbers in the spellchecker: Mutations based on Levenshtein distance possible?

2017-04-24 Thread alessandro.benedetti
I would start with your spellchecker config and field analysis.
Then we can try to help you!

A shot in the dark is some kind of token filter such the word delimiter
which generates token that you are not expecting in the index ( if you are
using the Direct Spellcheck).
You can use your analysis tool in the admin dashboard or the schema browser
to take a look on what is happening.

Cheers



-
---
Alessandro Benedetti
Search Consultant, R Software Engineer, Director
Sease Ltd. - www.sease.io
--
View this message in context: 
http://lucene.472066.n3.nabble.com/Numbers-in-the-spellchecker-Mutations-based-on-Levenshtein-distance-possible-tp4331635p4331646.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Working with Dates - More DateRangeField Details

2017-04-24 Thread Vincenzo D'Amore
Thanks for your suggestion :)

On Mon, Apr 24, 2017 at 3:08 PM, Shawn Heisey  wrote:

> On 4/24/2017 3:24 AM, Vincenzo D'Amore wrote:
> > https://cwiki.apache.org/confluence/display/solr/Working+with+Dates#
> WorkingwithDates-MoreDateRangeFieldDetails
> >
> > And I found this filter query pretty interesting for me, so I was trying
> to
> > use it with a TrieDateField
> >
> > fq={!field f=Data_Ingresso op=Contains}[2013 TO 2018]
> >
> >  > omitNorms="true" />
> >  > multiValued="false" />
> >
> > But trying the example I get this error:
> >
> > Invalid Date String:'[2013 TO 2018]'
>
> This kind of range works with DateRangeField, it does *not* work with
> TrieDateField.  You're going to have to completely reindex after you
> change the class on your fieldType and reload the collection or core.
>
> https://wiki.apache.org/solr/HowToReindex
>
> Thanks,
> Shawn
>
>


-- 
Vincenzo D'Amore
email: v.dam...@gmail.com
skype: free.dev
mobile: +39 349 8513251


SOLR Indexed object delete automatically.

2017-04-24 Thread Saurav Maulick
Hi Team,

We are using Solr 5.4.1 (in Windows with Zookeper). Our problem is Solr
automatically delete all indexed files.from last couple of days.

Is their any Maxout setting which can cause this issue?

This is our production issue and any quick help is highly appreciated.

-- 
Thanks and Regards,
Saurav Maulick


Numbers in the spellchecker: Mutations based on Levenshtein distance possible?

2017-04-24 Thread Christian Ortner
Hello everyone,

I'm working on a query correction feature based on collations generated by
the spellchecker. This works like a charm, except when numeric tokens are
present in the query. In that case, I don't get any corrections for the
number, although corrections for textual tokens are still made.

Any ideas?

Best regards,
Chris


Re: Modify solr score

2017-04-24 Thread tstusr
We came with a simple solution.

We use  termfreq    and
write a simple processor that counts words for making a boost function that
only calculates the ratio between words that hit terms and the whole field
length.

Some tests are being made, maybe it could solves the problem.

Thanks for your help!



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Modify-solr-score-tp4331300p4331614.html
Sent from the Solr - User mailing list archive at Nabble.com.


RE: CloudDescriptor.getNumShards() sometimes returns null

2017-04-24 Thread Markus Jelsma
Sorry, forgot to mention the version, it is on 6.5.0.

Thanks,
Markus

 
 
-Original message-
> From:Erick Erickson 
> Sent: Monday 24th April 2017 16:50
> To: solr-user 
> Subject: Re: CloudDescriptor.getNumShards() sometimes returns null
> 
> What version of Solr? This has been reworked pretty heavily lately no
> 6x and trunk.
> 
> On Mon, Apr 24, 2017 at 2:24 AM, Markus Jelsma
>  wrote:
> > Hi - that (RE: Overseer session expires on multiple collection creation) 
> > was the wrong thread. I meant, any ideas on this one?
> >
> > Many thanks,
> > Markus
> >
> >
> > -Original message-
> >> From:Markus Jelsma 
> >> Sent: Friday 14th April 2017 17:25
> >> To: solr-user 
> >> Subject: CloudDescriptor.getNumShards() sometimes returns null
> >>
> >> Hi - I've got this 2 shard/2 replica cluster. In handler i need the number 
> >> of shards of the cluster.
> >>
> >> cloudDescriptor = core.getCoreDescriptor().getCloudDescriptor();
> >> return cloudDescriptor.getNumShards();
> >>
> >> It is, however, depending on which node is executing this, sometimes null! 
> >> This code only runs on shard leaders. First replica of the first shard 
> >> always returns 2, but second replica of shard one, even when it is the 
> >> leader, always gets me null. Same appears to be true for the second shard.
> >>
> >> I am clearly missing something, any ideas to share?
> >>
> >> Thanks,
> >> Markus
> >>
> 


Poll: Master-Slave or SolrCloud?

2017-04-24 Thread Otis Gospodnetić
Hi,

I'm really really surprised here.  Back in 2013 we did a poll to see how
people were running Master-Slave (4.x back then) and SolrCloud was a bit
more popular than Master-Slave:
https://sematext.com/blog/2013/02/25/poll-solr-cloud-or-not/

Here is a fresh new poll with pretty much the same question - How do you
run your Solr?  -
and guess what?  SolrCloud is *not* at all a lot more prevalent than
Master-Slave.

We definitely see a lot more SolrCloud used by Sematext Solr
consulting/support customers, so I'm a bit surprised by the results of this
poll so far.

Is anyone else surprised by this?  See https://twitter.com/sematext/
status/854927627748036608

Thanks,
Otis
--
Monitoring - Log Management - Alerting - Anomaly Detection
Solr & Elasticsearch Consulting Support Training - http://sematext.com/


Re: Mixing AND OR conditions with query parameters

2017-04-24 Thread Erick Erickson
Michael's comments are spot on, and the deeper thing you should be
aware of is that Solr query parsing does not implement strict boolean
logic, see Chris Hostetter's excellent blog here:
https://lucidworks.com/2011/12/28/why-not-and-or-and-not/

As Michael suggests, parenthesizing carefully can make the parsing
behave like boolean logic.

Best,
Erick

On Mon, Apr 24, 2017 at 4:05 AM, Michael Kuhlmann  wrote:
> Make sure to have a whitespace are the OR operator.
>
> The parenthesises should be around the OR query, not including the "fq:"
> -- this should be outside the parenthesises (which are not necessary at
> all).
>
> What exactly are you expecting?
>
> -Michael
>
> Am 24.04.2017 um 12:59 schrieb VJ:
>> Hi All,
>>
>> I am facing issues with OR/AND conditions with query parameters:
>>
>> fq=cioname:"XYZ" & (fq=attr1:trueORattr2:true)
>>
>> The queries are not returning expected results.
>>
>> I have tried various permutation and combinations but couldn't get it
>> working. Any pointers on this?
>>
>>
>>
>> Regards,
>> VJ
>>
>


Re: CloudDescriptor.getNumShards() sometimes returns null

2017-04-24 Thread Erick Erickson
What version of Solr? This has been reworked pretty heavily lately no
6x and trunk.

On Mon, Apr 24, 2017 at 2:24 AM, Markus Jelsma
 wrote:
> Hi - that (RE: Overseer session expires on multiple collection creation) was 
> the wrong thread. I meant, any ideas on this one?
>
> Many thanks,
> Markus
>
>
> -Original message-
>> From:Markus Jelsma 
>> Sent: Friday 14th April 2017 17:25
>> To: solr-user 
>> Subject: CloudDescriptor.getNumShards() sometimes returns null
>>
>> Hi - I've got this 2 shard/2 replica cluster. In handler i need the number 
>> of shards of the cluster.
>>
>> cloudDescriptor = core.getCoreDescriptor().getCloudDescriptor();
>> return cloudDescriptor.getNumShards();
>>
>> It is, however, depending on which node is executing this, sometimes null! 
>> This code only runs on shard leaders. First replica of the first shard 
>> always returns 2, but second replica of shard one, even when it is the 
>> leader, always gets me null. Same appears to be true for the second shard.
>>
>> I am clearly missing something, any ideas to share?
>>
>> Thanks,
>> Markus
>>


Re: A problem with depolying:solr6.5 report 404 error

2017-04-24 Thread Shawn Heisey
On 4/24/2017 5:56 AM, Rick Leir wrote:
> David,
> Did you say Tomcat? Try to install without Tomcat. Then, after starting Solr, 
> look in solr.log.
> HTH -- Rick
>
> On April 23, 2017 9:44:30 PM EDT, David  wrote:
>> I have a problem with deploying the solr6.5. my environment are
>> windows7+jdk 8u131+tomcat9.0+solr6.5. java run successfully, tomcat run
>> successfully, solr6.5 has been depolyed. Enter
>> http://localhost:8080/solr/index.html in firefox, and then report 404

+1 to what Rick said.

Although Solr is a webapp, since version 5.0, the only supported
deployment option is the jetty that Solr comes with and the scripts
provided.

https://wiki.apache.org/solr/WhyNoWar

It is possible that in the last two years (since 5.0 was released) Solr
has become reliant on the code mechanisms provided by Jetty and certain
things will no longer function under Tomcat.

You could try the url without "index.html" and see whether it works ...
but you're running an unsupported configuration. To get help, you're
going to have to run it as it was shipped, not in Tomcat.

I tried the URL you mentioned on a Solr 6.5 download running with the
included scripts, and it worked ... but under Tomcat, we can't be sure
of anything.

Thanks,
Shawn



Re: Working with Dates - More DateRangeField Details

2017-04-24 Thread Shawn Heisey
On 4/24/2017 3:24 AM, Vincenzo D'Amore wrote:
> https://cwiki.apache.org/confluence/display/solr/Working+with+Dates#WorkingwithDates-MoreDateRangeFieldDetails
>
> And I found this filter query pretty interesting for me, so I was trying to
> use it with a TrieDateField
>
> fq={!field f=Data_Ingresso op=Contains}[2013 TO 2018]
>
>  omitNorms="true" />
>  multiValued="false" />
>
> But trying the example I get this error:
>
> Invalid Date String:'[2013 TO 2018]'

This kind of range works with DateRangeField, it does *not* work with
TrieDateField.  You're going to have to completely reindex after you
change the class on your fieldType and reload the collection or core.

https://wiki.apache.org/solr/HowToReindex

Thanks,
Shawn



Re: HttpSolrServer default connection timeout and socket timeout

2017-04-24 Thread Shawn Heisey
On 4/24/2017 1:32 AM, Lasitha Wattaladeniya wrote:
> I tried out HttpSolrServer.setConnectionTimeout() method. But im getting
> java.lang.UnsupportedException error in my logs. After a littlebit research
> I found this issue,
>
> https://issues.apache.org/jira/browse/SOLR-6542
>
> Any workarounds for this issue?

I commented on that issue and closed it as Invalid.

If you are creating the HttpClient object external to the Solr client,
you cannot use SolrJ methods to set timeouts.  You must set the timeouts
on the HttpClient object before you give it to SolrJ.  The second code
example on SOLR-6542 has HttpClient code to do this, but then it
proceeds to also try and set the timeouts using SolrJ methods, which
isn't going to work.

If you are running into this exception when you are *not* creating
HttpClient yourself, that's a different problem ... but I can pretty
much guarantee that it won't be addressed in any 4.x or 5.x version. 
HttpSolrServer has been deprecated in the 5.x versions and is completely
gone in the 6.x versions.  It has been replaced by HttpSolrClient.

Thanks,
Shawn



RE: DistributedUpdateProcessorFactory was explicitly disabled from this updateRequestProcessorChain

2017-04-24 Thread Pratik Thaker
Hi Alessandro,

Can you please suggest what should be the correct order of adding processors ?

I am having 5 collections, 6 shards, replication factor 2, 3 nodes on 3 
separate VMs.

Regards,
Pratik Thaker

-Original Message-
From: alessandro.benedetti [mailto:a.benede...@sease.io]
Sent: 21 April 2017 13:38
To: solr-user@lucene.apache.org
Subject: RE: DistributedUpdateProcessorFactory was explicitly disabled from 
this updateRequestProcessorChain

Let's make a quick differentiation between PRE and POST processors in a Solr 
Cloud atchitecture :

 "In a single node, stand-alone Solr, each update is run through all the update 
processors in a chain exactly once. But the behavior of update request 
processors in SolrCloud deserves special consideration. " cit. wiki

*PRE PROCESSORS*
All the processors defined BEFORE the distributedUpdateProcessor happen ONLY on 
the first node that receive the update ( regardless if it is a leader or a 
replica ).

*POST PROCESSORS*
The distributedUpdateProcessor will forward the update request to the the 
correct leader ( or multiple leaders if the request involves more shards), the 
leader will then forward to the replicas.
The leaders and replicas at this point will execute all the update request 
processors defined AFTER the distributedUpdateProcessor.

" Pre-processors and Atomic Updates
Because DistributedUpdateProcessor is responsible for processing Atomic Updates 
into full documents on the leader node, this means that pre-processors which 
are executed only on the forwarding nodes can only operate on the partial 
document. If you have a processor which must process a full document then the 
only choice is to specify it as a post-processor."
wiki

In your example, your chain is definitely messed up, the order is important and 
you want your heavy processing to happen only on the first node.

For better info and clarification:
https://cwiki.apache.org/confluence/display/solr/Schemaless+Mode ( you can find 
here a working alternative to your chain) 
https://cwiki.apache.org/confluence/display/solr/Update+Request+Processors



-
---
Alessandro Benedetti
Search Consultant, R Software Engineer, Director Sease Ltd. - www.sease.io
--
View this message in context: 
http://lucene.472066.n3.nabble.com/DistributedUpdateProcessorFactory-was-explicitly-disabled-from-this-updateRequestProcessorChain-tp4319154p4331215.html
Sent from the Solr - User mailing list archive at Nabble.com.

 The information in this email is confidential and may be legally privileged. 
It is intended solely for the addressee. Access to this email by anyone else is 
unauthorised. If you are not the intended recipient, any disclosure, copying, 
distribution or any action taken or omitted to be taken in reliance on it, is 
prohibited and may be unlawful.


Re: prefix facet performance

2017-04-24 Thread Yonik Seeley
In SimpleFacets.getFacetTermEnumCounts, we seek to the first term
matching the prefix using the index and then for each term after
compare the prefix until it no longer matches.

-Yonik


On Mon, Apr 24, 2017 at 5:04 AM, alessandro.benedetti
 wrote:
> Thanks Yonik and Maria.
> It make sense, if we reduce the number of terms, term enum becomes a very
> good solution.
> @Yonik : do we still check the prefix on the term dictionary one by one, or
> an FST is used to identify the set of candidate terms ?
>
> I will check the code later,
>
> Regards
>
>
>
> -
> ---
> Alessandro Benedetti
> Search Consultant, R Software Engineer, Director
> Sease Ltd. - www.sease.io
> --
> View this message in context: 
> http://lucene.472066.n3.nabble.com/prefix-facet-performance-tp4330684p4331553.html
> Sent from the Solr - User mailing list archive at Nabble.com.


Re: A problem with depolying:solr6.5 report 404 error

2017-04-24 Thread Rick Leir
David,
Did you say Tomcat? Try to install without Tomcat. Then, after starting Solr, 
look in solr.log.
HTH -- Rick

On April 23, 2017 9:44:30 PM EDT, David  wrote:
>Dear,manager:
>
>
>I have a problem with deploying the solr6.5. my environment are
>windows7+jdk 8u131+tomcat9.0+solr6.5. java run successfully, tomcat run
>successfully, solr6.5 has been depolyed. Enter
>http://localhost:8080/solr/index.html in firefox, and then report 404
>error. The detail is "The origin server did not find a current
>representation for the target resource or is not willing to disclose
>that one exists." Could you tell me why it happened and how to solve
>this problem? Thank you!
>
>
>sincerely yours
>David.Wu

-- 
Sorry for being brief. Alternate email is rickleir at yahoo dot com 

Re: Inconsistent Counts in Cloud at Solr SQL Queries

2017-04-24 Thread Furkan KAMACI
Thanks for the answer! Does facet uses Solr Json requests or new facet API
(which is faster than the old one)?

On Mon, Apr 24, 2017 at 2:18 PM, Joel Bernstein  wrote:

> SQL has two aggregation modes: facet and map_reduce. Facet uses the json
> facet API directly so SOLR-7452 would apply if it hasn't been resolved yet.
> map_reduce always gives accurate results regardless of the cardinality but
> is slower. To increase performance using map_reduce you need to increase
> the size of the cluster (workers, shards, replicas).
>
> Joel Bernstein
> http://joelsolr.blogspot.com/
>
> On Mon, Apr 24, 2017 at 5:09 AM, Furkan KAMACI 
> wrote:
>
> > Hi,
> >
> > As you know that json facet api returns inconsistent counts in cloud set
> up
> > (SOLR-7452). I would like to learn that is the situation same for Solr
> SQL
> > queries too?
> >
> > Kind Regards,
> > Furkan KAMACI
> >
>


Re: Inconsistent Counts in Cloud at Solr SQL Queries

2017-04-24 Thread Joel Bernstein
SQL has two aggregation modes: facet and map_reduce. Facet uses the json
facet API directly so SOLR-7452 would apply if it hasn't been resolved yet.
map_reduce always gives accurate results regardless of the cardinality but
is slower. To increase performance using map_reduce you need to increase
the size of the cluster (workers, shards, replicas).

Joel Bernstein
http://joelsolr.blogspot.com/

On Mon, Apr 24, 2017 at 5:09 AM, Furkan KAMACI 
wrote:

> Hi,
>
> As you know that json facet api returns inconsistent counts in cloud set up
> (SOLR-7452). I would like to learn that is the situation same for Solr SQL
> queries too?
>
> Kind Regards,
> Furkan KAMACI
>


Re: Mixing AND OR conditions with query parameters

2017-04-24 Thread Michael Kuhlmann
Make sure to have a whitespace are the OR operator.

The parenthesises should be around the OR query, not including the "fq:"
-- this should be outside the parenthesises (which are not necessary at
all).

What exactly are you expecting?

-Michael

Am 24.04.2017 um 12:59 schrieb VJ:
> Hi All,
>
> I am facing issues with OR/AND conditions with query parameters:
>
> fq=cioname:"XYZ" & (fq=attr1:trueORattr2:true)
>
> The queries are not returning expected results.
>
> I have tried various permutation and combinations but couldn't get it
> working. Any pointers on this?
>
>
>
> Regards,
> VJ
>



Mixing AND OR conditions with query parameters

2017-04-24 Thread VJ
Hi All,

I am facing issues with OR/AND conditions with query parameters:

fq=cioname:"XYZ" & (fq=attr1:trueORattr2:true)

The queries are not returning expected results.

I have tried various permutation and combinations but couldn't get it
working. Any pointers on this?



Regards,
VJ


Working with Dates - More DateRangeField Details

2017-04-24 Thread Vincenzo D'Amore
Hi All,

I was reading the Apache Solr Reference Guide - Working with Dates

https://cwiki.apache.org/confluence/display/solr/Working+with+Dates#WorkingwithDates-MoreDateRangeFieldDetails

And I found this filter query pretty interesting for me, so I was trying to
use it with a TrieDateField

fq={!field f=Data_Ingresso op=Contains}[2013 TO 2018]




But trying the example I get this error:

Invalid Date String:'[2013 TO 2018]'

Could someone please help me to understand in what I'm wrong?

Best regards,
Vincenzo

-- 
Vincenzo D'Amore
email: v.dam...@gmail.com
skype: free.dev
mobile: +39 349 8513251


RE: CloudDescriptor.getNumShards() sometimes returns null

2017-04-24 Thread Markus Jelsma
Hi - that (RE: Overseer session expires on multiple collection creation) was 
the wrong thread. I meant, any ideas on this one?

Many thanks,
Markus

 
-Original message-
> From:Markus Jelsma 
> Sent: Friday 14th April 2017 17:25
> To: solr-user 
> Subject: CloudDescriptor.getNumShards() sometimes returns null
> 
> Hi - I've got this 2 shard/2 replica cluster. In handler i need the number of 
> shards of the cluster.
> 
> cloudDescriptor = core.getCoreDescriptor().getCloudDescriptor();
> return cloudDescriptor.getNumShards();
> 
> It is, however, depending on which node is executing this, sometimes null! 
> This code only runs on shard leaders. First replica of the first shard always 
> returns 2, but second replica of shard one, even when it is the leader, 
> always gets me null. Same appears to be true for the second shard.
> 
> I am clearly missing something, any ideas to share?
> 
> Thanks,
> Markus
> 


RE: Overseer session expires on multiple collection creation

2017-04-24 Thread Markus Jelsma
Hi - any ideas on this one?

Many thanks,
Markus
 
-Original message-
> From:apoorvqwerty 
> Sent: Friday 21st April 2017 15:04
> To: solr-user@lucene.apache.org
> Subject: Overseer session expires on multiple collection creation
> 
> Hi, 
> I am trying to create multiple collections with 2 shards and 2 replications
> each.
> After 5-6 successful 
> overseer status response for 5 creations shows 40k requests for
> collection_operations=>am_i_leader which is a bit odd. 
> and I get 
> Am I not supposed to create 8-10 collections one after the other or is there
> some configuration that I'm missing.
> On creation of 8th collection I get following overseer session expired
> exception 
> 
> nExpiredException: KeeperErrorCode = Session expired for
> /overseer/collection-queue-work/qnr-24
>   at org.apache.zookeeper.KeeperException.create(KeeperException.java:127)
>   at org.apache.zookeeper.KeeperException.create(KeeperException.java:51)
>   at org.apache.zookeeper.ZooKeeper.exists(ZooKeeper.java:1045)
>   at
> org.apache.solr.common.cloud.SolrZkClient$5.execute(SolrZkClient.java:322)
>   at
> org.apache.solr.common.cloud.SolrZkClient$5.execute(SolrZkClient.java:319)
>   at
> org.apache.solr.common.cloud.ZkCmdExecutor.retryOperation(ZkCmdExecutor.java:60)
>   at 
> org.apache.solr.common.cloud.SolrZkClient.exists(SolrZkClient.java:319)
>   at
> org.apache.solr.cloud.OverseerTaskQueue.remove(OverseerTaskQueue.java:93)
>   at
> org.apache.solr.cloud.OverseerTaskProcessor$Runner.markTaskComplete(OverseerTaskProcessor.java:525)
>   at
> org.apache.solr.cloud.OverseerTaskProcessor$Runner.run(OverseerTaskProcessor.java:483)
>   at
> org.apache.solr.common.util.ExecutorUtil$MDCAwareThreadPoolExecutor.lambda$execute$0(ExecutorUtil.java:229)
>   at
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
>   at
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
>   at java.lang.Thread.run(Thread.java:745)
> 
> 
> 
> 
> 
> 
> --
> View this message in context: 
> http://lucene.472066.n3.nabble.com/Overseer-session-expires-on-multiple-collection-creation-tp4331265.html
> Sent from the Solr - User mailing list archive at Nabble.com.
> 


Inconsistent Counts in Cloud at Solr SQL Queries

2017-04-24 Thread Furkan KAMACI
Hi,

As you know that json facet api returns inconsistent counts in cloud set up
(SOLR-7452). I would like to learn that is the situation same for Solr SQL
queries too?

Kind Regards,
Furkan KAMACI


Re: How to use Wordnet in solr?

2017-04-24 Thread alessandro.benedetti
No no Pablo, what I am saying is that you will need to :

1) download the wordnet file
2) upload it to Solr as a managed resource
3) use the synonym token filter with the wordnet format.

I don't know ( and I don't think it is possible right now) how to configure
a managed resource to fetch data from an online service, maybe could be a
good addition for this specific synonym filter.

Cheers



-
---
Alessandro Benedetti
Search Consultant, R Software Engineer, Director
Sease Ltd. - www.sease.io
--
View this message in context: 
http://lucene.472066.n3.nabble.com/How-to-use-Wordnet-in-solr-tp4331273p4331554.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: prefix facet performance

2017-04-24 Thread alessandro.benedetti
Thanks Yonik and Maria.
It make sense, if we reduce the number of terms, term enum becomes a very
good solution.
@Yonik : do we still check the prefix on the term dictionary one by one, or
an FST is used to identify the set of candidate terms ?

I will check the code later,

Regards



-
---
Alessandro Benedetti
Search Consultant, R Software Engineer, Director
Sease Ltd. - www.sease.io
--
View this message in context: 
http://lucene.472066.n3.nabble.com/prefix-facet-performance-tp4330684p4331553.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: HttpSolrServer default connection timeout and socket timeout

2017-04-24 Thread Lasitha Wattaladeniya
Hi devs,

I tried out HttpSolrServer.setConnectionTimeout() method. But im getting
java.lang.UnsupportedException error in my logs. After a littlebit research
I found this issue,

https://issues.apache.org/jira/browse/SOLR-6542

Any workarounds for this issue?

Regards,
Lasitha

On 16 Mar 2017 09:48, "Lasitha Wattaladeniya"  wrote:

> Hi devs,
>
> What are the default HttpSolrServer connection timeout and socket timeout
> used?
>
> And what are the recommended values we should use?
>
> My system is having some problems when the load is high, just trying to
> figure out what are the correct values should set
>
> Regards,
> Lasitha
>


Re: Nodes goes down but never recovers.

2017-04-24 Thread Pranaya Behera
Any other solutions for this ?

On Fri, Apr 21, 2017 at 9:42 AM, Pranaya Behera
 wrote:
> Hi Erick,
>   Even if they use different solr.home which I have also
> tested in AWS environment there also is the same problem.
>
> Can someone verify the first message in their local ?
>
> On Fri, Apr 21, 2017 at 2:27 AM, Erick Erickson  
> wrote:
>> Have you looked at the Solr logs on the node you try to bring back up?
>> There are sometimes much more informative messages in the log files.
>> The proverbial "smoking gun" would be messages about write locks.
>>
>> You say they are all using the same solr.home, which is probably the
>> source of a lot of your issues. Take a look at the directory structure
>> after you start up the example and you'll see different -s parameters
>> for each of the instances started on the same machine, so the startup
>> looks something like:
>>
>> bin/solr start -c -z localhost:2181 -p 898$1 -s example/cloud/node1/solr
>> bin/solr start -c -z localhost:2181 -p 898$1 -s example/cloud/node2/solr
>>
>> and the like.
>>
>> Best,
>> Erick
>>
>> On Thu, Apr 20, 2017 at 11:01 AM, Pranaya Behera
>>  wrote:
>>> Hi,
>>>  Can someone from the mailing list also confirm the same findings
>>> ? I am at wit's end on what to do to fix this. Please guide me to
>>> create a patch for the same.
>>>
>>> On Thu, Apr 20, 2017 at 3:13 PM, Pranaya Behera
>>>  wrote:
 Hi,
  Through SolrJ I am trying to upload configsets and create
 collections in my solrcloud.

 Setup:
 1 Standalone zookeeper listening on 2181 port. version 3.4.10
 -- bin/zkServer.sh start
 3 Starting solr nodes. (All running from the same solr.home) version
 6.5.0 and as well in 6.2.1
 -- bin/solr -c -z localhost:2181 -p 8983
 -- bin/solr -c -z localhost:2181 -p 8984
 -- bin/solr -c -z localhost:2181 -p 8985

 After first run of my java application to upload the config and create
 the collections in solr through zookeeper is seemless and working
 fine.
 Here is the clusterstatus after the first run.
 https://gist.github.com/shadow-fox/5874f8b5de93fff0f5bcc8886be81d4d#file-3nodes-json

 Stopped one solr node via:
 -- bin/solr stop -p 8985
 clusterstatus changed to:
 https://gist.github.com/shadow-fox/5874f8b5de93fff0f5bcc8886be81d4d#file-3nodes1down-json

 Till now everything is as expected.

 Here is the remaining part where it confuses me.

 Bring the down node back to life. Clusterstatus changed from 2 node
 down with 1 node not found to 3 node down including the new node that
 just brought up.
 https://gist.github.com/shadow-fox/5874f8b5de93fff0f5bcc8886be81d4d#file-3nodes3down-json
 Expected result should be all the other nodes should be in active mode
 and this one would be recovery mode and then it would be active mode,
 as this node had data before i stopped it using the script.

 Now I added one more node to the cluster via
 -- bin/solr -c -z localhost:2181 -p 8986
 The clusterstatus changed to:
 https://gist.github.com/shadow-fox/5874f8b5de93fff0f5bcc8886be81d4d#file-4node3down-json
 This one just retains the previous state and adds the node to the cluster.


 When bringing up the removed node which was previously in the cluster
 which was registered to the zookeeper and has data about the
 collections be registered as active rather than making every other
 node down ? If so what is the solution to this ?

 When we add more nodes to an existing cluster, how to ensure that it
 also gets the same collections/data i.e. basically synchronizes with
 the other nodes which are present in the node rather than manually
 create collection for that specific node ? As you can see from the
 lastly added node's clusterstate it is there in the live_nodes but
 never got the collections into its data dir.
 Is there any other way to add a node with the existing cluster with
 the cluster data ?

 For the completion here is the code that is used to upload config and
 create collection through CloudSolrClient in Solrj.(Not full code but
 part of it where the operation is happening.)
 https://gist.github.com/shadow-fox/5874f8b5de93fff0f5bcc8886be81d4d#file-code-java
 Thats all there is for a collection to create: upload configsets to
 zookeeper, create collection and reload collection if required.

 This I have tried in my local Mac OS Sierra and also in AWS env which
 same effect.



 --
 Thanks & Regards
 Pranaya PR Behera
>>>
>>>
>>>
>>> --
>>> Thanks & Regards
>>> Pranaya PR Behera
>
>
>
> --
> Thanks & Regards
> Pranaya PR Behera



-- 
Thanks & Regards
Pranaya PR Behera


A problem with depolying:solr6.5 report 404 error

2017-04-24 Thread David
Dear,manager:


   I have a problem with deploying the solr6.5. my environment are 
windows7+jdk 8u131+tomcat9.0+solr6.5. java run successfully, tomcat run 
successfully, solr6.5 has been depolyed. Enter 
http://localhost:8080/solr/index.html in firefox, and then report 404 error. 
The detail is "The origin server did not find a current representation for the 
target resource or is not willing to disclose that one exists." Could you tell 
me why it happened and how to solve this problem? Thank you!


sincerely yours
David.Wu