Spell Check and Privacy

2015-10-12 Thread Arnon Yogev
Hi, Our system supports many users from different organizations and with different ACLs. We are considering adding spell check ("did you mean") functionality using DirectSolrSpellChecker. However, a privacy concern was raised, as this might lead to private information being revealed between users

Socket \ Connection Timeout Values

2015-09-03 Thread Arnon Yogev
Hi, I wanted to ask about the implications of different timeout values one can use. For example: From what I see in the code, the default socket timeout value for Solr is 0. Does that mean Solr nodes will wait to update \ receive update from each other without any timeout? In other words,

Re: Solr cloud hangs, log4j contention issue observed

2015-09-02 Thread Arnon Yogev
On 9/1/2015 12:53 AM, Arnon Yogev wrote: > We have a Solr cloud (4.7) consisting of 5 servers. At some point we noticed that one of the servers had a very high CPU and was not responding. A few minutes later, the other 4 servers we

Solr cloud hangs, log4j contention issue observed

2015-09-01 Thread Arnon Yogev
We have a Solr cloud (4.7) consisting of 5 servers. At some point we noticed that one of the servers had a very high CPU and was not responding. A few minutes later, the other 4 servers were responding very slowly. A restart was required. Looking at the Solr logs, we mainly saw symptoms, i.e.

Result Grouping: Number of results in group is not according to specs

2015-08-17 Thread Arnon Yogev
When using result grouping, Solr specs state the following about the rows and group.limit params: rows - The number of groups to return. group.limit - Number of rows to return in each group. We are using Solr cloud with a single collection and 64 shards. When grouping by field (i.e. using the

Re: Implicit Router Configurations

2015-06-24 Thread Arnon Yogev
Please raise a JIRA for this; I can see why this would occur. You can manually change the clusterstate.json file when this happens as a stop-gap, though I'd have all the Solr instances down when doing this. Best, Erick On Tue, Jun 23, 2015 at 8:19 AM, Arnon Yogev arn

Implicit Router Configurations

2015-06-23 Thread Arnon Yogev
We have a use case where documents are indexed in shards according to a specific field (shard per user), and the number of shards is unknown when creating the collection. For that purpose we use the implicit router and define router.field=user. From what we've seen, the only way to define an

Re: Limitation on Collections Number

2015-06-15 Thread Arnon Yogev
to simply do a proof of concept implementation to test for your particular data model and data values. -- Jack Krupansky On Sun, Jun 14, 2015 at 7:31 AM, Arnon Yogev arn...@il.ibm.com wrote: We're running some tests on Solr and would like to have a deeper understanding

Limitation on Collections Number

2015-06-14 Thread Arnon Yogev
We're running some tests on Solr and would like to have a deeper understanding of its limitations. Specifically, we have tens of millions of documents (say 50M) and are comparing several #collections X #docs_per_collection configurations. For example, we could have a single collection with 50M