AW: Solr Hangs During Updates for over 10 minutes

2013-07-10 Thread Jed Glazner
Hi Shawn this code is für the solrj lib which we already use.

I'm talking about solr s internal communication from leader to replica via the 
DistributedCmdUpdate class.  I want to force the leader to time out after a 
fixed period instead of waiting for 15 minutes für the server to figure out the 
other end of the socket was closed.I don't know of any flags or settings in 
the solrconfig.xml to do this or if it's even possible with out modifying 
source code.

Jed

Von Samsung Mobile gesendet



 Ursprüngliche Nachricht 
Von: Shawn Heisey s...@elyograg.org
Datum: 07.10.2013 17:35 (GMT+01:00)
An: solr-user@lucene.apache.org
Betreff: Re: Solr Hangs During Updates for over 10 minutes


On 7/10/2013 6:57 AM, Jed Glazner wrote:
 So we'll do what we can quickly to see if we can 'band-aid' the problem
 until we can upgrade to solr 4.4  Speaking of band-aids - does anyone know
 of a way to change the socket timeout/connection timeout for distributed
 updates?

If you need to change HttpClient parameters for CloudSolrServer, here's
how you can do it:

String zkHost =
zk1.REDACTED.com:2181,zk2.REDACTED.com:2181,zk3.REDACTED.com:2181/chroot;
ModifiableSolrParams params = new ModifiableSolrParams();
params.set(HttpClientUtil.PROP_MAX_CONNECTIONS, 1000);
params.set(HttpClientUtil.PROP_MAX_CONNECTIONS_PER_HOST, 200);
params.set(HttpClientUtil.PROP_SO_TIMEOUT, 30);
params.set(HttpClientUtil.PROP_CONNECTION_TIMEOUT, 5000);
HttpClient client = HttpClientUtil.createClient(params);
ResponseParser parser = new BinaryResponseParser();
LBHttpSolrServer lbServer = new LBHttpSolrServer(client, parser);
CloudSolrServer server = new CloudSolrServer(zkHost, lbServer);

Thanks,
Shawn



AW: Solr Hangs During Updates for over 10 minutes

2013-07-09 Thread Jed Glazner
Hi Shawn,

I have been trying to duplicate this problem without success for the last 2 
weeks which is one reason I'm getting flustered.   It seems reasonable to be 
able to duplicate it but I can't.

 We do have a story to upgrade but that is still weeks if not months before 
that gets rolled out to production.

We have another cluster running the same version but with 8 shards and 8 
replicas with each shard at 100gb and more load and more indexing requests 
without this problem but we send docs in batches here and all fields are 
stored.   Where as the trouble index has only 1 or 2 stored fields and only 
send docs 1 at a time.

Could that have anything to do with it?

Jed


Von Samsung Mobile gesendet



 Ursprüngliche Nachricht 
Von: Shawn Heisey s...@elyograg.org
Datum: 07.09.2013 18:33 (GMT+01:00)
An: solr-user@lucene.apache.org
Betreff: Re: Solr Hangs During Updates for over 10 minutes


On 7/9/2013 9:50 AM, Jed Glazner wrote:
 I'll give you the high level before delving deep into setup etc. I have been 
 struggeling at work with a seemingly random problem when solr will hang for 
 10-15 minutes during updates.  This outage always seems to immediately be 
 proceeded by an EOF exception on  the replica.  Then 10-15 minutes later we 
 see an exception on the leader for a socket timeout to the replica.  The 
 leader will then tell the replica to recover which in most cases it does and 
 then the outage is over.

 Here are the setup details:

 We are currently using Solr 4.0.0 with an external ZK ensemble of 5 machines.

After 4.0.0 was released, a *lot* of problems with SolrCloud surfaced
and have since been fixed.  You're five releases and about nine months
behind what's current.  My recommendation: Upgrade to 4.3.1, ensure your
configuration is up to date with changes to the example config between
4.0.0 and 4.3.1, and reindex.  Ideally, you should set up a 4.0.0
testbed, duplicate your current problem, and upgrade the testbed to see
if the problem goes away.  A testbed will also give you practice for a
smooth upgrade of your production system.

Thanks,
Shawn