On 7/30/2015 3:53 AM, Manohar Sripada wrote:
> We have Solr Cloud (version 4.7.2) setup on 64 shards spread across VMs.
> I see my queries to Solr taking exactly 50 sec intermittently (as
> someone said so :P). This happens once in 10 queries. 
> I have enabled log level to TRACE on all the solr nodes. I didn't find
> any issue with the query time on any given shard (max QTime observed on
> a shard is 10 ms).  We ran all the tests related to network and
> everything looks fine there. 
> 
> Whenever the query took 50 sec, I am seeing the below log statements
> for org.eclipse.jetty component. Is this some issue with Jetty?? I could
> see this logs being printed every 11 seconds(/2015-07-24
> 07:06:00, //2015-07-24 07:06:11, ...)/  for 4 times. Attached the
> complete logs during that duration. Can someone please help me here??

<snip>

> /INFO  - 2015-07-24 07:06:00.128;
> org.apache.solr.servlet.SolrDispatchFilter; [admin] webapp=null
> path=/admin/info/logging
> params={_=1437736005493&since=1437734905469&wt=json} status=0 QTime=0

Those logs appear to be caused by someone watching the Logging tab in
the admin UI. This admin UI page refreshes every ten seconds. No queries
are happening during the log you included, only the requests for logging
info. These requests are normally very fast, and in your log, they show
a qtime of zero milliseconds.

64 shards is quite a bit, and as soon as someone talks about a very
large install on virtual machines that is having performance problems, I
suspect that they probably do not have enough resources (memory in
particular) for what they are asking the system to do.

Now it's time for some light reading:

http://wiki.apache.org/solr/SolrPerformanceProblems

Next there are questions. These first bunch of questions are about the
virtual machines themselves, not the host hardware for the virtual machines.

Are you using the jetty (start.jar) included with Solr, or have you
installed Solr into a different jetty?

On the dashboard of the admin UI, in the JVM section, there is an "Args"
parameter, which may have multiple lines. What all is there?

If you add up all the shard replicas on a single virtual machine, how
many docs are there and how much disk space is used by the index data?
Include all replicas in those numbers, even if they duplicate data
that's on another virtual machine.

How much memory does the virtual machine have, and how much of that
memory is allocated to the java heap?

Are all of the virtual machines similar as far as memory config and how
much Solr data they contain?

If you are using a virtual machine platform that you host yourself, then
I need to know how many of these virtual machines are loaded onto each
physical machine, and how much memory that physical machine has. If
you're using AWS, then this question is irrelevant. The allocation of
CPU resources might be important, but it's not as important as memory.

Thanks,
Shawn

Reply via email to