Yeah, I tried looking for errors, but there aren't any that I can tell. It just starts eating up all the CPU until the system locks up.

In one previous log I see some SocketTimeoutExceptions, but those are fairly normal when one of my app servers restarts. Otherwise I don't get anything useful =/

Aaron Smuts wrote:
Look for errors in the log and send the relevant sections.

Aaron

--- Joshua Szmajda <[EMAIL PROTECTED]> wrote:

Got another one here:
http://loki.ws/~josh/restart-20080411.tar.bz2

server should work..

Niall Gallagher wrote:
Hi Josh,

I couldn't access your link: connection refused. I'll be out of the
office until next Wednesday, so I hope you have some success by then.
Kind regards,
Niall

On Thu, 2008-04-10 at 15:32 -0400, Joshua Szmajda wrote:
Ok, caught one! The logs are pretty big, so I put them up here:
http://loki.ws/~josh/restart-20080410.tar.bz2

I'm really not sure what caused it. It seems to have happened a little
more quickly than usual.

Does it seem to be the GC? I can't tell. I can try adding in those GC
tuning things, but I don't want to jump the gun and change too many
variables at once. I'll at least add in the tracing Al suggested, though.
Thanks!
-Josh

Joshua Szmajda wrote:
I'd been deleting the logs, so I don't have one right now ><. I did
change my scripts to save them though. As soon as it happens again
I'll have some data. It seems to take about a week or so of running
from a fresh start before I start to get problems.

Niall: thanks for the explanation. I figured they were probably byte
arrays, but then I saw the Strings and that threw me off :).
Anyway, as soon as I get some real data I'll post it to the list.
Thanks all!
-Josh

Aaron Smuts wrote:
Do you have any of the cache logs when this is happening?

I would turn the memory shrinker off (set the property to false), as a
start. I generally don't run with the memory shrinker on. But I'm
shooting in the dark.
Aaron
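
For anyone following along, here is a minimal sketch of what "set the property to false" could look like if the cache is configured from a java.util.Properties object. The key names follow the UseMemoryShrinker entries found in typical JCS cache.ccf examples; the class and method names are hypothetical, and the keys should be checked against the JCS version actually in use.

```java
import java.util.Properties;

class DisableMemoryShrinker {
    // Builds the cache.ccf-style key for a region's memory shrinker flag.
    // Assumption: "default" uses the jcs.default prefix, named regions use jcs.region.<name>.
    static String shrinkerKey(String region) {
        String prefix = region.equals("default") ? "jcs.default" : "jcs.region." + region;
        return prefix + ".cacheattributes.UseMemoryShrinker";
    }

    // Returns a copy of the given properties with the shrinker turned off for one region.
    static Properties withShrinkerOff(Properties base, String region) {
        Properties copy = new Properties();
        copy.putAll(base);
        copy.setProperty(shrinkerKey(region), "false");
        return copy;
    }

    public static void main(String[] args) {
        Properties props = withShrinkerOff(new Properties(), "default");
        // Prints: jcs.default.cacheattributes.UseMemoryShrinker=false
        props.forEach((key, value) -> System.out.println(key + "=" + value));
    }
}
```

The same line can simply be edited by hand in cache.ccf; the code only makes the key explicit.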


--- Joshua Szmajda <[EMAIL PROTECTED]> wrote:

Ahh yes of course, it was the user requirement. Now I have a nice
bunch of data. This is interesting, but I'm not sure what the [B
class is:

num   #instances    #bytes  class name
--------------------------------------
  1:     31419   284852480  [B
  2:      2277    19760264  [I
  3:     57834     3865240  [C
  4:     29628     1896192  org.apache.jcs.engine.ElementAttributes
  5:     57838     1388112  java.lang.String
...
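
On the [B question: the names in the histogram above are the JVM's internal names for array classes, the same strings that Class.getName() returns. A small sketch (the class name ArrayClassNames is made up for illustration) shows the mapping:

```java
class ArrayClassNames {
    // Returns the JVM-internal class name, which is what the histogram prints.
    static String jvmName(Class<?> type) {
        return type.getName();
    }

    public static void main(String[] args) {
        System.out.println(jvmName(byte[].class));   // [B
        System.out.println(jvmName(int[].class));    // [I
        System.out.println(jvmName(char[].class));   // [C
        System.out.println(jvmName(String[].class)); // [Ljava.lang.String;
    }
}
```

So the first row of the histogram is primitive byte arrays: 31419 of them holding 284852480 bytes, roughly 9 KB each on average.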

Niall Gallagher wrote:
Hmm :D

I just did a bit of digging. I've used this script on a few of our
servers in the past (32- and 64-bit server VMs), but I just found a
server which gave me the exact same error message you got. It turns
out that server runs Java under a different user account from the one
I was logged into.

Try running the script from the exact same user account the JVM
process is running under. Even running as root didn't work for me on
that server; it had to be the exact same user account, which is
surprising.
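
A quick way to check the account mismatch before running the tools is sketched below. It assumes Java 9 or later (ProcessHandle postdates this thread), and the class and method names are hypothetical. On some platforms the reported owner may be unavailable or formatted differently (for example with a domain prefix), so treat a "false" as a hint rather than proof.

```java
import java.util.Optional;

class SameUserCheck {
    // True only when the process owner is known and equals the current user name.
    static boolean sameUser(Optional<String> owner, String currentUser) {
        return owner.isPresent() && owner.get().equals(currentUser);
    }

    // Looks up the owner of a running process; empty if the pid is unknown
    // or the operating system does not expose the owner to us.
    static Optional<String> ownerOf(long pid) {
        return ProcessHandle.of(pid).flatMap(handle -> handle.info().user());
    }

    public static void main(String[] args) {
        // Pass the target JVM's pid as the first argument; defaults to this process.
        long pid = args.length > 0 ? Long.parseLong(args[0]) : ProcessHandle.current().pid();
        String me = System.getProperty("user.name");
        Optional<String> owner = ownerOf(pid);
        System.out.println("pid " + pid + " owner: " + owner.orElse("(unknown)"));
        System.out.println("running as: " + me);
        System.out.println("same account: " + sameUser(owner, me));
    }
}
```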

By the way, those tools are documented here:

http://java.sun.com/j2se/1.5.0/docs/tooldocs/share/jmap.html
and
http://java.sun.com/j2se/1.5.0/docs/tooldocs/share/jstack.html

Basically they're supposed to work on most platforms except Windows
and Linux Itanium, so unless you've got Itanium CPUs it should work
for you.

On Wed, 2008-04-09 at 14:44 -0400, Joshua Szmajda wrote:
Hey Niall,

Thanks for your script, but I'm getting these errors:
./capture-diagnostics.sh RemoteCacheServerFactory

=== message truncated ===


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

