Re: segfault on x86_64 (Fedora Core 2) 2.6.x

2004-06-10 Thread Hui Huang
The segfault messages look benign - they are the implicit NULL checks
within JVM.
Did you see a hang or a crash (i.e. tomcat process is gone,
segfaults in kernel log don't count, they are handled by JVM)?
Is this on HT Xeon? I've heard 2.6.5 in FC2 is not stable (frequent
hang) but 2.6.6 is Ok, looks like your case is different. I haven't
got a chance to load FC2 on my machine yet.
regards,
-hui
Marc Heckmann wrote:
Hi,
 I am using j2sdk-1.4.2 on Fedora Core 2 x86_64. when starting tomcat
v4.1.30 (haven't tried any other version), I get segfaults and my
webapps do not run properly at all. This happens with either the 32
bit or native 64 bit versions of the SDK. I have tried on 2 separate
machines with the same results. Here are some of the messages I get:
java[4508]: segfault at 0008 rip 002a98544a78 rsp 007fbfffc1d0 
error 4
java[4520]: segfault at 0008 rip 002a9845f3c2 rsp 4148bf20 
error 4
The same thing happens with Sun's SDK. 

Kernel version 2.6.7-rc2-bk6 as well as 2.6.6. Kernel 2.4.27-rc5 does
not show this behaviour.
Anyone seen this before? Thanks in advance.
-m
--
To UNSUBSCRIBE, email to [EMAIL PROTECTED]
with a subject of "unsubscribe". Trouble? Contact [EMAIL PROTECTED]

--
To UNSUBSCRIBE, email to [EMAIL PROTECTED]
with a subject of "unsubscribe". Trouble? Contact [EMAIL PROTECTED]


Re: segfault on x86_64 (Fedora Core 2) 2.6.x

2004-06-10 Thread Juergen Kreileder
Radu-Adrian Popescu <[EMAIL PROTECTED]> writes:

> The error codes come in groups of 1-4 at a time, and they're
> alternate.
>
> What's causing this ?

HotSpot uses SIGSEGV internally.  It does on x86 too, but the x86
kernel doesn't have a segfault logger, so you don't get these messages
there.  IIRC there's no way to disable the x86_64 segfault logger
without recompiling the kernel.

> Oh and when is 1.4.2 64-bit final coming out :) ?

In a few days, we're in the final testing stage.


Juergen

-- 
Juergen Kreileder, Blackdown Java-Linux Team
http://www.blackdown.org/java-linux/java2-status/


--
To UNSUBSCRIBE, email to [EMAIL PROTECTED]
with a subject of "unsubscribe". Trouble? Contact [EMAIL PROTECTED]



Re: segfault on x86_64 (Fedora Core 2) 2.6.x

2004-06-10 Thread Radu-Adrian Popescu
Marc Heckmann wrote:
Hi,
 I am using j2sdk-1.4.2 on Fedora Core 2 x86_64. when starting tomcat
v4.1.30 (haven't tried any other version), I get segfaults and my
webapps do not run properly at all. This happens with either the 32
bit or native 64 bit versions of the SDK. I have tried on 2 separate
machines with the same results. Here are some of the messages I get:
java[4508]: segfault at 0008 rip 002a98544a78 rsp 007fbfffc1d0 
error 4
java[4520]: segfault at 0008 rip 002a9845f3c2 rsp 4148bf20 
error 4
The same thing happens with Sun's SDK. 

Kernel version 2.6.7-rc2-bk6 as well as 2.6.6. Kernel 2.4.27-rc5 does
not show this behaviour.
Anyone seen this before? Thanks in advance.

I'm running FC2 x86_64 with

Java(TM) 2 Runtime Environment, Standard Edition (build Blackdown-1.4.2-rc1)
Java HotSpot(TM) 64-Bit Server VM (build Blackdown-1.4.2-rc1, mixed mode)>
on kernel
<2.6.5-1.358>
and I get them segfaults too, with error codes 4 & 6. The applications seem to 
work just fine, though.
Jun  9 20:05:04 rpopescu kernel: java[1673]: segfault at 0008 rip 
002a985392a6 rsp 41f94790 error 4
Jun  9 21:35:28 rpopescu kernel: java[2883]: segfault at 4fe8 rip 
57a619e5 rsp 7fe8 error 6

The error codes come in groups of 1-4 at a time, and they're alternate.
What's causing this ?
Oh and when is 1.4.2 64-bit final coming out :) ?
Cheers,
--
Radu-Adrian Popescu
CSA, DBA, Developer
Aldrapay MD
Aldratech Ltd.
+40213212243
--
To UNSUBSCRIBE, email to [EMAIL PROTECTED]
with a subject of "unsubscribe". Trouble? Contact [EMAIL PROTECTED]


Re: segfault on x86_64 (Fedora Core 2) 2.6.x

2004-06-10 Thread Radu-Adrian Popescu
Juergen Kreileder wrote:
HotSpot uses SIGSEGV internally.  It does on x86 too, but the x86
kernel doesn't have a segfault logger, so you don't get these messages
there.  IIRC there's no way to disable the x86_64 segfault logger
without recompiling the kernel.

Oh and when is 1.4.2 64-bit final coming out :) ?

In a few days, we're in the final testing stage.
Fabulous, thanks Juergen !
Keep up the excellent work guys!
Regards,
--
Radu-Adrian Popescu
CSA, DBA, Developer
Aldrapay MD
Aldratech Ltd.
+40213212243
--
To UNSUBSCRIBE, email to [EMAIL PROTECTED]
with a subject of "unsubscribe". Trouble? Contact [EMAIL PROTECTED]


Re: segfault on x86_64 (Fedora Core 2) 2.6.x

2004-06-10 Thread Marc Heckmann
On Wed, Jun 09, 2004 at 11:58:32PM -0700, Hui Huang wrote:
> The segfault messages look benign - they are the implicit NULL checks
> within JVM.

But look at the following discussion, Andi Kleen one of the x86_64
kernel developers says that the kernel has been fixed not to log the
messages if it is not a real segfault:

http://bugme.osdl.org/show_bug.cgi?id=2839

But maybe they fixed 2.4 and forgot to do so for 2.6.x?

It also seems to me that the Sun 1.4.2 JVM did not function correctly
(1.5.0-beta for x86_64 did actually dump core.) in that the webapp was
not functioning correctly. 

I will have to do more extensive tests using blackdown with my own
code when I have a minute to confirm (or not) this.

I will let you know.

-m


--
To UNSUBSCRIBE, email to [EMAIL PROTECTED]
with a subject of "unsubscribe". Trouble? Contact [EMAIL PROTECTED]



Re: segfault on x86_64 (Fedora Core 2) 2.6.x

2004-06-10 Thread Juergen Kreileder
Marc Heckmann <[EMAIL PROTECTED]> writes:

> On Wed, Jun 09, 2004 at 11:58:32PM -0700, Hui Huang wrote:
>> The segfault messages look benign - they are the implicit NULL
>> checks within JVM.
>
> But look at the following discussion, Andi Kleen one of the x86_64
> kernel developers says that the kernel has been fixed not to log the
> messages if it is not a real segfault:
>
> http://bugme.osdl.org/show_bug.cgi?id=2839
>
> But maybe they fixed 2.4 and forgot to do so for 2.6.x?

It looks like it's supposed to be fixed in 2.6 too.  But a quick test
shows that it actually works the other way round, at least in 2.6.6:
Only catched segfaults get logged!

Here's a patch which fixed the problem for me;

--- arch/x86_64/mm/fault.c.orig 2004-06-10 19:51:45.0 +0200
+++ arch/x86_64/mm/fault.c  2004-06-10 20:38:38.0 +0200
@@ -210,11 +210,11 @@ static int is_errata93(struct pt_regs *r
 
 int unhandled_signal(struct task_struct *tsk, int sig)
 {
-   /* Warn for strace, but not for gdb */
-   if ((tsk->ptrace & (PT_PTRACED|PT_TRACESYSGOOD)) == PT_PTRACED)
-   return 0;
-   return (tsk->sighand->action[sig-1].sa.sa_handler == SIG_IGN) ||
-   (tsk->sighand->action[sig-1].sa.sa_handler == SIG_DFL);
+   /* I'm not sure about PT_TRACESYSGOOD. Is gdb supposed
+  to use PTRACE_O_TRACESYSGOOD?  Mine doesn't. */
+   return !(tsk->ptrace & PT_PTRACED) &&
+   ((tsk->sighand->action[sig-1].sa.sa_handler == SIG_IGN) ||
+(tsk->sighand->action[sig-1].sa.sa_handler == SIG_DFL));
 }
 
 int page_fault_trace; 
@@ -374,7 +374,7 @@ bad_area_nosemaphore:
(address >> 32))
return;
 
-   if (exception_trace && !unhandled_signal(tsk, SIGSEGV)) { 
+   if (exception_trace && unhandled_signal(tsk, SIGSEGV)) { 
printk(KERN_INFO 
   "%s[%d]: segfault at %016lx rip %016lx rsp %016lx error %lx\n",
tsk->comm, tsk->pid, address, regs->rip,


> It also seems to me that the Sun 1.4.2 JVM did not function
> correctly (1.5.0-beta for x86_64 did actually dump core.) in that
> the webapp was not functioning correctly.

32-bit VMs should work OK (unless you use noexec32=all,on; only
Blackdown 1.4.2 works fine with that).
As for 64-bit VMs, please wait a few days: 1.4.2-fcs will fix quite a few
x86_64 specific bugs.


Juergen

-- 
Juergen Kreileder, Blackdown Java-Linux Team
http://www.blackdown.org/java-linux/java2-status/


--
To UNSUBSCRIBE, email to [EMAIL PROTECTED]
with a subject of "unsubscribe". Trouble? Contact [EMAIL PROTECTED]



Help on JVM hang

2004-06-10 Thread Veda N Ponnusamy




Hi 
folks,
    I'm posting this, since I know I can get help only from this community. 
My application is a heavily multi-threaded java application which is runs 
in Linux with the following configuration
 
OS - 
RH9
Kernel - 
2.4.20-30.9smp
glibc - 
glibc-2.3.2-27.9.7
JVM - SUN 
1.4.2
 
My 
applications HANGS after runnning for some time. Do you any work arounds for 
this problem
 
"Thread-11" prio=1 tid=0x6a920368 nid=0x999 
runnable [6bc7b000..6bc7b854]    at 
java.io.ExpiringCache.cleanup(ExpiringCache.java:94)    
at 
java.io.ExpiringCache.get(ExpiringCache.java:51)    
- locked <0x46e82cb0> (a 
java.io.ExpiringCache)    at 
java.io.UnixFileSystem.canonicalize(UnixFileSystem.java:137)    
at 
java.io.File.getCanonicalPath(File.java:513)    
at 
java.io.FilePermission$1.run(FilePermission.java:218)    
at java.security.AccessController.doPrivileged(Native 
Method)    at 
java.io.FilePermission.init(FilePermission.java:212)    
at 
java.io.FilePermission.(FilePermission.java:264)    
at 
java.lang.SecurityManager.checkRead(SecurityManager.java:863)    
at java.io.File.exists(File.java:678)
 
I dont get 
this problem if I run with LD_ASSUME_KERNEL=2.2.5 setting. But the applications 
runs very slow( around 50% slower). Does anyone has come accross this problem 
?
 
Reg
Veda