Hi again On Mon, Apr 28, 2008 at 03:28:06PM -0400, Yaroslav Halchenko wrote: > > I'm not perfectly sure but some things that I suspect is the problem is > > that the > > number of open files, open sockets, number of processes os something > > similar has > > reached its limit. > > The reason is that you get ERESTARTNOINTR. > thanks for sharing the knowledge ;-) I guess I just need to figure out > how to monitor all the resources from a single point...
::) > > Have you seen this on several systems or just one? > unfortunatly I use VNC primarily on that only box, thus I didn't see it > anywhere else. If only we could figure out the loop where it gets to > 100% I guess I could figure out what rejection does it get (ie what > resource is the problem) To me it seems more like you have really problematic libc or kernel. Because I see from your information that you have provided that you can get this problem in quite a few situation. Are you sure that you do not have a broken installation like buggy kernel or libc? I mean it should not really hang in fork... Best regards, // Ola > > Best regards, > > > // Ola > > > > Sorry for being so anal... stalled once again today. From gdb now it is > > > at fork and > > > never actually exits it :-/ If someone could build it with > > > > Loaded symbols for /lib64/ld-linux-x86-64.so.2 > > > 0x00002b68df98cee2 in fork () from /lib/libc.so.6 > > > (gdb) bt > > > #0 0x00002b68df98cee2 in fork () from /lib/libc.so.6 > > > #1 0x000000000043cd90 in Popen () > > > #2 0x000000000043e884 in LoadAuthorization () > > > #3 0x000000000043ea76 in CheckAuthorization () > > > #4 0x0000000000439a25 in ClientAuthorized () > > > #5 0x000000000041e396 in ProcEstablishConnection () > > > #6 0x0000000000424672 in Dispatch () > > > #7 0x000000000040b145 in main () > > > (gdb) finish > > > Run till exit from #0 0x00002b68df98cee2 in fork () from /lib/libc.so.6 > > > > Program received signal SIGINT, Interrupt. > > > 0x00002b68df98cee2 in fork () from /lib/libc.so.6 > > > (gdb) bt > > > #0 0x00002b68df98cee2 in fork () from /lib/libc.so.6 > > > > strace was busy with > > > 14892 rt_sigreturn(0xe) = 56 > > > 14892 clone(child_stack=0, > > > flags=CLONE_CHILD_CLEARTID|CLONE_CHILD_SETTID|SIGCHLD, > > > child_tidptr=0x2b68dfb39160) = ? ERESTARTNOINTR (To be > > > restarted)ld_stack=0, > > > flags=CLONE_CHILD_CLEARTID|CLONE_CHILD_SETTID|SIGCHLD > > > 14892 --- SIGALRM (Alarm clock) @ 0 (0) --- > > > 14892 rt_sigreturn(0xe) = 56 > > > 14892 clone(child_stack=0, > > > flags=CLONE_CHILD_CLEARTID|CLONE_CHILD_SETTID|SIGCHLD, > > > child_tidptr=0x2b68dfb39160) = ? ERESTARTNOINTR (To be > > > restarted)ld_stack=0, > > > flags=CLONE_CHILD_CLEARTID|CLONE_CHILD_SETTID|SIGCHLD > > > 14892 --- SIGALRM (Alarm clock) @ 0 (0) --- > > > 14892 rt_sigreturn(0xe) = 56 > > > 14892 clone(child_stack=0, > > > flags=CLONE_CHILD_CLEARTID|CLONE_CHILD_SETTID|SIGCHLD, > > > child_tidptr=0x2b68dfb39160) = ? ERESTARTNOINTR (To be > > > restarted)nfinished ...> > > > 14892 --- SIGALRM (Alarm clock) @ 0 (0) --- > > > 14892 rt_sigreturn(0xe) = 56 > > > 14892 clone(child_stack=0, > > > flags=CLONE_CHILD_CLEARTID|CLONE_CHILD_SETTID|SIGCHLD, > > > child_tidptr=0x2b68dfb39160) = ? ERESTARTNOINTR (To be restarted) > > > 14892 --- SIGALRM (Alarm clock) @ 0 (0) --- > > > 14892 rt_sigreturn(0xe) = 56 > > > 14892 clone(child_stack=0, > > > flags=CLONE_CHILD_CLEARTID|CLONE_CHILD_SETTID|SIGCHLD, > > > child_tidptr=0x2b68dfb39160) = ? ERESTARTNOINTR (To be restarted) > > > > > It would so great if there is a vnc4server-dbg ;-))) > > > > BTW -- last line in .log was due to our inserted debug line > > > Popen called with command='cat /home/yoh/.Xauthority' type='r' as > > > arguments > > > > but I am not sure if that wasn't from original login moment earlier in > > > the morning > > > > > On Mon, 21 Apr 2008, Ola Lundqvist wrote: > > > > > stracing was showing lots of getttimeoftheday or whatever that syscall > > > > > is. Today it was different: > > > > > 21162 rt_sigreturn(0xe) = 56 > > > > > 21162 clone(child_stack=0, > > > > > flags=CLONE_CHILD_CLEARTID|CLONE_CHILD_SETTID|SIGCHLD, > > > > > child_tidptr=0x2ad7a050a160) = ? ERESTARTNOINTR (To be restarted) > > > > > 21162 --- SIGALRM (Alarm clock) @ 0 (0) --- > > > > > 21162 rt_sigreturn(0xe) = 56 > > > > > 21162 clone(child_stack=0, > > > > > flags=CLONE_CHILD_CLEARTID|CLONE_CHILD_SETTID|SIGCHLD, > > > > > child_tidptr=0x2ad7a050a160) = ? ERESTARTNOINTR (To be restarted) > > > > > ... > > > > Hmm. To me it looks that we are out of resources... > > > > -- > > > Yaroslav Halchenko > > > Research Assistant, Psychology Department, Rutgers-Newark > > > Student Ph.D. @ CS Dept. NJIT > > > Office: (973) 353-5440x263 | FWD: 82823 | Fax: (973) 353-1171 > > > 101 Warren Str, Smith Hall, Rm 4-105, Newark NJ 07102 > > > WWW: http://www.linkedin.com/in/yarik > -- > Yaroslav Halchenko > Research Assistant, Psychology Department, Rutgers-Newark > Student Ph.D. @ CS Dept. NJIT > Office: (973) 353-5440x263 | FWD: 82823 | Fax: (973) 353-1171 > 101 Warren Str, Smith Hall, Rm 4-105, Newark NJ 07102 > WWW: http://www.linkedin.com/in/yarik > -- --- Inguza Technology AB --- MSc in Information Technology ---- / [EMAIL PROTECTED] Annebergsslingan 37 \ | [EMAIL PROTECTED] 654 65 KARLSTAD | | http://inguza.com/ Mobile: +46 (0)70-332 1551 | \ gpg/f.p.: 7090 A92B 18FE 7994 0C36 4FE4 18A1 B1CF 0FE5 3DD9 / --------------------------------------------------------------- -- To UNSUBSCRIBE, email to [EMAIL PROTECTED] with a subject of "unsubscribe". Trouble? Contact [EMAIL PROTECTED]

