Hi Yaroslav

On Wed, Jul 04, 2007 at 09:58:52AM -0400, Yaroslav Halchenko wrote:
> Here are more details... I can't figure out on how to debug it more
> productively -- please advise... it stalls again and again... 

According to the backtrace, the problem seems to be in the Popen
function in unix/xc/programs/Xserver/os/utils.c, or in the call to
that function in unix/xc/programs/Xserver/os/auth.c.

According to the strace you get ERESTARTNOINTR. However, I cannot
understand why you get this more than once, as the loop comes after
the call to Popen (which in turn calls fork). Maybe the problem is in
fork in libc6? That would explain why you get the same kind of
behaviour in screen. If you could add debugging output in this
area to see whether Popen is called many times, and with what
arguments, that would be good information.

For example:
fprintf(stderr, "Popen called with '%s' as argument.\n", buf);

And similarly in the Popen function. If it is the fork call that hangs,
then we have to forward this bug to libc, or maybe to the kernel.
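
To make the idea above more concrete, here is a minimal sketch of the
kind of debugging output I mean. It is not the real Popen from utils.c.
The names debug_fork and debug_popen_calls are made up for illustration,
and the child simply exits where the real code would exec the command.
It prints one line before fork() and one after, so a hang inside fork()
would show up as a "before" line with no matching "after" line, and the
counter shows whether the function is entered more than once.

```c
#include <errno.h>
#include <stdio.h>
#include <string.h>
#include <sys/types.h>
#include <sys/wait.h>
#include <unistd.h>

/* Hypothetical counter: how many times the wrapper has been entered. */
static int debug_popen_calls = 0;

/*
 * Hypothetical stand-in for the fork step inside Popen (assumption:
 * the real function forks and then execs the command in the child).
 * Logs to stderr before and after fork().
 */
static pid_t debug_fork(const char *command)
{
    pid_t pid;

    debug_popen_calls++;
    fprintf(stderr, "Popen call #%d with '%s' as argument, about to fork\n",
            debug_popen_calls, command);

    pid = fork();
    if (pid == 0) {
        /* Child: the real Popen would exec the command here. */
        _exit(0);
    }
    if (pid < 0) {
        fprintf(stderr, "fork failed: errno=%d (%s)\n",
                errno, strerror(errno));
        return pid;
    }

    fprintf(stderr, "fork returned child pid %ld\n", (long)pid);
    return pid;
}
```

The caller is expected to reap the child with waitpid(), as the code
around the real Popen would do through its matching close function.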

What also makes me think that this may be a kernel problem is that,
according to the kernel header file, ERESTARTNOINTR should never be
seen by user programs.

http://www.gelato.unsw.edu.au/lxr/source/include/linux/errno.h#L10
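
As a small illustration of the point above: the restart codes live only
in the kernel's include/linux/errno.h and are not part of the userspace
<errno.h>, so strerror() has no name for them. The sketch below maps
the numeric values from that kernel header to their names, which can
help when a raw number shows up in debugging output. The K_ prefixed
macros and the function name kernel_restart_name are my own invention
for this example.

```c
#include <stddef.h>

/*
 * Kernel-internal restart codes, values as in the kernel's
 * include/linux/errno.h. The K_ prefix is hypothetical and only
 * avoids clashing with any real definitions.
 */
#define K_ERESTARTSYS     512
#define K_ERESTARTNOINTR  513
#define K_ERESTARTNOHAND  514

/* Return the kernel's name for a restart code, or NULL if unknown. */
static const char *kernel_restart_name(int err)
{
    switch (err) {
    case K_ERESTARTSYS:    return "ERESTARTSYS";
    case K_ERESTARTNOINTR: return "ERESTARTNOINTR";
    case K_ERESTARTNOHAND: return "ERESTARTNOHAND";
    default:               return NULL;
    }
}
```

A NULL result means the value is either an ordinary userspace errno or
something this small table does not cover.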

Regards,

// Ola



> at server:
> 
> 22991 yoh       25   0  321m 311m 5024 R  100  3.9   1:28.91 Xvnc4 
> 
> gdb:
> Attaching to process 22991
> Reading symbols from /usr/bin/Xvnc4...done.
> Using host libthread_db library "/lib/libthread_db.so.1".
> Reading symbols from /usr/lib/libz.so.1...done.
> Loaded symbols for /usr/lib/libz.so.1
> Reading symbols from /usr/lib/libstdc++.so.6...done.
> Loaded symbols for /usr/lib/libstdc++.so.6
> Reading symbols from /lib/libm.so.6...done.
> Loaded symbols for /lib/libm.so.6
> Reading symbols from /lib/libgcc_s.so.1...done.
> Loaded symbols for /lib/libgcc_s.so.1
> Reading symbols from /lib/libc.so.6...done.
> Loaded symbols for /lib/libc.so.6
> Reading symbols from /lib/ld-linux-x86-64.so.2...done.
> Loaded symbols for /lib64/ld-linux-x86-64.so.2
> 0x00002b39af69cf12 in fork () from /lib/libc.so.6
> (gdb) bt
> #0  0x00002b39af69cf12 in fork () from /lib/libc.so.6
> #1  0x000000000043cd73 in Popen ()
> #2  0x000000000043e864 in LoadAuthorization ()
> #3  0x000000000043ea46 in CheckAuthorization ()
> #4  0x0000000000439a25 in ClientAuthorized ()
> #5  0x000000000041e396 in ProcEstablishConnection ()
> #6  0x0000000000424672 in Dispatch ()
> #7  0x000000000040b145 in main ()
> 
> 
> stracing it gave the same and then crashed when I Ctrl-C'd strace.
> 
> in the ravana:1.log
> 
> Mon Jul  2 18:36:57 2007
>  Connections: accepted: 127.0.0.1::35871
>  SConnection: Client needs protocol version 3.8
>  SConnection: Client requests security type VncAuth(2)
> 
> Mon Jul  2 18:36:59 2007
>  VNCSConnST:  Server default pixel format depth 16 (16bpp) little-endian 
> rgb565
>  VNCSConnST:  Client pixel format depth 6 (8bpp) rgb222
> 
> Mon Jul  2 18:37:00 2007
>  VNCSConnST:  Client pixel format depth 24 (32bpp) little-endian rgb888
> /usr/lib/WindowMaker/WindowMaker warning: trying to create application for 2e
> XIO:  fatal IO error 104 (Connection reset by peer) on X server ":1.0"
>       after 898740 requests (846076 known processed) with 0 events remaining.
> XIO:  fatal IO error 104 (Connection reset by peer) on X server ":1.0"
>       after 65187 requests (146 known processed) with 0 events remaining.
> X connection to :1.0 broken (explicit kill or server shutdown).
> XIO:  fatal IO error 104 (Connection reset by peer) on X server ":1.0"
>       after 845469 requests (780452 known processed) with 0 events remaining.
> XIO:  fatal IO error 104 (Connection reset by peer) on X server ":1.0"
>      O:  fatal IO error 104 (Connection reset by peer) on X server ":1.0"
>       after 1072209 requests (1048321 known processed) with 0 events 
> remaining.
> X connection to :1.0 broken (explicit kill or server shutdown).
> xterm:  fatal IO error 32 (Broken pipe) or KillClient on X server ":1.0"
> X connection to :1.0 broken (explicit kill or server shutdown).
> X connection to :1.0 broken (explicit kill or server shutdown).
> after 31243983 requests (31194559 known processed) with 0 events remaining
> 
> -- 
>                                   .-.
> =------------------------------   /v\  ----------------------------=
> Keep in touch                    // \\     (yoh@|www.)onerussian.com
> Yaroslav Halchenko              /(   )\               ICQ#: 60653192
>                    Linux User    ^^-^^    [175555]
> 
> 
> 
> 

-- 
 --------------------- Ola Lundqvist ---------------------------
/  [EMAIL PROTECTED]                     Annebergsslingan 37      \
|  [EMAIL PROTECTED]                     654 65 KARLSTAD          |
|  http://opalsys.net/                 +46 (0)70-332 1551       |
\  gpg/f.p.: 7090 A92B 18FE 7994 0C36  4FE4 18A1 B1CF 0FE5 3DD9 /
 ---------------------------------------------------------------

