Hi Andrew, yes I still have the problem. The problem only shows on 32bit installs, on a 64bit SLES11 SP1 with the same kernel version level, AFS runs without problems.
When I enable verbose mode, here's what I get before the machine dies: testinstall:~ # rcopenafs-client start Starting OpenAFS Client afsd: enabling dynamically allocated vcaches afsd: My home cell is 'zurich.ibm.com' afsd: chunkSize autotuned to 16 SScall(137, 28, 17)=0 afsd: Forking rx listener daemon. afsd: Forking rx callback listener. afsd: Forking rxevent daemon. SScall(137, 28, 36)=0 afsd: Calling AFSOP_CACHEINIT: 300 stat cache entries, 1854 optimum cache files, 118685 blocks in the cache, flags = 0x1, dcache entries 1854 SScall(137, 28, 19)=0 SScall(137, 28, 0)=0 SScall(137, 28, 48)=0 SScall(137, 28, 6)=0 afsd: Sweeping workstation's AFS cache directory. afsd: Using memory cache, not swept SScall(137, 28, 34)=0 afsd: Enabling fakestat support in kernel. SScall(137, 28, 33)=0 SScall(137, 28, 29)=0 SScall(137, 28, 29)=0 SScall(137, 28, 29)=0 SScall(137, 28, 29)=0 SScall(137, 28, 29)=0 SScall(137, 28, 29)=0 SScall(137, 28, 29)=0 SScall(137, 28, 29)=0 SScall(137, 28, 29)=0 SScall(137, 28, 29)=0 SScall(137, 28, 29)=0 SScall(137, 28, 29)=0 SScall(137, 28, 29)=0 SScall(137, 28, 29)=0 SScall(137, 28, 29)=0 SScall(137, 28, 29)=0 SScall(137, 28, 29)=0 SScall(137, 28, 29)=0 SScall(137, 28, 29)=0 SScall(137, 28, 29)=0 SScall(137, 28, 29)=0 SScall(137, 28, 29)=0 SScall(137, 28, 29)=0 SScall(137, 28, 29)=0 SScall(137, 28, 29)=0 SScall(137, 28, 29)=0 SScall(137, 28, 29)=0 SScall(137, 28, 29)=0 SScall(137, 28, 29)=0 SScall(137, 28, 29)=0 SScall(137, 28, 29)=0 SScall(137, 28, 35)=0 afsd: Forking AFS daemon. afsd: Forking Check Server Daemon. afsd: Forking 2 background daemons. SScall(137, 28, 2)=0 SScall(137, 28, 2)=0 SScall(137, 28, 4)=0 SScall(137, 28, 1)=0 SScall(137, 28, 100)=0 afsd: All AFS daemons started. afsd: Forking trunc-cache daemon. afsd: Mounting the AFS root on '/afs', flags: 0. >From that point on, the machine seems completely frozen until it eventually reboots itself. I digged into the /var/log/messages and found something: Based on this I changed the client settings for the memcache. I went from "AUTOMATIC" to "65536" (64MB) and that worked. Going up to 96MB worked, 128MB crashed the machine again. Tried 97MB and it crashed as well. 96MB seems to be the limit. I also verified that the problem is connected to the memcache. Using a dedicated partition for the cache the problem wasn't there. System: SLES11 SP1, PAE kernel 2.6.32.13-0.4 OpenAFS: 1.4.12.1 , the RPMs I'm using come from the SuSE tree ( ftp://ftp5.gwdg.de/pub/opensuse/repositories/filesystems/SLE_11/i586/openafs-* ) Hope that helps. Regards Frank ---------------------------------------------------------------------------- Dr. Frank Bagehorn Manager ZRL Information Services Sr. IT Architect / Certified IT Architect IBM Zurich Research Lab. Saeumerstr. 4 CH-8803 Rueschlikon Switzerland ---------------------------------------------------------------------------- SMTP: [email protected] Notes: Frank Bagehorn/Zurich/i...@ibmch phone: ++41 (044) 724 83 23 fax: ++41 (044) 724 89 59 From: Andrew Deason <[email protected]> To: Frank Bagehorn <[email protected]> Cc: [email protected] Date: 22.07.10 21:34 Subject: Re: afsd crashes SLES11 SP1 system On Fri, 21 May 2010 18:21:23 +0200 Frank Bagehorn <[email protected]> wrote: > What should I do to gather more useful information about the possible > root cause of this ? If you are still having this problem... You do not get a panic message or anything like that? Any message you see or that is logged would be helpful. It would be helpful to trigger a core dump of the machine, and see what various processes are waiting for. Or if you can, get a trace of all processes running on the machine; this can usually be done with 'echo t > /proc/sysrq-trigger'. If the machine is not usable, it still may be possible to trigger by hitting alt-sysrq-t. -- Andrew Deason [email protected]
afs_error.log
Description: Binary data
smime.p7s
Description: S/MIME Cryptographic Signature
