Hi!

In a highly loaded environment I get the error

            ViceLog(0,("CB: WhoAreYou failed for %s:%d, error %d\n", 
                       hoststr, ntohs(host->port), code));

in FileLog. The particular stress test to reproduce this reliably is to come
from the following system as a client:

shadm:~ # rpm -qa|grep openafs
openafs-1.2.11-0
openafs-client-1.2.11-0
openafs-devel-1.2.11-0
openafs-server-1.2.11-0
shadm:~ # uname -a
Linux shadm 2.4.21-138-smp #1 SMP Fri Oct 31 00:51:31 UTC 2003 i686 unknown
shadm:~ # cat /etc/SuSE-release 
SuSE SLES-8 (i386)
VERSION = 8.1
shadm:~ # 

I precreate 200 processes under 200 different uid's with different tokens. 

Then with a single signal I send them into a chdir to /afs/cell. Looking at the
sniff shows that the client does not respond properly to the WhoAreYou callback
calls. I know this is an artificial workload, but the only way to reliably
reproduce the problem.

Looking at viced/host.c shows that it is not a fatal error if this call is not
answered properly. Replacing 

        code = RXAFSCB_WhoAreYou(host->callback_rxcon, &interf);

with 

        code = RXGEN_OPCODE;

in viced/host.c gives some more messages in FileLog, but my problem goes away.
To me this seems like a client problem. Has anybody seen this yet? Will I
provoke severe problems if I make the above change to the file server?

Thanks,

Volker

Attachment: pgpJt3ZS2ne4X.pgp
Description: PGP signature

Reply via email to