Hello,

I have compiled openafs-snap-2005-01-10. Besides that getting a token and 
acessing AFS crashes the machine I found the following problem:

Without any token I tar a large AFS area with campus wide available files to
/dev/null. After some time I get the following error messages:

a rs_aix51/gaussian-03/g03/l405.hlp 6 blocks.
a rs_aix51/gaussian-03/g03/l502.exe 18195 blocks.
tar: 0511-182 Read error on afs: Lost contact with file server 10.1.2.26 in 
cell uni-freiburg.de (multi-homed addre
ss; other same-host interfaces maybe up)
afs: Lost contact with file server 10.1.2.27 in cell uni-freiburg.de 
(multi-homed address; other same-host interfac
es maybe up)
afs: Lost contact with file server 132.230.6.235 in cell uni-freiburg.de (all 
multi-homed ip addresses down for the
 server)
afs: Lost contact with file server 132.230.6.236 in cell uni-freiburg.de (all 
multi-homed ip addresses down for the
 server)
afs: setting clock back 10 seconds (of 45, via 10.1.2.26 in cell 
uni-freiburg.de); clock is still fast.
rs_aix51/gaussian-03/g03/l502.exe: A remote host did not respond within the 
timeout period.
a rs_aix51/gaussian-03/g03/l502.hlp 68 blocks.
tar: 0511-182 Read error on rs_aix51/gaussian-03/g03/l502.hlp: A remote host 
did not respond within the timeout per
iod.
a rs_aix51/gaussian-03/g03/l503.exe 2922 blocks.
tar: 0511-182 Read error on rs_aix51/gaussian-03/g03/l503.exe: A remote host 
did not respond within the timeout per
iod.
a rs_aix51/gaussian-03/g03/l503.hlp 7 blocks.
tar: 0511-182 Read error on rs_aix51/gaussian-03/g03/l503.hlp: A remote host 
did not respond within the timeout per
iod.
a rs_aix51/gaussian-03/g03/l504.exe 3307 blocks.
tar: 0511-182 Read error on rs_aix51/gaussian-03/g03/l504.exe: A remote host 
did not respond within the timeout per
iod.
a rs_aix51/gaussian-03/g03/l506.exe 6171 blocks.
tar: 0511-182 Read error on rs_aix51/gaussian-03/g03/l506.exe: A remote host 
did not respond within the timeout per
iod.
tar: rs_aix51/gaussian-03/g03/l506.hlp: A remote host did not respond within 
the timeout period.
.
. # the tar continues a little bit
.
a share/sw-tools-1.0/sbin/lnlibe 1 blocks.
a share/sw-tools-1.0/sbin/lnman 1 blocks.
a share/sw-tools-1.0/sbin/lnsbin 1 blocks.
a share/sw-tools-1.0/sbin/mkman 1 blocks.
a share/sw-tools-1.0/sbin/rmman 1 blocks.
a share/sw-tools-1.0/Links 1 blocks.
a share/sw-tools-1.0/Id 1 blocks.
a share/sw-tools-1.0/README 1 blocks.
a share/sw-tools-1.0/History 5 blocks.
tar: share/xyz: A remote host did not respond within the timeout period.

The tar the finishes.

� The fileservers are multihomed. The test-machine has no access to 10.1.2.x,   
     
the fileservers are only reachable by their 132.230.6.x adresses.

� After saying that "all multihomed ip-adresses are down" the test machine has 
no further access to AFS besides to some files which are stiil in the cache.
fs checkservers says always "These servers unavailable due to network or 
server problems:  sv6.ruf.uni-freiburg.de sv7.ruf.uni-freiburg.de".

� Stopping and starting AFS does not help. I have to reboot.

� During this state  a machine connected to the same hub is able to tar the 
same area without any problems. So there is no problem with the network or 
the servers itselves.

� The problem is reproducible.

� The problem does not show up if I replace the kernel extensions by those of 
Hartmut Reuter contained in his 15.03.04 Version.

Thanks in advance for any help.

Gunther
-- 
________________________________________________________________
Hans-Gunther Borrmann <[EMAIL PROTECTED]>
Rechenzentrum der Universitaet Freiburg
Hermann-Herder-Str. 10, D79104 FREIBURG
Tel.: +49 761/203-4652
Fax:  +49 761/203-4643

_______________________________________________
OpenAFS-info mailing list
[email protected]
https://lists.openafs.org/mailman/listinfo/openafs-info

Reply via email to