On Wed, Jul 23, 2008 at 4:12 PM, Andreas Hirczy <[EMAIL PROTECTED]> wrote:
> Steve Devine <[EMAIL PROTECTED]> writes: > > > Andreas Hirczy wrote: > >> > >> My AFS cell works ok in most scenarios, but since a reboot of one > DB-server > >> last friday no vos command besides "vos help" finishes - e.g. "vos exa > >> root.afs -localauth -verbose" hangs indefinitely and does not produce > any > >> output. Log files are also basically empty. File access works perfectly > but I > >> cannot create or move volumes; no backup of course. > > > > Sounds like firewall to me. can you run vos listvldb root.afs -localauth > on > > the db server? > > No firewall, but "vos listvldb root.afs -localauth" worked. Talks to the vlserver, only > And a miracle > occured: after 10 hours of observed outage "vos exa ...." for volumes not > on > the blocking fileserver works again. > vos examine talks to the volservers. ok, well, > Very strange: no entrys in the log files for 2 hours since last reboot and > salvage. It did not work then. There are still 74 blocked connections on > one > fileserver, but that could be a different problem. "man fileserver" seems > to > indicate, that this number will never go down again until restart. > Unluckily > "vos listvol" still runs slow - but triggers some logging messages at last: > > ==> /var/log/openafs/VolserLog <== > Wed Jul 23 21:23:28 2008 FSYNC_clientInit temporary failure (will retry) > Wed Jul 23 21:23:44 2008 FSYNC_clientInit temporary failure (will retry) > Wed Jul 23 21:24:08 2008 FSYNC_clientInit temporary failure (will retry) > Wed Jul 23 21:24:40 2008 FSYNC_clientInit temporary failure (will retry) > Wed Jul 23 21:25:20 2008 FSYNC_clientInit temporary failure (will retry) > > ==> /var/log/openafs/BosLog <== > Wed Jul 23 21:26:08 2008: fs:vol exited on signal 6 > > ==> /var/log/openafs/VolserLog <== > FSYNC_clientInit failed (giving up!): Connection refused > Wed Jul 23 21:26:08 2008 > : Assertion failed! file ../vol/volume.c, line 705. dead volserver would of course explain a hang. the volserver will restart with an fs outage. got a corefile?
