On 7/24/2017 11:22 AM, Susan Litzinger wrote: > I have a number of temp volumes that I'm trying to delete but having > problems within our AFS filesystem. Does anyone have a suggestion on > how to diagnose the 'Possible communication failure' error? I'm not > finding anything on google. Here is an example of a volume that I try > to remove but it won't go totally away. > > > bash-3.2# vos remove -server velma.psc.edu <http://velma.psc.edu> > -partition /vicepcd -id tmp.users.9.zzhao3 -localauth > WARNING: Volume 537606306 does not exist in VLDB on server and partition > Volume 537606306 on partition /vicepcd server velma.psc.edu > <http://velma.psc.edu> deleted > > bash-3.2# vos volinfo -id tmp.users.9.zzhao3 -localauth > Could not fetch the information about volume 537606306 from the server > Possible communication failure > Error in vos examine command. > Possible communication failure > > Dump only information from VLDB > > tmp.users.9.zzhao3 > RWrite: 537606306 > number of sites -> 1 > server velma.pvt.psc.edu <http://velma.pvt.psc.edu> partition > /vicepcd RW Site > > > bash-3.2# vos syncvldb -server velma.psc.edu <http://velma.psc.edu> > -partition vicepcd -volume tmp.users.9.zzhao3 -dryrun -localauth -verbose > Processing VLDB entry tmp.users.9.zzhao3 .
Susan,
According to DNS velma.psc.edu != velma.pvt.psc.edu:
Non-authoritative answer:
Name: velma.pvt.psc.edu
Address: 10.32.5.186
Non-authoritative answer:
Name: velma.psc.edu
Addresses: 2001:5e8:2:42::b8
128.182.66.184
The psc.edu cell's VLDB believes that these addresses are separate
fileservers:
UUID: None
[10.32.5.185]:7005
UUID: None
[128.182.73.70]:7005
UUID: None
[128.182.73.72]:7005
UUID: None
[128.182.73.73]:7005
UUID: None
[128.182.40.71]:7005
UUID: None
[128.182.73.74]:7005
UUID: None
[128.182.73.75]:7005
UUID: None
[128.182.73.77]:7005
UUID: None
[10.32.5.186]:7005
UUID: None
[127.0.0.1]:7005
UUID: 002167fe-84dd-1ace-8751-b63bb680aa77
[128.182.59.182]:7005
UUID: 008e9914-f6d3-1a6d-b108-b942b680aa77
[128.182.66.185]:7005
UUID: 0029cfd4-6cc8-1a32-9a9e-b842b680aa77
[128.182.66.184]:7005
UUID: 002fb7a0-0e2a-13b8-a68d-b53bb680aa77
[128.182.59.181]:7005
UUID: 00376e3c-e32a-1acc-a42b-0100007faa77
[128.182.59.77]:7005
Since your cell is newer than IBM AFS 3.4, there should no longer be
file server entries in the VLDB that are not assigned a UUID. My guess
is that they are left over from an attempt to manually modify a
fileserver's IP address.
velma.pvt.psc.edu [10.32.5.186] has 4131 volume entries in the VLDB.
velma.psc.edu [128.182.66.184] has 22947 volume entries in the VLDB.
If these are intended to be the same server, you might want to consider
rebuilding your VLDB from scratch.
Jeffrey Altman
<<attachment: jaltman.vcf>>
smime.p7s
Description: S/MIME Cryptographic Signature
