On 7/24/2017 11:22 AM, Susan Litzinger wrote:
> I have a number of temp volumes that I'm trying to delete but having
> problems within our AFS filesystem.   Does anyone have a suggestion on
> how to diagnose the 'Possible communication failure' error?  I'm not
> finding anything on google.  Here is an example of a volume that I try
> to remove but it won't go totally away. 
> 
> 
> bash-3.2# vos remove -server velma.psc.edu <http://velma.psc.edu>
> -partition /vicepcd -id tmp.users.9.zzhao3 -localauth
> WARNING: Volume 537606306 does not exist in VLDB on server and partition
> Volume 537606306 on partition /vicepcd server velma.psc.edu
> <http://velma.psc.edu> deleted
> 
> bash-3.2# vos volinfo -id tmp.users.9.zzhao3 -localauth
> Could not fetch the information about volume 537606306 from the server
> Possible communication failure
> Error in vos examine command.
> Possible communication failure
> 
> Dump only information from VLDB
> 
> tmp.users.9.zzhao3
>     RWrite: 537606306
>     number of sites -> 1
>        server velma.pvt.psc.edu <http://velma.pvt.psc.edu> partition
> /vicepcd RW Site
> 
> 
> bash-3.2# vos syncvldb -server velma.psc.edu <http://velma.psc.edu>
> -partition vicepcd -volume tmp.users.9.zzhao3 -dryrun -localauth -verbose
> Processing VLDB entry tmp.users.9.zzhao3 .

Susan,

According to DNS velma.psc.edu != velma.pvt.psc.edu:

Non-authoritative answer:
Name:    velma.pvt.psc.edu
Address:  10.32.5.186

Non-authoritative answer:
Name:    velma.psc.edu
Addresses:  2001:5e8:2:42::b8
          128.182.66.184


The psc.edu cell's VLDB believes that these addresses are separate
fileservers:

UUID: None
[10.32.5.185]:7005

UUID: None
[128.182.73.70]:7005

UUID: None
[128.182.73.72]:7005

UUID: None
[128.182.73.73]:7005

UUID: None
[128.182.40.71]:7005

UUID: None
[128.182.73.74]:7005

UUID: None
[128.182.73.75]:7005

UUID: None
[128.182.73.77]:7005

UUID: None
[10.32.5.186]:7005

UUID: None
[127.0.0.1]:7005

UUID: 002167fe-84dd-1ace-8751-b63bb680aa77
[128.182.59.182]:7005

UUID: 008e9914-f6d3-1a6d-b108-b942b680aa77
[128.182.66.185]:7005

UUID: 0029cfd4-6cc8-1a32-9a9e-b842b680aa77
[128.182.66.184]:7005

UUID: 002fb7a0-0e2a-13b8-a68d-b53bb680aa77
[128.182.59.181]:7005

UUID: 00376e3c-e32a-1acc-a42b-0100007faa77
[128.182.59.77]:7005

Since your cell is newer than IBM AFS 3.4, there should no longer be
file server entries in the VLDB that are not assigned a UUID.  My guess
is that they are left over from an attempt to manually modify a
fileserver's IP address.

velma.pvt.psc.edu [10.32.5.186] has 4131 volume entries in the VLDB.

velma.psc.edu [128.182.66.184] has 22947 volume entries in the VLDB.

If these are intended to be the same server, you might want to consider
rebuilding your VLDB from scratch.

Jeffrey Altman

<<attachment: jaltman.vcf>>

Attachment: smime.p7s
Description: S/MIME Cryptographic Signature

Reply via email to