(top of my head):
* AFS clients and servers (even on the same machine) use different locations for CellServDB. It looks like the client file has all 3 hosts (which would explain the intermittent results), but the servers might not know about each other, which would mean elections don't work.

* suggest to check ptserver status on the servers (should print version)
for i in iap1 ipa2 ipa3; do rxdebug "${i}" 7002; done

* check election status for ptserver:
for i in iap1 ipa2 ipa3; do udebug "${i}" 7002; done

The current master will print all participants, and all servers should agree who is "sync site". If tehre are message about unknown servers, see "server" CellServDB

You also might want to split your problems:

* ptserver elections - needs service key. Then just use "-localauth"
* same for vlserver (port 7003, same thing as above)
* kerberos integration (i.e identifying "admin" account from Kerberos ticket -> AFS token) * volume releasing - needs service key, vlserver election to work, admin account to be identified.


PS: please re-key, and do not post the new "asetkey list" again..
Cheers
jan


On 10/02/2019 10:51, Joseph Timothy Foley wrote:
Hi all

I’ve been getting help on the IRC channel setting up a new cell for our CS department, but I’ve hit a roadblock that may need a 1.8.2 debugging expert. (Many thanks to auristor, billings, and patbarron)

I have setup 3 Centos7 hosts with IPA:  ipa1.cs.ru.is, ipa2, ipa3.

IPA2 is the lowest numbered (for historical reasons) and is the Kerberos primary.

The other two are replication sites.

I have setup the Openafs clients using the yum packages

I’ve tried to follow the quickstart and https://wiki.openafs.org/admin/InstallingOpenAFSonRHEL/

To the best of my ability, but I think something is wrong with the Protection server.

I’ve checked with rxdebug and there is connectivity between the 3 machines

I’ve added both “admin” and “foley” to system:adminstrators and using “bos adduser” to all the machines.  “bos listuser” verifies this.

Symptom:

“pts membership admin” as admin works intermittently

[foley@ipa2 .cs.ru.is]$ pts membership admin

Groups admin (id: 1) is a member of:

   system:administrators

[foley@ipa2 .cs.ru.is]$ pts membership admin

pts: Permission denied ; unable to get membership of admin (id: 1)

But with “-localauth” it always works.

[foley@ipa2 .cs.ru.is]$ klist -e

Ticket cache: KEYRING:persistent:1298400006:krb_ccache_qrL87VL

Default principal: [email protected]

Valid starting       Expires              Service principal

02/10/2019 09:42:12  02/11/2019 09:42:06  afs/[email protected]

         Etype (skey, tkt): aes256-cts-hmac-sha1-96, aes256-cts-hmac-sha1-96

02/10/2019 09:42:10  02/11/2019 09:42:06  krbtgt/[email protected]

         Etype (skey, tkt): aes256-cts-hmac-sha1-96, aes256-cts-hmac-sha1-96

[foley@ipa2 .cs.ru.is]$ bos listusers ipa2

SUsers are: admin foley

[foley@ipa2 .cs.ru.is]$ pts examine admin

pts: Permission denied ; unable to find entry for (id: 1)

[foley@ipa2 .cs.ru.is]$ pts examine admin

Name: admin, id: 1, owner: system:administrators, creator: system:administrators,

   membership: 1, flags: S----, group quota: unlimited.

Possibly relevant logs:

On ipa2:  (the lowest ip address) after a restart in /var/openafs/logs/PTLog

Sun Feb 10 09:33:18 2019 Using 130.208.243.201 as my primary address

Sun Feb 10 09:33:18 2019 Starting AFS ptserver 1.1 (/usr/libexec/openafs/ptserver)

Sun Feb 10 09:33:21 2019 ubik: A Remote Server has addresses:

Sun Feb 10 09:33:21 2019 ... 130.208.243.202

Sun Feb 10 09:33:24 2019 ubik: A Remote Server has addresses:

Sun Feb 10 09:33:24 2019 ... 130.208.243.205

But no mention of an election.  I only see an election in the BackupLog.

I’ve tried setting a new key, just in case I got confused.

[root@ipa2 logs]#  asetkey list

rxkad_krb5      kvno    1 enctype 17; key is: 3c54d85bad8dd99f938307e1a4bff2d5

rxkad_krb5      kvno    1 enctype 18; key is: a55c654701f21cd871278f09727ee9c6e7809f05f8eeebdfea9777e94f610ce1

rxkad_krb5      kvno    2 enctype 17; key is: 81f4e3ce6b8179833ad21a8539489a68

rxkad_krb5      kvno    2 enctype 18; key is: b90bbfbb11aa16a2cb0079b66467fa517bdaa4af101ab6ffab400cc6471c827e

All done.

(I’ve checked these on all 3 to make sure they were the same)

Trying to delete the old key gives an error

[root@ipa2 logs]# asetkey delete 1

asetkey: Unknown code acfg 1 (70354689) while deleting key 1

Symptom 2:

I can’t release a read-only volume with those tickets

[foley@ipa2 .cs.ru.is]$ vos addsite ipa2 a root.afs

Could not lock the VLDB entry for the volume 536870915

VLDB: no permission access for call

Error in vos addsite command.

VLDB: no permission access for call

But –localauth works fine

[root@ipa2 logs]# vos addsite ipa2 a root.afs -localauth

Added replication site ipa2 /vicepa for volume root.afs

Symptom 3:

Even with all these issues, admin and foley can both create folders in the RW volume of the cell!

System and Package information (all 3 hosts should be identical):

[foley@ipa2 user]$ uname -a

Linux ipa2.cs.ru.is 3.10.0-957.1.3.el7.x86_64 #1 SMP Thu Nov 29 14:49:43 UTC 2018 x86_64 x86_64 x86_64 GNU/Linux

Name        : openafs-client

Arch        : x86_64

Version     : 1.8.2

Release     : 1.el7

Size        : 1.1 M

Repo        : installed

 From repo   : storage-sig

Name        : openafs-server

Arch        : x86_64

Version     : 1.8.2

Release     : 1.el7

Size        : 9.1 M

Repo        : installed

 From repo   : storage-sig

Any help would be appreciated.

Kind regards,

Joe

--

Dr. Joseph T. Foley <[email protected]> Assistant Professor, Dept. of Science & Engineering, Reykjavik University

Menntavegur 1, Nauthólsvík | 101 Reykjavík | Iceland | Phone: +354-599-6569 | Fax +354-599-6201 | www.ru.is


_______________________________________________
OpenAFS-info mailing list
[email protected]
https://lists.openafs.org/mailman/listinfo/openafs-info

Reply via email to