Hello everyone,

People have noticed that they aren't getting some FMN notifications, so
I spent the afternoon tracking down the issue. It looks as though the
redis instance on notifs-backend01.phx2.fedoraproject.org is configured
to persist to disk (this is apparently the default config, which is
surprising to me). In addition, dogpile.cache is apparently not setting
a TTL on keys despite being configured to do so.

All this means that everything in the FMN redis cache is _really_ stale
and contains invalid objects. For example, the packages a user has ACLs
on is cached as a set currently, but the latest version of FMN expects
a dictionary. This leads to messages not getting sent that should have
been sent.

I would like to delete the persisted database (/var/lib/redis/dump.rdb)
and restart redis to wipe the cache. FMN does not like redis going down
so the workers and backend will likely need to be restarted, and because
of a different caching issue[0] this will likely take around an hour
(messages won't be lost, they'll just build up in the rabbitmq queue).

[0] https://github.com/fedora-infra/fmn/pull/190

+1s?

-- 
Jeremy Cline
XMPP: [email protected]
IRC:  jcline

Attachment: signature.asc
Description: OpenPGP digital signature

_______________________________________________
infrastructure mailing list -- [email protected]
To unsubscribe send an email to [email protected]

Reply via email to