Public bug reported: The Neutron API, when using WSGI, stops refreshing the hash ring node if there is no activity. The OVN IDL connections (NB and SB) also disconnect.
There are plenty of errors in the testing job [1]. For example, from [2]: * The last command executed before running the tempest tests happen at 08:22:41. * The tempest tests start at 08:24:23 [3]. * There is an activity gap in the Neutron API logs of around 100 seconds (from [4]): """ Jan 23 08:22:41.500324 np0039651784 [email protected][60810]: [pid: 60810|app: 0|req: 10/40] 2001:4802:7805:104:be76:4eff:fe20:1d8f () {68 vars in 1517 bytes} [Thu Jan 23 08:22:40 2025] POST /networking/v2.0/subnets => generated 665 bytes in 840 msecs (HTTP/1.1 201) 4 headers in 162 bytes (1 switches on core 0) Jan 23 08:24:23.878996 np0039651784 [email protected][60811]: DEBUG futurist.periodics [-] Submitting periodic callback 'neutron.plugins.ml2.drivers.ovn.mech_driver.ovsdb.maintenance.HashRingHealthCheckPeriodics.touch_hash_ring_node' {{(pid=60811) _process_scheduled /opt/stack/data/venv/lib/python3.12/site-packages/futurist/periodics.py:638}} """ * One of the workers (PID 60810) restart the activity 2 minutes later after the last log line to update the hash ring (now it should be updated every 15 seconds). That leads to: ** An incorrect count of the active nodes: Hash Ring loaded. 3 active nodes. 0 offline nodes ** The IDL disconnections [4]. It is worth mentioning that uWSGI option "enable-threads" is enabled right now. [1]https://review.opendev.org/c/openstack/neutron/+/932601 [2]https://8a3e2af9f348776bb6b6-c0288c15cf27fe5a39c9948ecafb7329.ssl.cf2.rackcdn.com/932601/17/check/neutron-ovn-tempest-ipv6-only-ovs-release-wsgi-4/e3be37a/testr_results.html [3]https://8a3e2af9f348776bb6b6-c0288c15cf27fe5a39c9948ecafb7329.ssl.cf2.rackcdn.com/932601/17/check/neutron-ovn-tempest-ipv6-only-ovs-release-wsgi-4/e3be37a/controller/logs/tempest_log.txt [4]https://paste.opendev.org/show/buu2O3Jt3AGcfTfsWrzf/ ** Affects: neutron Importance: Undecided Status: New -- You received this bug notification because you are a member of Yahoo! Engineering Team, which is subscribed to neutron. https://bugs.launchpad.net/bugs/2095590 Title: [OVN] Neutron API stops refreshing the hash ring node Status in neutron: New Bug description: The Neutron API, when using WSGI, stops refreshing the hash ring node if there is no activity. The OVN IDL connections (NB and SB) also disconnect. There are plenty of errors in the testing job [1]. For example, from [2]: * The last command executed before running the tempest tests happen at 08:22:41. * The tempest tests start at 08:24:23 [3]. * There is an activity gap in the Neutron API logs of around 100 seconds (from [4]): """ Jan 23 08:22:41.500324 np0039651784 [email protected][60810]: [pid: 60810|app: 0|req: 10/40] 2001:4802:7805:104:be76:4eff:fe20:1d8f () {68 vars in 1517 bytes} [Thu Jan 23 08:22:40 2025] POST /networking/v2.0/subnets => generated 665 bytes in 840 msecs (HTTP/1.1 201) 4 headers in 162 bytes (1 switches on core 0) Jan 23 08:24:23.878996 np0039651784 [email protected][60811]: DEBUG futurist.periodics [-] Submitting periodic callback 'neutron.plugins.ml2.drivers.ovn.mech_driver.ovsdb.maintenance.HashRingHealthCheckPeriodics.touch_hash_ring_node' {{(pid=60811) _process_scheduled /opt/stack/data/venv/lib/python3.12/site-packages/futurist/periodics.py:638}} """ * One of the workers (PID 60810) restart the activity 2 minutes later after the last log line to update the hash ring (now it should be updated every 15 seconds). That leads to: ** An incorrect count of the active nodes: Hash Ring loaded. 3 active nodes. 0 offline nodes ** The IDL disconnections [4]. It is worth mentioning that uWSGI option "enable-threads" is enabled right now. [1]https://review.opendev.org/c/openstack/neutron/+/932601 [2]https://8a3e2af9f348776bb6b6-c0288c15cf27fe5a39c9948ecafb7329.ssl.cf2.rackcdn.com/932601/17/check/neutron-ovn-tempest-ipv6-only-ovs-release-wsgi-4/e3be37a/testr_results.html [3]https://8a3e2af9f348776bb6b6-c0288c15cf27fe5a39c9948ecafb7329.ssl.cf2.rackcdn.com/932601/17/check/neutron-ovn-tempest-ipv6-only-ovs-release-wsgi-4/e3be37a/controller/logs/tempest_log.txt [4]https://paste.opendev.org/show/buu2O3Jt3AGcfTfsWrzf/ To manage notifications about this bug go to: https://bugs.launchpad.net/neutron/+bug/2095590/+subscriptions -- Mailing list: https://launchpad.net/~yahoo-eng-team Post to : [email protected] Unsubscribe : https://launchpad.net/~yahoo-eng-team More help : https://help.launchpad.net/ListHelp

