Public bug reported:

The Neutron API, when using WSGI, stops refreshing the hash ring node if
there is no activity. The OVN IDL connections (NB and SB) also
disconnect.

There are plenty of errors in the testing job [1]. For example, from [2]:
* The last command executed before running the tempest tests happen at 08:22:41.
* The tempest tests start at 08:24:23 [3].
* There is an activity gap in the Neutron API logs of around 100 seconds (from 
[4]):
"""
Jan 23 08:22:41.500324 np0039651784 [email protected][60810]: [pid: 
60810|app: 0|req: 10/40] 2001:4802:7805:104:be76:4eff:fe20:1d8f () {68 vars in 
1517 bytes} [Thu Jan 23 08:22:40 2025] POST /networking/v2.0/subnets => 
generated 665 bytes in 840 msecs (HTTP/1.1 201) 4 headers in 162 bytes (1 
switches on core 0)
Jan 23 08:24:23.878996 np0039651784 [email protected][60811]: DEBUG 
futurist.periodics [-] Submitting periodic callback 
'neutron.plugins.ml2.drivers.ovn.mech_driver.ovsdb.maintenance.HashRingHealthCheckPeriodics.touch_hash_ring_node'
 {{(pid=60811) _process_scheduled 
/opt/stack/data/venv/lib/python3.12/site-packages/futurist/periodics.py:638}}
"""

* One of the workers (PID 60810) restart the activity 2 minutes later after the 
last log line to update the hash ring (now it should be updated every 15 
seconds). That leads to:
** An incorrect count of the active nodes: Hash Ring loaded. 3 active nodes. 0 
offline nodes
** The IDL disconnections [4].


It is worth mentioning that uWSGI option "enable-threads" is enabled right now.


[1]https://review.opendev.org/c/openstack/neutron/+/932601
[2]https://8a3e2af9f348776bb6b6-c0288c15cf27fe5a39c9948ecafb7329.ssl.cf2.rackcdn.com/932601/17/check/neutron-ovn-tempest-ipv6-only-ovs-release-wsgi-4/e3be37a/testr_results.html
[3]https://8a3e2af9f348776bb6b6-c0288c15cf27fe5a39c9948ecafb7329.ssl.cf2.rackcdn.com/932601/17/check/neutron-ovn-tempest-ipv6-only-ovs-release-wsgi-4/e3be37a/controller/logs/tempest_log.txt
[4]https://paste.opendev.org/show/buu2O3Jt3AGcfTfsWrzf/

** Affects: neutron
     Importance: Undecided
         Status: New

-- 
You received this bug notification because you are a member of Yahoo!
Engineering Team, which is subscribed to neutron.
https://bugs.launchpad.net/bugs/2095590

Title:
  [OVN] Neutron API stops refreshing the hash ring node

Status in neutron:
  New

Bug description:
  The Neutron API, when using WSGI, stops refreshing the hash ring node
  if there is no activity. The OVN IDL connections (NB and SB) also
  disconnect.

  There are plenty of errors in the testing job [1]. For example, from [2]:
  * The last command executed before running the tempest tests happen at 
08:22:41.
  * The tempest tests start at 08:24:23 [3].
  * There is an activity gap in the Neutron API logs of around 100 seconds 
(from [4]):
  """
  Jan 23 08:22:41.500324 np0039651784 [email protected][60810]: 
[pid: 60810|app: 0|req: 10/40] 2001:4802:7805:104:be76:4eff:fe20:1d8f () {68 
vars in 1517 bytes} [Thu Jan 23 08:22:40 2025] POST /networking/v2.0/subnets => 
generated 665 bytes in 840 msecs (HTTP/1.1 201) 4 headers in 162 bytes (1 
switches on core 0)
  Jan 23 08:24:23.878996 np0039651784 [email protected][60811]: 
DEBUG futurist.periodics [-] Submitting periodic callback 
'neutron.plugins.ml2.drivers.ovn.mech_driver.ovsdb.maintenance.HashRingHealthCheckPeriodics.touch_hash_ring_node'
 {{(pid=60811) _process_scheduled 
/opt/stack/data/venv/lib/python3.12/site-packages/futurist/periodics.py:638}}
  """

  * One of the workers (PID 60810) restart the activity 2 minutes later after 
the last log line to update the hash ring (now it should be updated every 15 
seconds). That leads to:
  ** An incorrect count of the active nodes: Hash Ring loaded. 3 active nodes. 
0 offline nodes
  ** The IDL disconnections [4].

  
  It is worth mentioning that uWSGI option "enable-threads" is enabled right 
now.

  
  [1]https://review.opendev.org/c/openstack/neutron/+/932601
  
[2]https://8a3e2af9f348776bb6b6-c0288c15cf27fe5a39c9948ecafb7329.ssl.cf2.rackcdn.com/932601/17/check/neutron-ovn-tempest-ipv6-only-ovs-release-wsgi-4/e3be37a/testr_results.html
  
[3]https://8a3e2af9f348776bb6b6-c0288c15cf27fe5a39c9948ecafb7329.ssl.cf2.rackcdn.com/932601/17/check/neutron-ovn-tempest-ipv6-only-ovs-release-wsgi-4/e3be37a/controller/logs/tempest_log.txt
  [4]https://paste.opendev.org/show/buu2O3Jt3AGcfTfsWrzf/

To manage notifications about this bug go to:
https://bugs.launchpad.net/neutron/+bug/2095590/+subscriptions


-- 
Mailing list: https://launchpad.net/~yahoo-eng-team
Post to     : [email protected]
Unsubscribe : https://launchpad.net/~yahoo-eng-team
More help   : https://help.launchpad.net/ListHelp

Reply via email to