On 11/6/19 4:14 PM, Nathan Lynch wrote:
Hi Tom,
Thomas Falcon <tlfal...@linux.ibm.com> writes:
After a migration, it is necessary to send a gratuitous ARP
from all running interfaces so that the rest of the network
is aware of its new location. However, some supported network
devices are unaware that they have been migrated. To avoid network
interruptions and other unwanted behavior, force a GARP on all
valid, running interfaces as part of the post_mobility_fixup
routine.
[...]
@@ -331,6 +334,8 @@ void post_mobility_fixup(void)
 {
 	int rc;
 	int activate_fw_token;
+	struct net_device *netdev;
+	struct net *net;
 	activate_fw_token = rtas_token("ibm,activate-firmware");
 	if (activate_fw_token == RTAS_UNKNOWN_SERVICE) {
@@ -371,6 +376,21 @@ void post_mobility_fixup(void)
 	/* Possibly switch to a new RFI flush type */
 	pseries_setup_rfi_flush();
+	/* need to force a gratuitous ARP on running interfaces */
+	rtnl_lock();
+	for_each_net(net) {
+		for_each_netdev(net, netdev) {
+			if (netif_device_present(netdev) &&
+			    netif_running(netdev) &&
+			    !(netdev->flags & (IFF_NOARP | IFF_LOOPBACK)))
+				call_netdevice_notifiers(NETDEV_NOTIFY_PEERS,
+							 netdev);
+				call_netdevice_notifiers(NETDEV_RESEND_IGMP,
+							 netdev);
+		}
+	}
+	rtnl_unlock();
+
This isn't an outright nak, but this is not nice. It illustrates the
need to rethink the pseries partition migration code. There is no
mechanism for drivers and other interested code to prepare for a
migration or to adjust to the destination. So post_mobility_fixup() will
continue to grow into a fragile collection of calls into unrelated
subsystems until there is a better design -- either a pseries-specific
notification/callback mechanism, or something based on the pm framework.
My understanding is that this is needed specifically for ibmveth and,
unlike ibmvnic, the platform does not provide any notification to that
driver that a migration has occurred, right?
Correct. The ibmveth device, unlike ibmvnic, receives no signal or
notification at all in the event of a partition migration, so it cannot
handle it or send a gratuitous ARP: from the driver's perspective,
nothing has changed. As you've described, there is no existing notifier
in the kernel to inform interested parties that the system has migrated
or is about to migrate. Without adding the needed infrastructure to do
that, I'm not sure how else to fix this.
Tom
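
For illustration only, the kind of notification/callback mechanism discussed
above might look roughly like the following. This is a userspace model in
plain C, not kernel code and not anything proposed in this thread: every name
in it (migration_event, migration_notifier, migration_notifier_register,
migration_notify, veth_like_migration_cb) is hypothetical. It only shows the
shape of the idea: interested code registers a callback, and the migration
path walks the chain instead of calling into each subsystem by hand.

```c
#include <stddef.h>

/* Hypothetical events; nothing like this is defined in the kernel. */
enum migration_event {
	MIGRATION_PREPARE,	/* about to leave the source system */
	MIGRATION_COMPLETE,	/* now running on the destination */
};

struct migration_notifier {
	/* Return 0 on success, nonzero to report a failure. */
	int (*call)(struct migration_notifier *nb, enum migration_event ev);
	struct migration_notifier *next;
};

static struct migration_notifier *migration_chain;

/* Append nb so that callbacks run in registration order. */
static void migration_notifier_register(struct migration_notifier *nb)
{
	struct migration_notifier **p = &migration_chain;

	while (*p)
		p = &(*p)->next;
	nb->next = NULL;
	*p = nb;
}

/* Invoke every callback; return how many of them reported a failure. */
static int migration_notify(enum migration_event ev)
{
	struct migration_notifier *nb;
	int failures = 0;

	for (nb = migration_chain; nb; nb = nb->next)
		if (nb->call(nb, ev))
			failures++;
	return failures;
}

/* Stand-in for a driver such as ibmveth: count the GARPs it would send. */
static int garp_requests;

static int veth_like_migration_cb(struct migration_notifier *nb,
				  enum migration_event ev)
{
	(void)nb;
	if (ev == MIGRATION_COMPLETE)
		garp_requests++;	/* a real driver would notify its peers here */
	return 0;
}

static struct migration_notifier veth_like_nb = {
	.call = veth_like_migration_cb,
};
```

In this model, a driver would call migration_notifier_register(&veth_like_nb)
once at probe time, and the migration path would call
migration_notify(MIGRATION_COMPLETE) where post_mobility_fixup() currently
open-codes the loop over network devices. An in-kernel version would
presumably build on the existing notifier chain infrastructure rather than a
hand-rolled list.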