after the successful fail over from primary to backup RDEVs for the OSAs, what 
state was reported for the primary RDEVs?
David


-----Original Message-----
From: The IBM z/VM Operating System on behalf of Brian Nielsen
Sent: Mon 4/24/2006 1:39 PM
To: [email protected]
Subject: [IBMVM] VSWITCH Controller failover issue on z/VM 5.2
 
This weekend the LAN team upgraded the CISCO router connected to the OSA =

card.  The VSWITCH controller console shows the message:

   DTCOSD309W Received adapter-initiated Stop Lan

after which it tries to fail over from the primary OSA to the backup OSA.=
  
Eventually I see:

   DTCOSD306I Received adapter-initiated Start Lan

and the backup OSA addresses get started fine and assign all the IP 
addresses.

There are several more iterations of Stop Lan and Start Lan, but it is =

always on the backup OSA addresses, never on the primary OSA addresses.

So the question is: why didn't it ever try to restart on the primary OSA =

when the backup OSA received the Stop Lan?  If both OSA cards were the =

same it might not matter that much, but the primary is a Gig-OSA and the =

backup is a Fast-OSA.

-----

Oh, and just to make it more fun, lastly there is a Stop Lan, the various=
 
normal messages about stopping, attempting to restart, and:

   AMPX036I ASSERTION FAILURE CHECKING ERROR
         TRACE BACK OF CALLED ROUTINES
    ROUTINE                          STMT AT ADDRESS IN MODULE
    SPSM_BLKALLOCATE                   39    00D429D0   TCFPSM_FPSM
    INITDCB                            48    00DA079A   TCTOOSD_TOOSD
    TOOSDINIT                          33    00DA2DF4   TCTOOSD_TOOSD
    CALLINITRTN                        30    00CB9506   TCPARSE_PARSETCP
    PROCESSSTARTSTOPSTATEMENT          33    00CAE37E   TCPARSE_PARSETCP
    PARSEOPTION                       292    00CB3C00   TCPARSE_PARSETCP
    RECEIVECONTROLLERMSGFROMCP         25    00D9F010   TCTOOSD_TOOSD
    TOIUCV                            180    00D3D580   TCTOIUC_TOIUCV
    Schedule                         1670    00CFA118
    <MAIN-PROGRAM>                     14    00C441FE   TCPIP
    VSPASCAL                                 00E4D74A

   DTCOSD100E Insufficient Fixed Page Storage Pool storage

after which it tries to shutdown and then goes into repeated:

  AMPX015I ADDRESSING EXCEPTION

abends.  Needless to say there were a lot of dump files in spool space.

So, IBM will get a call on the abends.  I could add more virtual storage =

to the controllers, but that will presumably only add cushion for more =

stops & restarts before it runs out of storage again.

Brian Nielsen

Reply via email to