Re: z/VM 4.2 Dispatching Problem running about 120 Li nux copies

Sivey,Lonny Fri, 20 Sep 2002 13:36:42 -0700

As a temporary measure I issued SET QUICKDSP ON for all the Linux images
except 1.  This caused some immediate paging activity that has since dropped
back down to normal levels.  There are not currently any Linux systems
waiting on the eligible queue.

All SRM Values are still set to their defaults except MAXWSS.

q srm
IABIAS : INTENSITY=90%; DURATION=2
LDUBUF : Q1=100% Q2=75% Q3=60%
STORBUF: Q1=125% Q2=105% Q3=95%
DSPBUF : Q1=32767 Q2=32767 Q3=32767
DISPATCHING MINOR TIMESLICE = 5 MS
MAXWSS : LIMIT=5%
...... : PAGES=17956
XSTORE : 0%

q stor
STORAGE = 1536M

q xstor
HCPXST1401I Expanded storage is not available within this hardware
configuration.

q alloc page
            EXTENT EXTENT  TOTAL  PAGES   HIGH    %
VOLID  RDEV  START    END  PAGES IN USE   PAGE USED
------ ---- ------ ------ ------ ------ ------ ----
420RES 2271    194    277  15120  15120  15120 100%
               639    688   9000   8997   9000  99%
VMPASP 2281      0   1499 270000 133939 269977  49%
VMPA01 227E      0   3338 601020 132574 285116  22%
VMPA02 227F      0   3338 601020 136540 285106  22%
VMPA03 2280      0   3338 601020 131633 285112  21%
VMPA04 2282      0   3338 601020 131783 277199  21%
VMPA05 2283      0   3338 601020 135393 284971  22%
                          ------ ------        ----
SUMMARY                    3299K 825979         25%
USABLE                     3299K 825979         25%

ind
AVGPROC-035% 01
MDC READS-000003/SEC WRITES-000000/SEC HIT RATIO-100%
STORAGE-094% PAGING-0001/SEC STEAL-000%
Q0-00126(00000)                           DORMANT-00012
Q1-00000(00000)           E1-00000(00000)
Q2-00000(00000) EXPAN-001 E2-00000(00000)
Q3-00001(00000) EXPAN-002 E3-00000(00000)

PROC 0000-035%

LIMITED-00000

-----Original Message-----
From: Malcolm Beattie [mailto:[EMAIL PROTECTED]]
Sent: Friday, September 20, 2002 4:10 PM
To: [EMAIL PROTECTED]
Subject: Re: z/VM 4.2 Dispatching Problem running about 120 Li nux
copies

Davis, Jeff writes:
> Your indicate command shows that you have 8 users in the E list.  This
> happens when you don't have enough available real storage to fit a user's
> working set in to real storage.  When a user is in the E list, the system
> appears to be unavailable or down to that user.  He cannot run.  You can
do
> a couple of things.  The best is to get more real storage.  Unfortunately,
> that's not so easy.  Second, you can set your SRM settings to over
allocate
> real storage.  This will prevent users from going into the E list, but
will
> also drive up your paging rate.  Make sure you have plenty of paging
> resource to do this.

If you post the result of
    QUERY SRM
along with
    QUERY STORE
    QUERY XSTORE
    QUERY ALLOC PAGE
then we can check whether some of your SRM settings are unnecessarily
preventing the Linux guests from being dispatched. For example, if
your DSPBUF setting was inherited from one intended for CMS guests
then it may include a "reserve" for Q1 and Q2 guests which Linux will
never be able to make use of.

As Jeff says, the next most likely reason is that storage is not being
overcommitted enough. In the absence of the timer-on-demand patch and/or
fancy footwork with mm configuration changes/patches, CP sees the entire
storage allocation of each Linux guest as one large clumped working set,
despite Linux being fairly happy (usually) to have some of it ripped
away. Because of this, in many Linux situations you can allow CP to
overcommit real storage rather heavily (while increasing paging space
to back it, of course) without the paging space actually being needed
most of the time. That's the STORBUF setting for SRM.

There's also the LDUBUF setting which causes guests to be kicked into the
eligible list if CP thinks that they are loading the paging subsystem
too much. That estimate is calculated based on how many "exposures"
the paging subsystem has: basically, how many volumes it can page to in
parallel, modulo some tweakable multipliers. The Q ALLOC PAGE I asked
for above will let you work out if that's likely to need tweaking when
you start increasing STORBUF (and assuming Linux guests turn out to need
to page significantly).

--Malcolm

--
Malcolm Beattie <[EMAIL PROTECTED]>
Linux Technical Consultant
IBM EMEA Enterprise Server Group...
...from home, speaking only for myself

Re: z/VM 4.2 Dispatching Problem running about 120 Li nux copies

Reply via email to