On Wed, 26 Mar 2008 09:38:10 -0700, Skip Robinson 
<[EMAIL PROTECTED]> wrote:

>We've had occasional episodes of lost data.
>
>One case, which we think we've fixed via changes to automation, is the
>other side of the event-driven nature of MANx management. A strength of
>traditional SMF recording is that you dump and clear a MANx cluster in
>response to a message saying that it needs dumping. That's why you don't
>have to worry much about whether you've collected ten hours or ten minutes
>worth of data. You dump whatever's there, clear the file, and move on. A
>problem crops up when you IPL with all defined SMF data sets in need of
>dumping. They're not necessarily 'full'; they just can't be opened because,
>well, who knows why. I've seen MANx data sets at 1% that nonetheless
>require dumping. Go figure. But if all MANx data sets are full at IPL, data
>starts getting buffered. In our shop, the messages that should trigger
>dumping come out before our automation product has hit the ground. Our
>so-far-untested automation change is to trigger a dump when buffering has
>reached a certain threshold. Without that additional check, you can lose
>data when you run out of buffers.
>
>Another case is harder to deal with. We dump straight to tape. Once in a
>while we have an extended tape outage for hardware or software maintenance.
>Of course we could code for this situation by writing first to DASD, then
>to tape. Heck, we could write a whole subsystem. Ad infinitum.
>
>System Logger in principle handles all these cases by its nature. As SMF
>records flow, Logger captures them and writes them to DASD. A new offload
>data set is allocated as often as necessary: once a day or once a minute.
>The full-at-IPL condition simply goes away. Logger never gets full. If tape
>is unavailable for some period of time, records simply accumulate until a
>dump job can eventually be run.
>
>The worst fallout of SMF data loss is that all records are treated alike.
>It's simple FIFO, records you need, records you might like to have, records
>that may or may not prove crucial down the road. You capture them all until
>recording is impaired, then you lose everything. The current complexity of
>managing Logger offload data is presumably temporary, although no relief
>has actually been announced. Given the well known historical problems with
>MANx data sets, I still think Logger is worth the effort today.
>
>JO.Skip Robinson
>Southern California Edison Company
>Electric Dragon Team Paddler
>SHARE MVS Program Co-Manager
>626-302-7535 Office
>323-715-0595 Mobile
>[EMAIL PROTECTED]
>
>
>
>On 03/26/2008 12:44 AM, "Vernooy, C.P. - SPLXM" <[EMAIL PROTECTED]>,
>via the IBM Mainframe Discussion List <[EMAIL PROTECTED]>, wrote to
>[email protected] on the subject "Re: SMF System Logger - limitations
>of MANx":
>"R.S." <[EMAIL PROTECTED]> wrote in message
>news:<[EMAIL PROTECTED]>...
>> Mark Zelden wrote:
>> [...]
>> > As mentioned... lots in the archives about this (even before the
>> > recent threads).
>> >
>> > 1) Speed of offloading (being able to keep up with records being
>> > written).
>> IMHO I can live with it (YMMV). In case of SMF (expected & accepted)
>> flood I can use more MANx datasets as a spill.
>
>This is not the solution: I have seen occasions where records are
>presented to SMF faster than SMF can write them to MANx. This will cause
>SMF to run out of buffers, with records lost as a consequence.
>
>Kees.
>
>----------------------------------------------------------------------
>For IBM-MAIN subscribe / signoff / archive access instructions,
>send email to [EMAIL PROTECTED] with the message: GET IBM-MAIN INFO
>Search the archives at http://bama.ua.edu/archives/ibm-main.html


Two points for consideration:

1) A possible technique for avoiding SMF data loss (you decide the approach 
and which record types, such as SMF 101s or 116s, count as non-critical) is 
to set up an automation rule that fires when the SYSLOG message appears 
stating "SMF is at 75% BUFFERS..." (message IEE986E).  The rule would issue 
a SET SMF command to enable an alternate SMFPRMxx member that deactivates 
some subset of non-critical, large-volume contributor SMF record types, 
such as SMF 101s.  Once the condition is relieved, the normal production 
SMFPRMxx member would be re-enabled in the same fashion, or after some 
defined time period.
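The rule above can be sketched as follows. This is a minimal illustration, 
not any particular automation product's syntax: the callback shape, the 
issue_command hook, and the member suffixes "00" and "NB" are all 
hypothetical names chosen for the example.

```python
# Sketch of the IEE986E-triggered member switch described in point 1.
# Assumptions (hypothetical, not from the post): the automation product
# invokes a Python callback per SYSLOG message and supplies a function
# that issues MVS operator commands.

NORMAL_SUFFIX = "00"   # production SMFPRMxx member suffix (hypothetical)
REDUCED_SUFFIX = "NB"  # member with non-critical types (e.g. 101s) dropped

def react_to_message(msg_id, msg_text, issue_command):
    """Switch to the reduced SMFPRMxx member on a 75%-buffers warning."""
    if msg_id == "IEE986E" and "75%" in msg_text:
        # Shed non-critical, high-volume record types until relieved.
        issue_command("SET SMF=" + REDUCED_SUFFIX)
        return "reduced"
    # All other messages: leave the current member in place.
    return "unchanged"
```

Re-enabling the production member would be a second rule keyed on whatever 
relief indication (or timer) the shop chooses, issuing SET SMF=00 the same 
way.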

2) The issue of duplicate SMF data across IFASMFDL (SMF logstream-enabled) 
dumps (see note #1 below) still goes unaddressed without a secondary 
data-filter pass (and/or a sort with NODUPLICATES).  Some would call this 
duplicate-data condition unacceptable, given the extra data handling 
required.  And it need not be tied to any accounting/chargeback scenario, 
either.
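The secondary de-duplication pass could look something like this. The key 
fields (record type, timestamp, system ID) are illustrative only; a real 
pass would key on whatever combination of fields uniquely identifies a 
record in the shop's data.

```python
# Sketch of the secondary data-filter pass mentioned in point 2: drop
# records already seen in an earlier IFASMFDL dump. Record layout and
# key fields here are hypothetical, for illustration.

def dedupe_records(records, seen=None):
    """Yield only records whose (type, timestamp, system) key is new.

    Passing the same 'seen' set across successive dumps filters out
    records duplicated between overlapping dump windows.
    """
    seen = set() if seen is None else seen
    for rec in records:
        key = (rec["type"], rec["timestamp"], rec["system"])
        if key not in seen:
            seen.add(key)
            yield rec
```

A sort with NODUPLICATES achieves the same effect in one batch step, at 
the cost of sorting the combined dump output.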


Scott Barry
SBBWorks, Inc.
_________________________

Note #1:  Most concerning is intraday dumping, for example with the CA MICS 
Incremental Database Update feature -- although MICS will reject any data 
it has already processed.

Links:

BUFSIZMAX, BUFUSEWARN, and NOBUFFS - Specifying SMF buffer options:
http://publib.boulder.ibm.com/infocenter/zos/v1r9/topic/com.ibm.zos.r9.ieag200/buffopt.htm
