Joel,

Why do you wait to the next day - when I want SMF data in a hurry I just
read the SYS1.MANx directly. Waiting until the next day to read 1 million
plus SMF records is a procedure problem and not a technical issue.

As far as the accuracy of VSAM statistics go, I concede that using SMF is no
more accurate than using LISTCAT. The same job that abends without updating
the statistics will also fail produce a type 64 records. I just happen to
think it is simpler to read a type 64 record than it is to insert a LISTCAT
before and after every job step that access a VSAM file. YMMV.

There are two basic problems with VSAM statistics. One is that you may be
missing frequency counts for inserts, splits, deletions, EXCP and whatever
for anything that has a VSAM file open when the power fails, the system
crashes, or the address itself fails for whatever reason. The counts for
those jobs are gone forever. I really don't see how a verify could possibly
correct that. How important these counters are to you really depends on what
you are doing with them, but where I have worked it is common practice to
start out any LISTC based analysis with a freshly reorged file, and discard
anything that has been touched by a failure.

The second problem is that there are counts such as number of records that
can become invalid. For example, if I have a KSDS with 1000 records, and I
have inserted 1500 more records when my CICS region abends, then my LISTCAT
will still show 1000 records. If subsequent processing then deletes 1001
records then my LISTCAT will be negative 1 record, while the actual count is
1499. If Verify is to rectify this statistic then it is going to have to
read the whole KSDS from front to back and count the records. And the logic
to actually figure out how the number of CI and CA splits that have occurred
in prior Opens really escapes me.

Calculating the difference for various statistics from a LISTCAT before and
after a job step is a perfectly valid and simple way to measure VSAM
activity for that open. Comparing before and after statistics in this way is
unaffected by the accuracy of the statistics, because it is the difference
between the two that you are measuring. But if you want to take a LISTCAT of
a VSAM dataset that was defined 18 months ago and use those statistics for
some sort meaningful tuning or capacity planning - well I for one fail to
see how it would be useful whether the statistics were in error or not.

Ron

> -----Original Message-----
> From: IBM Mainframe Discussion List [mailto:[EMAIL PROTECTED] On
> Behalf Of Joel C. Ewing
> Sent: Saturday, 2 July 2005 3:50 PM
> To: [email protected]
> Subject: Re: IBM VSAM Statistics are often Bogus
> 
> Ask yourself which is easier and faster in a case where one has reason
> to suspect a VSAM usage problem:
> (1) running LISTCAT, running program, running LISTCAT;
> or
> (2) running program, waiting until following day when offloaded SMF
> records become generally available, running or requesting a run of a
> batch program to read through 1M+ SMF records to possibly locate
> relevant statistics?
> 
> The "simplest" approach is frequently "better".
> 
> The fallacy of John's casino analogy is that the underreporting of VSAM
> statistics is not caused by unpredictable and unknown actions of
> individuals, but by specific understood system events which are mostly
> observable (ABEND or cancellation of address spaces) or associated with
> the running of specific programs.  You may not know exactly what was
> lost, but generally can determine when the losses occurred.  There are
> many useful cases where one can be 100% sure that you have an interval
> in which no statistics have been lost, and can therefore accurately
> assess the number of additional EXCP's, CI/CA splits, records
> fetched/added/deleted, etc. resulting from applications run in that
> interval.
> 
> As with many statistics related to MVS, one has to understand what they
> mean and their limitations in order to use them intelligently.  I don't
> accept that it makes sense to eliminate LISTCAT VSAM statistics just
> because some users don't understand the limitations.  Flagging the
> statistics as incomplete or questionable ought to be sufficient to warn
> neophyte users that there may be something about the values they don't
> understand - hopefully prompting them to research and understand the
> limitations.
> 
> 
> 

----------------------------------------------------------------------
For IBM-MAIN subscribe / signoff / archive access instructions,
send email to [EMAIL PROTECTED] with the message: GET IBM-MAIN INFO
Search the archives at http://bama.ua.edu/archives/ibm-main.html

Reply via email to