Pawel,

When sorting very large files one consideration must be whether the file is
to be sorted multiple times.

Also, it can be significantly more efficient to split the file into smaller
subsets, sort the subsets, then perform a merge to recombine them.

What you have to remember is that the time required for a sort does not grow
linearly with the number of records, but is an exponential function, when
the exact equation is dependent upon the total number of records, the
sorting algorithm chosen, and the ordering of the key fields in the input
file.  A worst case scenario can be calculated.  However, the mathematics is
difficult and far too few analysts these days are willing to take the time.
If you are really interesting, take a look at "The Art of Computer
Programming, v3, "Sorting and Searching", by Donald E. Knuth

John P. Baker

-----Original Message-----
From: IBM Mainframe Discussion List [mailto:[EMAIL PROTECTED] On Behalf
Of Pawel Leszczynski
Sent: 03/09/2008 5:06 PM
To: [email protected]
Subject: how fast can I sort on mainframe (using DFSORT)?

Hello everybody,

I realize subject is VEEEERY broad and my question VEEEERY general, but...

Recently in our shop we are reviewing our whole batch processing.
Most of the time of EOD processing is consumed by sort of many big
sequential 
files. (One such file has approximately order of 10GB, 10mln records)
I listed few tens of such batch jobs (the longest-lasting ones)
and computed mean sorting rate.
It appeared to be about 1GB/min ~ 17MB/sec

I suppose it's very poor result(???).

Can you tell me how much I can improve this?

These batch jobs are little CPU-consuming (~10% of one CPU),
I suppose that major concern is to:

-limit EXCPS (1)
-increase throughput rate from DASD to central storage (2)

I realize that sorting whole file in central storage (hiperspace sorting)
would 
eliminate need to use work files and EXCPS to them.
How much central storage is needed to handle in-storage sorting for
let's say 10 GB file???

How much can I  improve (2)?

Can you tell me what is mean sorting rate in your installation?
Before starting I would just like to know if I can achieve substantial 
improvement.

TIA,
Pawel Leszczynski
PKO BP SA

----------------------------------------------------------------------
For IBM-MAIN subscribe / signoff / archive access instructions,
send email to [EMAIL PROTECTED] with the message: GET IBM-MAIN INFO
Search the archives at http://bama.ua.edu/archives/ibm-main.html

Reply via email to