I set the split load based on what Dan suggested:

"I'd take the merge down a little, to maybe 30% or even less, and maybe knock 
the split up a bit - say, 90% - to cut down on the splitting."

I thought this would cut down on splitting. Is there a certain formula, or way 
to calculate the split.load? What should my SPLIT.LOAD be around,
and how do you come up with that %?

Chris

> From: r...@lynden.com
> To: u2-users@listserver.u2ug.org
> Date: Tue, 3 Jul 2012 14:45:28 -0700
> Subject: Re: [U2] RESIZE - dynamic files
> 
> 37% is a very low load.  Reading disk records takes much longer than parsing 
> the records out of a disk record.  With variable record size and moderately 
> poor hashing, overflow is inevitable.  So, do you want 80,000 extra groups, 
> or 20,000 overflow buffers? I would go with the smaller number.  But for the 
> love of Knuth, do not set your split.load to 90% unless you have a perfectly 
> hashed file with uniformly sized records.
> 
> -----Original Message-----
> From: u2-users-boun...@listserver.u2ug.org 
> [mailto:u2-users-boun...@listserver.u2ug.org] On Behalf Of Chris Austin
> Sent: Tuesday, July 03, 2012 2:38 PM
> To: u2-users@listserver.u2ug.org
> Subject: Re: [U2] RESIZE - dynamic files
> 
> 
> This is why I'm confused.. Is the goal here to reduce 'overflow' or to 
> keep the 'Total size' of the disk down? If the goal is to keep the total
>  disk size down then it would appear
> you would want your actual load % a lot higher than 37%.. and then ignore 
> 'some' of the overflow..
> 
> Chris
> 
> 
> > But the total size of your file is up 60%.  Reading in 60% more records in 
> > a full select of the file is going to be much slower than a few more 
> > overflows.
> > 
> > 
> > -----Original Message-----
> > From: u2-users-boun...@listserver.u2ug.org 
> > [mailto:u2-users-boun...@listserver.u2ug.org] On Behalf Of Chris Austin
> > Sent: Tuesday, July 03, 2012 2:15 PM
> > To: u2-users@listserver.u2ug.org
> > Subject: Re: [U2] RESIZE - dynamic files
> > 
> > 
> > Dan,
> > 
> > I changed the MINIMUM.MODULUS to the value of 200003 as you suggested and 
> > my Actual Load has really gone down (as well as overflow). See below for 
> > the results:
> > 
> > File name ..................   GENACCTRN_POSTED
> > Pathname ...................   GENACCTRN_POSTED
> > File type ..................   DYNAMIC
> > File style and revision ....   32BIT Revision 12
> > Hashing Algorithm ..........   GENERAL
> > No. of groups (modulus) ....   200003 current ( minimum 200003, 5263 empty,
> >                                             3957 overflowed, 207 badly )
> > Number of records ..........   1290469
> > Large record size ..........   3267 bytes
> > Number of large records ....   180
> > Group size .................   4096 bytes
> > Load factors ...............   90% (split), 50% (merge) and 37% (actual)
> > Total size .................   836235264 bytes
> > Total size of record data ..   287394719 bytes
> > Total size of record IDs ...   21508521 bytes
> > Unused space ...............   527323832 bytes
> > Total space for records ....   836227072 bytes
> > 
> > My overflow is now @ 2%
> > My Load is @ 37% (actual)
> > 
> > granted my empty groups are now up to almost 3% but I hope that won't be a 
> > big factor. How does this look?
> > 
> > Chris
> 
>                                         
> _______________________________________________
> U2-Users mailing list
> U2-Users@listserver.u2ug.org
> http://listserver.u2ug.org/mailman/listinfo/u2-users
> _______________________________________________
> U2-Users mailing list
> U2-Users@listserver.u2ug.org
> http://listserver.u2ug.org/mailman/listinfo/u2-users
                                          
_______________________________________________
U2-Users mailing list
U2-Users@listserver.u2ug.org
http://listserver.u2ug.org/mailman/listinfo/u2-users

Reply via email to