Group size appears adequate (although anytime anything hashes into the group(s) 
with the largest record [3267b], you'll split: 3267 is 79.8% of 4096, so if you 
have a lot of records up in the 3K range, you may want to increase group size 
and decrease min modulus accordingly), but the minimum modulus should be a 
prime north of the current modulus, with a padding factor based on growth 
expectations. The sweet spot is where you have enough data in each group to 
avoid merging (I'd argue that 50% is a bit high for the merge; but that's 
because I'm unafraid of unused space, while I'm averse to file maintenance 
overhead), but not so much that you do a lot of splitting. You should do a 
count on the number of records, too. It almost never makes sense to have the 
modulus exceed the number of records by a substantial percentage.
 
So, you should increase minimum modulus to 83267 or higher, unless you double 
the group size to 8K, in which case something around 50K as a modulus sounds 
good. I'd take the merge down a little, to maybe 30% or even less, and maybe 
knock the split up a bit - say, 90% - to cut down on the splitting.
 
> From: cjausti...@hotmail.com
> To: u2-users@listserver.u2ug.org
> Date: Mon, 2 Jul 2012 14:55:21 -0500
> Subject: [U2] RESIZE - dynamic files
> 
> 
> I was wondering if anyone had instructions on RESIZE with a dynamic file? For 
> example I have a file called 'TEST_FILE'
> with the following:
> 
> 01 ANALYZE.FILE TEST_FILE
> File name ..................   TEST_FILE
> Pathname ...................   TEST_FILE
> File type ..................   DYNAMIC
> File style and revision ....   32BIT Revision 12
> Hashing Algorithm ..........   GENERAL
> No. of groups (modulus) ....   83261 current ( minimum 31 )
> Large record size ..........   3267 bytes
> Group size .................   4096 bytes
> Load factors ...............   80% (split), 50% (merge) and 80% (actual)
> Total size .................   450613248 bytes
> 
> How
>  do you calculate what the modulus and separation should be? I can't use
>  HASH.HELP on a type 30 file to see the recommended settings
> so I was wondering how best you figure out the file RESIZE.
> 
> Thanks,
> 
> Chris                                           
> _______________________________________________
> U2-Users mailing list
> U2-Users@listserver.u2ug.org
> http://listserver.u2ug.org/mailman/listinfo/u2-users
                                          
_______________________________________________
U2-Users mailing list
U2-Users@listserver.u2ug.org
http://listserver.u2ug.org/mailman/listinfo/u2-users

Reply via email to