Greetings Doug, This is where my confusion is. The formula that I'm using to compute the split load does not change, regardless of the block size. And yet you're saying changing my block size to 2k will change it from 1% to 6%.
So, here's my formula, based on the very informative FAQ from Rocket: SPLIT = INT(RECORDS PER BLOCK * IDSIZE * 100 / BLOCKSIZE) SPLIT = INT(1 * 21 * 100 / 2048) = 1 SPLIT = INT(2 * 21 * 100 / 4096) = 1 Given this formula, the split will never change, regardless of the block size because the RECORDS PER BLOCK will generally increase proportionally to the BLOCK SIZE. Sincerely, David Laansma IT Manager Hubbard Supply Co. Direct: 810-342-7143 Office: 810-234-8681 Fax: 810-234-6142 www.hubbardsupply.com "Delivering Products, Services and Innovative Solutions" -----Original Message----- From: [email protected] [mailto:[email protected]] On Behalf Of Doug Averch Sent: Friday, June 29, 2012 1:38 PM To: U2 Users List Subject: Re: [U2] Really trying to understand dynamic file sizing Hi Dave: You cannot get any lower than one percent. If you set your block size to 2K that will fix the split problem making about 6%. Your average record size is 1537 which means you will get about one record per block so split by key will be worthless and you will be into Level 2 and/or Level 1 overflow. I have found if you spend a lot of time on one file try to optimize it, then you had 100K worth of records to the file and your work is does not matter. What you want to do is get the file in best shape you can considering growth and usability, then look at this file or all your files in a week or month and see what has happened. Regards, Doug www.u2logic.com "XLr8Resizer for fast resizing" _______________________________________________ U2-Users mailing list [email protected] http://listserver.u2ug.org/mailman/listinfo/u2-users _______________________________________________ U2-Users mailing list [email protected] http://listserver.u2ug.org/mailman/listinfo/u2-users
