Yes, the low numbers are used more often.
However, if you have sequential keys, just use the *last* two digits instead of
the first two.
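
A rough sketch of the difference, in Python rather than UniBasic, just to count records. The key range 1..25000 and the helper names are made up for illustration, and I am reading "two digits" as two digit values per phantom (0-1, 2-3, 4-5, 6-7, 8-9), applied to either the first or the last character of the key:

```python
from collections import Counter

def phantom_for(digit_char):
    """Map one digit character to a phantom number 0..4 (0-1, 2-3, 4-5, 6-7, 8-9)."""
    return int(digit_char) // 2

def distribution(keys, position):
    """Count how many keys each of the five phantoms gets.

    position=0 selects on the first character of the key,
    position=-1 selects on the last character.
    """
    counts = Counter(phantom_for(str(key)[position]) for key in keys)
    return [counts.get(p, 0) for p in range(5)]

# Hypothetical file: sequential numeric keys 1..25000 (an assumption, not real data).
keys = range(1, 25001)

first = distribution(keys, 0)
last = distribution(keys, -1)

print("split on first digit:", first)
print("split on last digit: ", last)
```

For this made-up key range the first-digit split comes out as 11111, 7223, 2222, 2222, 2222 records per phantom, while the last-digit split gives 5000 each. Other key ranges will skew differently, but the last digit of a sequential key cycles evenly, so it should stay close to balanced.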



-----Original Message-----
From: Wols Lists <antli...@youngman.org.uk>
To: u2-users <u2-users@listserver.u2ug.org>
Sent: Tue, Oct 2, 2012 1:17 am
Subject: Re: [U2] [u2] Parallel processing in Universe


On 01/10/12 22:47, Robert Houben wrote:
> Create an index on a dict pointing at the first character of the key, and
> have each phantom take two digits. (0-1, 2-3, 4-5, 6-7, 8-9)
> 
Actually, this is a very BAD way of chopping up a file into five even
chunks.

I'm not sure of the stats, but on any file with sequential keys, the
first phantom will get the majority of the records, the second get the
majority of what's left, etc etc.

A lot of people make the mistake of thinking this is a good technique.
I'm not even sure it works well with random numbers...

Cheers,
Wol
_______________________________________________
U2-Users mailing list
U2-Users@listserver.u2ug.org
http://listserver.u2ug.org/mailman/listinfo/u2-users

 