On Aug 10, 2011, at 7:56 AM, Evert Lammerts wrote:

> A short, slightly off-topic question:
> 
>>      Also note that in this configuration that one cannot take
>> advantage of the "keep the machine up at all costs" features in newer
>> Hadoop's, which require that root, swap, and the log area be mirrored
>> to be truly effective.  I'm not quite convinced that those features are
>> worth it yet for anything smaller than maybe a 12 disk config.
> 
> Dell and Cloudera promote the C2100. I'd like to see the calculations behind 
> that config.

        If Dell is shipping the same box they shipped us to test a few months 
ago, the performance was pretty horrid vs. almost all their competitors.  The 
main problem was the controller--it was built for RAID, not for JBOD.  (... and 
then there is the OOB support...)


> Am I wrong thinking that keeping your cluster up with such dense nodes will 
> only work if you have many (order of magnitude 100+) of them, and 
> interconnected with 10Gb Ethernet? If you don't then recovery times from 
> failing disks / rack switches are going to get crazy, right?

        If one assumes that a bunch of nodes are failing at once, yes.  The 
irony is that ops teams tend to group repairs, so keeping them up might 
actually be the wrong thing in relation to actual practice.

> If you want to get bang for buck, don't the proportions "disk IO / processing 
> power", "node storage capacity / ethernet speed" and "total amount of nodes / 
> ethernet speed", indicate many small nodes with not too many disks and 1Gb 
> Ethernet?

        The biggest constraint is almost always RAM, as you can use it to help 
with the rest.

Reply via email to