One correction - the number 5 in the mail below is my estimation of the number of nodes we might need. Can this be too small a cluster?
Arijit Dr. Arijit Mukherjee Principal Member of Technical Staff, Level-II Connectiva Systems (I) Pvt. Ltd. J-2, Block GP, Sector V, Salt Lake Kolkata 700 091, India Phone: +91 (0)33 23577531/32 x 107 http://www.connectivasystems.com -----Original Message----- From: Arijit Mukherjee [mailto:[EMAIL PROTECTED] Sent: Tuesday, November 04, 2008 3:47 PM To: [email protected] Subject: Hadoop hardware specs Hi All We're thinking of setting up a Hadoop cluster which will be used to create a prototype system for analyzing telecom data. The wiki page on machine scaling (http://wiki.apache.org/hadoop/MachineScaling) gives an overview of the node specs and from the Hadoop primer I found the following specs - * 5 x dual core CPUs * RAM - 4-8GB; ECC preferred, though more expensive * 2 x 250GB SATA drives (on each of the 5 nodes) * 1-5 TB external storage I'm curious to find out what sort of specs do people use normally. Is the external storage essential or will the individual disks on each node be sufficient? Why would you need an external storage in a hadoop cluster? How can I find out what other projects on hadoop are using? Cheers Arijit Dr. Arijit Mukherjee Principal Member of Technical Staff, Level-II Connectiva Systems (I) Pvt. Ltd. J-2, Block GP, Sector V, Salt Lake Kolkata 700 091, India Phone: +91 (0)33 23577531/32 x 107 http://www.connectivasystems.com No virus found in this incoming message. Checked by AVG - http://www.avg.com Version: 8.0.175 / Virus Database: 270.8.6/1765 - Release Date: 11/3/2008 4:59 PM
