Hi,

On 25.08.2011 at 08:58, Hakan İlter wrote:
> We are going to create a new Hadoop cluster in our company, and I would
> like some advice from you:
>
> 1. Has anyone stored all of their Hadoop data on NetApp or another storage
> system instead of on local disks? Do we have to store data on local disks,
> and if so, is it because of performance issues?

HDFS and MapReduce benefit massively from local storage: MapReduce tries to run each task on a node that already holds the data it reads. Using any kind of remote storage (SAN, Amazon S3, etc.) will therefore make things slower.

> 2. What do you think about running Hadoop nodes in virtual (VMware)
> servers?

Virtualization can make certain things easier to handle, but it's an extra layer that will eat resources.

Kai

-- 
Kai Voigt
[email protected]
