Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Hadoop Wiki" for change 
notification.

The "FAQ" page has been changed by SomeOtherAccount.
http://wiki.apache.org/hadoop/FAQ?action=diff&rev1=72&rev2=73

--------------------------------------------------

  
  <<BR>> <<Anchor(3)>> '''3. [[#A3|How well does Hadoop scale?]]'''
  
- Hadoop has been demonstrated on clusters of up to 2000 nodes.  Sort 
performance on 900 nodes is good (sorting 9TB of data on 900 nodes takes around 
1.8 hours) and [[attachment:sort900-20080115.png|improving]] using these 
non-default configuration values:
+ Hadoop has been demonstrated on clusters of up to 4000 nodes.  Sort 
performance on 900 nodes is good (sorting 9TB of data on 900 nodes takes around 
1.8 hours) and [[attachment:sort900-20080115.png|improving]] using these 
non-default configuration values:
  
   * `dfs.block.size = 134217728`
   * `dfs.namenode.handler.count = 40`
@@ -276, +276 @@

  
  It appears that DatanodeID.getHost() is the standard place to retrieve this 
name, and the machineName variable, populated in DataNode.java\#startDataNode, 
is where the name is first set. The first method attempted is to get 
"slave.host.name" from the configuration; if that is not available, 
DNS.getDefaultHost is used instead.
  
+ <<BR>> <<Anchor(31)>> '''31. [[#A31|On an individual data node, how do you 
balance the blocks on the disk?]]'''
+ 
+ Hadoop currently does not have a method by which to do this automatically.  
To do this manually:
+ 
+  1. Take down the HDFS
+  2. Use the UNIX mv command to move the individual blocks and meta pairs from 
one directory to another on each host
+  3. Restart the HDFS
+ 

Reply via email to