Thanks Natarajan!
[email protected] From: Natarajan, Prabakaran 1. (NSN - IN/Bangalore) Date: 2014-12-19 12:22 To: [email protected] Subject: RE: Question about the behavior of HDFS. Where ever you upload, it upload evenly to all machines. Namenode will not have data but has only the metadata From: ext [email protected] [mailto:[email protected]] Sent: Friday, December 19, 2014 9:19 AM To: user Subject: Question about the behavior of HDFS. Hi Hadoopers, I got a question about the behavior of HDFS. Say, there are 1 namenode and 10 data nodes. On the namenode machine, i upload a 1G file to HDFS. Will this 1G file be distributed evenly to the data nodes, and there is no data stored on the namenode? If I upload the the data from the data node, will the file still distributed evenly to all the data nodes ? I think if most of the data reside on the node that i upload the data, it will save the network, but this leads to another problem, when MR this file, most of time will be spent on this node because it has to process most of the data. [email protected]
