To write data to HDFS, you should be able to connect to both Namenode and to all of the datanodes. NameNode decides where to place the data blocks and tells the client (you) to send the blocks to corresponding datanodes.
Raghu. Phantom wrote:
Hi When I write data into HDFS do I always need to connect to the datanode to write the data ? Can I connect to any namenode to do so ? If so how does the datanode keep track of where the various chunks are going to be placed ? Any resource that I can look at or could someone explain this to me ? Thanks A
