HDFS: buffer before contacts Namenode?

elton sky Mon, 09 Aug 2010 19:14:48 -0700

hello folks,

I can see from the design doc of HDFS, says: client will buffer a block size
worth of data before contacting namenode for data node info. This is a
network throughput optimal way.
However, I could not find this buffer processing procedure in source code.


In DFSClient.DataStreamer, it waits for dataqueue to be not empty and starts
to request namenode and build a pipeline. The number of packets in the
dataqueue is always 1 when this happens!
I am confused here. Can anyone address this if I am wrong?

HDFS: buffer before contacts Namenode?

Reply via email to