Anatomy of read in hdfs

Sidharth Kumar Thu, 06 Apr 2017 12:56:45 -0700

Hi Genies,

I have a small doubt that hdfs read operation is parallel or sequential
process. Because from my understanding it should be parallel but if I read
"hadoop definitive guide 4" in anatomy of read it says "*Data is streamed
from the datanode back **to the client, which calls read() repeatedly on
the stream (step 4). When the end of the **block is reached, DFSInputStream
will close the connection to the datanode, then find **the best datanode
for the next block (step 5). This happens transparently to the client, **which
from its point of view is just reading a continuous stream*."


So can you kindly explain me how read operation will exactly happens.


Thanks for your help in advance

Sidharth

Anatomy of read in hdfs

Reply via email to