Hi all, In my two-datanode cluster, I require that file operations on the underlying filesystem take place in the same order. Essentially, I wish for blocks to be created, written, and/or deleted deterministically across datanodes.
However, this is not the case towards the end of the TestDFSIO benchmark. Several blocks are deleted, but each datanode performs this deletion at a *different time* relative to the last few blocks being written. What component is initiating the block deletion at the end of the benchmark? (It seems to be the Replication Monitor, but I'm unclear on what causes the Replication Monitor to suddenly run and delete blocks at the end of the benchmark). I am using Hadoop 1.0.4. Thank you, Eitan Rosenfeld