On Apr 26, 2010, at 6:23 AM, Druilhe Remi wrote:
> For example, when I run "wordcount" example, there is HDFS communications and 
> MapReduce communications and I am not able to distinguish which packet belong 
> to HDFS or to MapReduce.

This shouldn't be too surprising given that the MapReduce job needs to talk to 
HDFS to determine input and to write output.

> A way could be to use odd port number for HDFS and even port number for 
> MapReduce, but I think I have to modify source code.

The ports for the services are already separated out.  

In general, client -> server connections map out as:

MR -> MR, HDFS
HDFS -> HDFS

Given a small 3 node grid, a dump of what processes open what ports, and what 
connections are made between all the machines, it should be trivial to make a 
more complex connection map.  [You can probably even do it as a map reduce job. 
:) ]

Reply via email to