[ http://issues.apache.org/jira/browse/HADOOP-195?page=comments#action_12378132 ]
paul sutter commented on HADOOP-195: ------------------------------------ ooops, i was wrong by a few orders of magnitude. but i was also right by a couple orders of magnitude the best feature of the map/reduce approach is that you only have to make one thing fast, and every program you write is fast. and that one thing is sort. so again,.. im very pleased that we're taking a look at the sort path! once the copy phase is fixed, the next step is for the Yahoo guys to contribute David Cossock's sort ;) > transfer map output transfer with http instead of rpc > ----------------------------------------------------- > > Key: HADOOP-195 > URL: http://issues.apache.org/jira/browse/HADOOP-195 > Project: Hadoop > Type: Improvement > Components: mapred > Versions: 0.2 > Reporter: Owen O'Malley > Assignee: Owen O'Malley > Fix For: 0.3 > > The data transfer of the map output should be transfered via http instead > rpc, because rpc is very slow for this application and the timeout behavior > is suboptimal. (server sends data and client ignores it because it took more > than 10 seconds to be received.) -- This message is automatically generated by JIRA. - If you think it was sent incorrectly contact one of the administrators: http://issues.apache.org/jira/secure/Administrators.jspa - For more information on JIRA, see: http://www.atlassian.com/software/jira
