Hi Avery,

Thanks. It has worked at least once now :).

Aapo

On Oct 1, 2011, at 2:27 AM, Avery Ching wrote:

> Hi Aapo,
> 
> Thanks for the error report.  I think you found a bug.  Can you try the 
> included patch and see if the problem goes away?  I got it to pass local 
> and MR unittests.
> 
> Avery
> 
> On 9/30/11 4:08 PM, Aapo Kyrola wrote:
>> 
>> Hi,
>> 
>> Occasionally (maybe one time in four), my Giraph run fails because of the 
>> RuntimeException below.
>> According to the code, it should never happen:
>> 
>>  if (msgMap == null) { // should never happen after constructor
>>      throw new RuntimeException(
>>          "sendMessage: msgMap did not exist for " +
>>          addr + " for vertex " + destVertex);
>>  }
>> 
>> This happens during superstep 1 (the second superstep). My application
>> actually *adds* edges on superstep 1 (to make every out-edge also an
>> in-edge of the destination), but since I am running on only 3 workers,
>> I would be surprised if not every worker had been registered in the
>> RPC layer initially.
>> 
>> One hypothesis is that Hadoop does something funny, because one of my
>> servers was under heavy load. Maybe Hadoop launched another worker to
>> replace a slow worker? Can that happen?
>> 
>> java.lang.RuntimeException: sendMessage: msgMap did not exist for [hostname].ml.cmu.edu:30003 for vertex 875713
>>         at org.apache.giraph.comm.BasicRPCCommunications.sendMessageReq(BasicRPCCommunications.java:825)
>>         at org.apache.giraph.graph.BasicVertex.sendMsg(BasicVertex.java:179)
>>         at edu.cmu.selectlab.BP.BinaryBPVertex.compute(BinaryBPVertex.java:94)
>>         at org.apache.giraph.graph.GraphMapper.map(GraphMapper.java:624)
>>         at org.apache.hadoop.mapreduce.Mapper.run(Mapper.java:144)
>>         at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:763)
>>         at org.apache.hadoop.mapred.MapTask.run(MapTask.java:369)
>>         at org.apache.hadoop.mapred.Child$4.run(Child.java:259)
>>         at java.security.AccessController.doPrivileged(Native Method)
>>         at javax.security.auth.Subject.doAs(Subject.java:396)
>>         at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1059)
>>         at org.apache.hadoop.mapred.Child.main(Child.java:253)
>> 
>> 
>> Aapo Kyrola
>> Ph.D. student, http://www.cs.cmu.edu/~akyrola
>> 
> 
> <diff.txt>

Aapo Kyrola
Ph.D. student, http://www.cs.cmu.edu/~akyrola
