Hello,

Can you run your code in standalone (local) mode, so you can be sure that the
problem is not in your code?
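
If it helps, here is a minimal sketch of how standalone (local) mode could be
selected in conf/hama-site.xml. It assumes that your Hama release treats the
literal value "local" for bsp.master.address as the switch to the local
multithreaded runner. The task count of 2 is only an illustrative value.
Please check both properties against the hama-default.xml shipped with your
version.

```xml
<?xml version="1.0"?>
<configuration>
  <!-- Assumption: "local" selects the in-process BSP runner
       instead of a BSPMaster at host:port. -->
  <property>
    <name>bsp.master.address</name>
    <value>local</value>
  </property>
  <!-- Assumption: number of local BSP tasks (threads);
       2 is an arbitrary example value. -->
  <property>
    <name>bsp.local.tasks.maximum</name>
    <value>2</value>
  </property>
</configuration>
```

Then submit the same job again. If it fails in this mode as well, the problem
is more likely in the job code than in the pseudo-distributed setup.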

Kindly,
Anastasis

On 6 Sep 2013, at 12:35 PM, Mahesh Babu <[email protected]> wrote:

> Hi,
> 
> When I run a Hama job in pseudo-distributed mode (single node), I get the
> following error (in stdout):
>>>>>>>>>>>>> 
> attempt_201309061315_0005_000000_0: 13/09/06 14:01:39 DEBUG
> fs.FSInputChecker: DFSClient readChunk got seqno 593 offsetInBlock 38862848
> lastPacketInBlock false packetLen 66052
> attempt_201309061315_0005_000000_0: 13/09/06 14:01:39 DEBUG
> fs.FSInputChecker: DFSClient readChunk got seqno 594 offsetInBlock 38928384
> lastPacketInBlock false packetLen 66052
> attempt_201309061315_0005_000000_0: 13/09/06 14:01:40 DEBUG
> fs.FSInputChecker: DFSClient readChunk got seqno 595 offsetInBlock 38993920
> lastPacketInBlock false packetLen 66052
> attempt_201309061315_0005_000000_0: 13/09/06 14:01:40 DEBUG
> fs.FSInputChecker: DFSClient readC
> *13/09/06 14:03:29 INFO bsp.BSPJobClient: Job failed.*
> <<<<<<<<<<<<
> 
> 
> *hama-ubuntu-bspmaster-ubuntu.log*
>>>>>>>>>>>>> 
> 2013-09-06 14:03:21,422 DEBUG org.apache.hama.bsp.Counters: Adding
> SUPERSTEP_SUM
> 2013-09-06 14:03:23,423 DEBUG org.apache.hama.bsp.Counters: Adding
> SUPERSTEP_SUM
> 2013-09-06 14:03:25,424 DEBUG org.apache.hama.bsp.Counters: Adding
> SUPERSTEP_SUM
> *2013-09-06 14:03:25,425 INFO org.apache.hama.bsp.JobInProgress: Taskid
> 'attempt_201309061315_0005_000000_0' has failed.
> 2013-09-06 14:03:25,425 INFO org.apache.hama.bsp.TaskInProgress: Task
> 'task_201309061315_0005_000000' has failed.
> *2013-09-06 14:03:25,425 DEBUG org.apache.hama.bsp.JobInProgress: Removing
> /tmp/hadoop-ubuntu/bsp/local/bspMaster/job_201309061315_0005.xml and
> /tmp/hadoop-ubuntu/bsp/local/bspMaster/job_201309061315_0005.jar getJobFile
> = hdfs://localhost:9000/tmp/hadoop-ubuntu*/bsp/system/submit_714o6m/job.xml
> 2013-09-06 14:03:25,434 INFO org.apache.hama.bsp.JobInProgress: Job failed.
> 2013-09-06 14:03:25,434 DEBUG org.apache.hama.bsp.JobInProgress: Removing
> null and null getJobFile =
> hdfs://localhost:9000/tmp/hadoop-ubuntu/bsp/system/submit_714o6m/job.xml
> *<<<<<<<<<<<<<
> 
> *hama-ubuntu-groom-ubuntu.log*
>>>>>>>>>>>>> 
> 2013-09-06 14:03:14,660 DEBUG org.apache.hama.bsp.GroomServer: checking
> task: attempt_201309061315_0005_000000_0 starttime =1378456254247 lastping
> = 1378456334727 run state = RUNNING monitorPeriod = 10000 check = false
> 2013-09-06 14:03:24,660 DEBUG org.apache.hama.bsp.GroomServer: checking
> task: attempt_201309061315_0005_000000_0 starttime =1378456254247 lastping
> = 1378456334727 run state = RUNNING monitorPeriod = 10000 check = true
> 2013-09-06 14:03:24,660 INFO org.apache.hama.bsp.GroomServer: adding purge
> task: attempt_201309061315_0005_000000_0
> 2013-09-06 14:03:24,660 DEBUG org.apache.hama.bsp.GroomServer: Got 1
> oblivious tasks
> 2013-09-06 14:03:24,661 DEBUG org.apache.hama.bsp.GroomServer: Purging task
> org.apache.hama.bsp.GroomServer$TaskInProgress@2e0cd499
> *2013-09-06 14:03:24,661 INFO org.apache.hama.bsp.GroomServer: About to
> purge task: attempt_201309061315_0005_000000_0
> 2013-09-06 14:03:24,661 DEBUG org.apache.hama.bsp.GroomServer: Killing
> process for attempt_201309061315_0005_000000_0
> 2013-09-06 14:03:25,436 DEBUG org.apache.hama.bsp.GroomServer: Got Response
> from BSPMaster with 1 actions
> 2013-09-06 14:03:25,437 INFO org.apache.hama.bsp.GroomServer: Kill 1 tasks.
> *<<<<<<<<<<<<<
> 
> *attempt_201309061315_0005_000000_0.log*
>>>>>>>>>>>>> 
> 13/09/06 14:02:06 DEBUG ipc.RPC: Call: ping 2
> 13/09/06 14:02:07 DEBUG fs.FSInputChecker: DFSClient readChunk got seqno
> 633 offsetInBlock 41484288 lastPacketInBlock false packetLen 66052
> 13/09/06 14:02:14 DEBUG bsp.BSPTask: Pinging at time 1378456334726
> 13/09/06 14:02:14 DEBUG ipc.Client: IPC Client (47) connection to localhost/
> 127.0.0.1:49551 from ubuntu sending #24
> 13/09/06 14:02:14 DEBUG ipc.Client: IPC Client (47) connection to localhost/
> 127.0.0.1:49551 from ubuntu got value #24
> 13/09/06 14:02:14 DEBUG ipc.RPC: Call: ping 2
> 13/09/06 14:02:37 DEBUG bsp.BSPTask: Pinging at time 1378456357688
> 13/09/06 14:02:37 DEBUG ipc.Client: The ping interval is60000ms.
> 13/09/06 14:02:38 DEBUG ipc.Client: Use SIMPLE authentication for protocol
> BSPPeerProtocol
> 13/09/06 14:02:39 DEBUG ipc.Client: Connecting to localhost/127.0.0.1:49551
> 13/09/06 14:02:56 DEBUG ipc.Client: The ping interval is60000ms.
> 13/09/06 14:02:56 DEBUG ipc.Client: Use SIMPLE authentication for protocol
> ClientProtocol
> 13/09/06 14:02:57 DEBUG ipc.Client: Connecting to localhost/127.0.0.1:9000
> 13/09/06 14:02:58 DEBUG ipc.Client: IPC Client (47) connection to localhost/
> 127.0.0.1:49551 from ubuntu: closed
> 13/09/06 14:02:59 DEBUG ipc.Client: IPC Client (47) connection to localhost/
> 127.0.0.1:49551 from ubuntu: stopped, remaining connections 1
> <<<<<<<<<<<<<
> 
> Any idea why the job is failing? There are no exceptions or failures in any
> of the logs, even when I put the logs in DEBUG mode.
> 
> Thanks,
> Mahesh Babu
