[ 
https://issues.apache.org/jira/browse/HADOOP-4264?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12636099#action_12636099
 ] 

Suman Sehgal commented on HADOOP-4264:
--------------------------------------

It's giving the same AlreadyBeingCreatedException.
2008-09-26 19:52:18,590 INFO org.apache.hadoop.mapred.TaskInProgress: Error 
from attempt_200809261929_0005_m_000001_0_1222457384229: Task 
attempt_200809261929_0005_m_000001_0_1222457384229 failed to report status for 
603 seconds. Killing!
2008-09-26 19:52:18,591 INFO org.apache.hadoop.mapred.JobTracker: Removed 
completed task 'attempt_200809261929_0005_m_000001_0_1222457384229' from 
'tracker_host:host/xx.xxx.xx.xxx:xxxxx'

2008-09-26 19:57:24,546 INFO org.apache.hadoop.mapred.TaskInProgress: Error 
from attempt_200809261929_0005_m_000001_1_1222457384229: 
org.apache.hadoop.ipc.RemoteException: 
org.apache.hadoop.hdfs.protocol.AlreadyBeingCreatedException: failed to create 
file /benchmarks/TestDFSIO/io_data/test_io_101 for 
DFSClient_attempt_200809261929_0005_m_000001_1_1222457384229 on client 
xx.xxx.xx.xxx, because this file is already being created by 
DFSClient_attempt_200809261929_0005_m_000001_0_1222457384229 on xx.xxx.xx.xxx
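
To make the clash explicit, here is a minimal sketch (plain Python, not part of Hadoop) that pulls the two attempt IDs out of the exception message above and checks whether they belong to the same map task. The ID layout in the regular expression is an assumption read off the log lines in this comment, and the helper name describe_conflict is hypothetical.

```python
import re

# Assumed layout, inferred from the log above (not taken from Hadoop source):
# attempt_<jobtracker start>_<job>_<m|r>_<task>_<attempt>_<numeric suffix>
ATTEMPT_RE = re.compile(
    r"attempt_(?P<jt>\d+)_(?P<job>\d+)_(?P<kind>[mr])"
    r"_(?P<task>\d+)_(?P<attempt>\d+)_\d+"
)

# Matches the AlreadyBeingCreatedException message once line wraps are removed.
CONFLICT_RE = re.compile(
    r"failed to create file (?P<path>\S+) for DFSClient_(?P<requester>attempt_\w+)"
    r".*?already being created by DFSClient_(?P<holder>attempt_\w+)"
)


def describe_conflict(log_text):
    """Return the contested path and the two attempts, or None if not found."""
    # The mail wraps log lines, so collapse all whitespace runs first.
    flat = " ".join(log_text.split())
    found = CONFLICT_RE.search(flat)
    if found is None:
        return None
    requester = ATTEMPT_RE.fullmatch(found.group("requester"))
    holder = ATTEMPT_RE.fullmatch(found.group("holder"))
    if requester is None or holder is None:
        return None
    # Same task means every field except the attempt number is identical.
    same_task = all(
        requester.group(key) == holder.group(key)
        for key in ("jt", "job", "kind", "task")
    )
    return {
        "path": found.group("path"),
        "same_task": same_task,
        "holder_attempt": int(holder.group("attempt")),
        "requester_attempt": int(requester.group("attempt")),
    }


# Excerpt copied from the log in this comment, including its line wraps.
SAMPLE = """
org.apache.hadoop.hdfs.protocol.AlreadyBeingCreatedException: failed to create
file /benchmarks/TestDFSIO/io_data/test_io_101 for
DFSClient_attempt_200809261929_0005_m_000001_1_1222457384229 on client
xx.xxx.xx.xxx, because this file is already being created by
DFSClient_attempt_200809261929_0005_m_000001_0_1222457384229 on xx.xxx.xx.xxx
"""

if __name__ == "__main__":
    print(describe_conflict(SAMPLE))
```

For the excerpt above, the sketch reports that both IDs belong to the same task and that attempt 0, the one killed for not reporting status, is the holder named in the exception raised for attempt 1.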

> DFSIO is failing on 500 nodes cluster
> -------------------------------------
>
>                 Key: HADOOP-4264
>                 URL: https://issues.apache.org/jira/browse/HADOOP-4264
>             Project: Hadoop Core
>          Issue Type: Bug
>          Components: io, test
>    Affects Versions: 0.19.0
>            Reporter: Suman Sehgal
>
> On executing the following command:
> bin/hadoop jar ~/hadoop/hadoop-0.19.0-test.jar TestDFSIO -write -nrFiles 990 -fileSize 320
> This error occurs:
> 08/09/24 06:15:03 INFO mapred.JobClient:  map 98% reduce 32%
> java.io.IOException: Job failed!
>       at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:1201)
>       at org.apache.hadoop.fs.TestDFSIO.runIOTest(TestDFSIO.java:236)
>       at org.apache.hadoop.fs.TestDFSIO.writeTest(TestDFSIO.java:218)
>       at org.apache.hadoop.fs.TestDFSIO.main(TestDFSIO.java:354)
>       at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>       at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
>       at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
>       at java.lang.reflect.Method.invoke(Method.java:597)
>       at org.apache.hadoop.util.ProgramDriver$ProgramDescription.invoke(ProgramDriver.java:68)
>       at org.apache.hadoop.util.ProgramDriver.driver(ProgramDriver.java:139)
>       at org.apache.hadoop.test.AllTestDriver.main(AllTestDriver.java:77)
>       at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>       at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
>       at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
>       at java.lang.reflect.Method.invoke(Method.java:597)
>       at org.apache.hadoop.util.RunJar.main(RunJar.java:165)
>       at org.apache.hadoop.mapred.JobShell.run(JobShell.java:54)
>       at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)
>       at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:79)
>       at org.apache.hadoop.mapred.JobShell.main(JobShell.java:68)
> On looking at the Hadoop logs, it seems that file names are clashing:
> 2008-09-24 06:21:41,618 INFO org.apache.hadoop.mapred.JobTracker: Removed 
> completed task 'attempt_200809240600_0005_m_000802_2_1222236048515' from 
> 'tracker_xxxx/client x.x.x.x:xxxxx'
> 2008-09-24 06:21:41,627 INFO org.apache.hadoop.mapred.JobTracker: Adding task 
> 'attempt_200809240600_0005_m_000802_4_1222236048515' to tip 
> task_200809240600_0005_m_000802, for tracker 'tracker_xxxx/client 
> x.x.x.x:xxxxx'
> 2008-09-24 06:21:41,627 INFO org.apache.hadoop.mapred.JobInProgress: Choosing 
> rack-local task task_200809240600_0005_m_000802
> 2008-09-24 06:21:41,724 INFO org.apache.hadoop.mapred.TaskInProgress: Error 
> from attempt_200809240600_0005_m_000900_2_1222236048515: 
> org.apache.hadoop.ipc.RemoteException: 
> org.apache.hadoop.hdfs.protocol.AlreadyBeingCreatedException: failed to 
> create file /benchmarks/TestDFSIO/io_data/test_io_20 for 
> DFSClient_attempt_200809240600_0005_m_000900_2_1222236048515 on client client 
> x.x.x.x, because this file is already being created by 
> DFSClient_attempt_200809240600_0005_m_000900_0_1222236048515 on client x.x.x.x

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
