To be more specific about 'and these people's jobs never hung...':

these people's map and reduce tasks never hung.
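
For anyone who wants to check other TaskTrackers for the same symptom, here is a minimal Python sketch. It is not part of Hadoop; the function name stuck_reduce_attempts and the min_reports threshold are made up for illustration. It assumes each log entry sits on a single line in the format of the TaskTracker output quoted below, and it flags reduce attempts whose every reported progress value is 0.0% in the copy phase.

```python
import re
from collections import defaultdict

# Matches TaskTracker progress lines such as:
#   14/01/03 06:14:04 INFO mapred.TaskTracker: attempt_..._r_000000_0 0.0% reduce > copy >
# The timestamp layout is taken from the log excerpt in this thread.
PROGRESS = re.compile(
    r"(\d{2}/\d{2}/\d{2} \d{2}:\d{2}:\d{2}) INFO mapred\.TaskTracker: "
    r"(attempt_\d+_\d+_r_\d+_\d+) (\d+(?:\.\d+)?)% reduce > copy"
)


def stuck_reduce_attempts(lines, min_reports=3):
    """Return {attempt_id: (first_ts, last_ts, report_count)} for reduce
    attempts that reported at least `min_reports` times and never moved
    past 0.0% in the copy phase. Hypothetical helper, not a Hadoop API."""
    reports = defaultdict(list)
    for line in lines:
        match = PROGRESS.search(line)
        if match:
            timestamp, attempt, percent = match.groups()
            reports[attempt].append((timestamp, float(percent)))

    stuck = {}
    for attempt, entries in reports.items():
        never_progressed = all(percent == 0.0 for _, percent in entries)
        if len(entries) >= min_reports and never_progressed:
            stuck[attempt] = (entries[0][0], entries[-1][0], len(entries))
    return stuck
```

It can be fed any iterable of lines, for example an open TaskTracker log file. An attempt that stays in the result for many minutes is a candidate for the hang described here; a slow but healthy copy phase would eventually report a non-zero percentage and drop out.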


On Fri, Jan 3, 2014 at 1:46 PM, Azuryy Yu <azury...@gmail.com> wrote:

> Hi Harsh,
> Thanks.
>
> There are no error logs for attempt_201312201200_34795_r_000000_0 in
> the TaskTracker log, only '0.0% reduce > copy >'.
>
> I configured all hosts on all slaves and the master.
>
> This job has only one reduce task, and it hung. I set everybody's max
> running jobs to '1' in the Fair Scheduler file,
>
> but some people's max running jobs is greater than one, and these people's
> jobs never hung...
>
>
> On Fri, Jan 3, 2014 at 1:13 PM, Harsh J <ha...@cloudera.com> wrote:
>
>> Does the Reduce task log (of attempt_201312201200_34795_r_000000_0)
>> show any errors in trying to communicate with the various TaskTrackers
>> in trying to obtain the data?
>>
>> On Fri, Jan 3, 2014 at 9:54 AM, Azuryy Yu <azury...@gmail.com> wrote:
>> > One addition:
>> >
>> > Our MR version is 1.2.1, not 1.0.4
>> >
>> > There is no useful information in the JT log.
>> >
>> >
>> > On Fri, Jan 3, 2014 at 12:20 PM, Azuryy Yu <azury...@gmail.com> wrote:
>> >>
>> >> Hi,
>> >>
>> >> Our prod cluster has run into some issues recently.
>> >> All map tasks finished successfully, but the reduce task hung.
>> >>
>> >> It does not happen on all TaskTrackers, only sometimes. We use
>> >> mapred-1.0.4.
>> >>
>> >> The task shows "0.0% reduce > copy >" forever until it is killed manually.
>> >>
>> >> reduce logs on the TaskTracker:
>> >> hadoop-hadoop-tasktracker-10-200-91-186.out:14/01/03 06:13:57 INFO mapred.TaskTracker: JVM with ID: jvm_201312201200_34795_r_-365330778 given task: attempt_201312201200_34795_r_000000_0
>> >> hadoop-hadoop-tasktracker-10-200-91-186.out:14/01/03 06:14:04 INFO mapred.TaskTracker: attempt_201312201200_34795_r_000000_0 0.0% reduce > copy >
>> >> hadoop-hadoop-tasktracker-10-200-91-186.out:14/01/03 06:14:08 INFO mapred.TaskTracker: attempt_201312201200_34795_r_000000_0 0.0% reduce > copy >
>> >> hadoop-hadoop-tasktracker-10-200-91-186.out:14/01/03 06:14:14 INFO mapred.TaskTracker: attempt_201312201200_34795_r_000000_0 0.0% reduce > copy >
>> >> hadoop-hadoop-tasktracker-10-200-91-186.out:14/01/03 06:14:17 INFO mapred.TaskTracker: attempt_201312201200_34795_r_000000_0 0.0% reduce > copy >
>> >> hadoop-hadoop-tasktracker-10-200-91-186.out:14/01/03 06:14:23 INFO mapred.TaskTracker: attempt_201312201200_34795_r_000000_0 0.0% reduce > copy >
>> >
>> >
>>
>>
>>
>> --
>> Harsh J
>>
>
>
