[ 
https://issues.apache.org/jira/browse/TEZ-1387?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jeff Zhang resolved TEZ-1387.
-----------------------------
    Resolution: Won't Fix

Close this jira as disk Issue has been propagated to diagnostics.  

> Add proper diagonstics message for disk issues
> ----------------------------------------------
>
>                 Key: TEZ-1387
>                 URL: https://issues.apache.org/jira/browse/TEZ-1387
>             Project: Apache Tez
>          Issue Type: Sub-task
>            Reporter: Yesha Vora
>
> Tez prints 'java.io.IOException: Spill failed' message where disks are full. 
> It should print better diagnostic message such as  "disk is full" .
> {noformat}
> 2014-06-13 12:09:37,202 INFO [main] 
> org.apache.tez.runtime.library.common.sort.impl.dflt.DefaultSorter: (EQUATOR) 
> 146679246 kvi 36669804(146679216)
> 2014-06-13 12:09:37,383 WARN [SpillThread [finalreduce_] 
> org.apache.tez.runtime.library.common.sort.impl.dflt.DefaultSorter: Got an 
> exception in sortAndSpill
> org.apache.hadoop.util.DiskChecker$DiskErrorException: Could not find any 
> valid local directory for 
> attempt_1402677732456_0109_1_00_000003_1_10003_spill_0.out
>         at 
> org.apache.hadoop.fs.LocalDirAllocator$AllocatorPerContext.getLocalPathForWrite(LocalDirAllocator.java:402)
>         at 
> org.apache.hadoop.fs.LocalDirAllocator.getLocalPathForWrite(LocalDirAllocator.java:150)
>         at 
> org.apache.hadoop.fs.LocalDirAllocator.getLocalPathForWrite(LocalDirAllocator.java:131)
>         at 
> org.apache.tez.runtime.library.common.task.local.output.TezTaskOutputFiles.getSpillFileForWrite(TezTaskOutputFiles.java:183)
>         at 
> org.apache.tez.runtime.library.common.sort.impl.dflt.DefaultSorter.spill(DefaultSorter.java:739)
>         at 
> org.apache.tez.runtime.library.common.sort.impl.dflt.DefaultSorter.sortAndSpill(DefaultSorter.java:723)
>         at 
> org.apache.tez.runtime.library.common.sort.impl.dflt.DefaultSorter$SpillThread.run(DefaultSorter.java:655)
> 2014-06-13 12:09:37,389 INFO [main] 
> org.apache.tez.runtime.LogicalIOProcessorRuntimeTask: Final Counters : 
> Counters: 10 [[org.apache.tez.common.counters.TaskCounter 
> SPLIT_RAW_BYTES=221, SPILLED_RECORDS=0, INPUT_RECORDS_PROCESSED=16398, 
> OUTPUT_RECORDS=16397, OUTPUT_BYTES=173778252, OUTPUT_BYTES_WITH_OVERHEAD=0, 
> OUTPUT_BYTES_PHYSICAL=0, ADDITIONAL_SPILLS_BYTES_WRITTEN=0, 
> ADDITIONAL_SPILLS_BYTES_READ=0, ADDITIONAL_SPILL_COUNT=0]]
> 2014-06-13 12:09:37,390 INFO [Tez Container Heartbeat Thread 
> [container_1402677732456_0109_01_000021]] 
> org.apache.hadoop.mapred.YarnTezDagChild: Heartbeat thread interrupted.  
> stopped: true error: false
> 2014-06-13 12:09:37,390 INFO [Tez Container Heartbeat Thread 
> [container_1402677732456_0109_01_000021]] 
> org.apache.hadoop.mapred.YarnTezDagChild: Current task marked as complete. 
> Stopping heartbeat thread and allowing normal container shutdown
> 2014-06-13 12:09:37,390 FATAL [main] 
> org.apache.hadoop.mapred.YarnTezDagChild: Error running child : 
> java.io.IOException: Spill failed
>         at 
> org.apache.tez.runtime.library.common.sort.impl.dflt.DefaultSorter.checkSpillException(DefaultSorter.java:686)
>         at 
> org.apache.tez.runtime.library.common.sort.impl.dflt.DefaultSorter.collect(DefaultSorter.java:211)
>         at 
> org.apache.tez.runtime.library.common.sort.impl.dflt.DefaultSorter.write(DefaultSorter.java:185)
>         at 
> org.apache.tez.runtime.library.output.OnFileSortedOutput$1.write(OnFileSortedOutput.java:116)
>         at 
> org.apache.tez.mapreduce.processor.map.MapProcessor$NewOutputCollector.write(MapProcessor.java:373)
>         at 
> org.apache.tez.mapreduce.hadoop.mapreduce.TaskInputOutputContextImpl.write(TaskInputOutputContextImpl.java:90)
>         at 
> org.apache.hadoop.mapreduce.lib.map.WrappedMapper$Context.write(WrappedMapper.java:112)
>         at org.apache.hadoop.mapreduce.Mapper.map(Mapper.java:124)
>         at org.apache.hadoop.mapreduce.Mapper.run(Mapper.java:145)
>         at 
> org.apache.tez.mapreduce.processor.map.MapProcessor.runNewMapper(MapProcessor.java:247)
>         at 
> org.apache.tez.mapreduce.processor.map.MapProcessor.run(MapProcessor.java:134)
> {noformat}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to