I am a newbie.
It isn't clear to me whether I should post this type of issue to dev or
user, so I decided to post to [email protected]

We have a reproducible script that works with -x mapreduce but fails with
-x tez:

Failure while running
task:org.apache.pig.backend.executionengine.ExecException: ERROR 0: Input
from vertex scope-108 is missing
[snip]
pigstats.tez.TezPigScriptStats - Script Statistics:

       HadoopVersion: 2.5.0-cdh5.3.2
          PigVersion: 0.15.0-SNAPSHOT
          TezVersion: 0.6.0
              UserId: root
            FileName: foo.pig
           StartedAt: 2015-03-27 08:47:40
          FinishedAt: 2015-03-27 08:49:08
            Features: HASH_JOIN,GROUP_BY,FILTER,UNION

Failed!

Q: Is this a known issue?
Q: What kind of diagnostics/info can I provide to assist?

Michael


Full backtrace:

, progress=TotalTasks: 497 Succeeded: 24 Running: 0 Failed: 1 Killed: 472
FailedTaskAttempts: 11, diagnostics=Vertex failed, vertexName=scope-122,
vertexId=vertex_1427151599024_0195_1_08, diagnostics=[Task failed,
taskId=task_1427151599024_0195_1_08_000001, diagnostics=[TaskAttempt 0
failed, info=[Error: Failure while running
task:org.apache.pig.backend.executionengine.ExecException: ERROR 0: Input
from vertex scope-108 is missing
        at
org.apache.pig.backend.hadoop.executionengine.tez.plan.udf.ReadScalarsTez.attachInputs(ReadScalarsTez.java:77)
        at
org.apache.pig.backend.hadoop.executionengine.tez.runtime.PigProcessor.initializeInputs(PigProcessor.java:293)
        at
org.apache.pig.backend.hadoop.executionengine.tez.runtime.PigProcessor.run(PigProcessor.java:183)
        at
org.apache.tez.runtime.LogicalIOProcessorRuntimeTask.run(LogicalIOProcessorRuntimeTask.java:324)
        at
org.apache.tez.runtime.task.TezTaskRunner$TaskRunnerCallable$1.run(TezTaskRunner.java:176)
        at
org.apache.tez.runtime.task.TezTaskRunner$TaskRunnerCallable$1.run(TezTaskRunner.java:168)
        at java.security.AccessController.doPrivileged(Native Method)
        at javax.security.auth.Subject.doAs(Subject.java:415)
        at
org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1628)
        at
org.apache.tez.runtime.task.TezTaskRunner$TaskRunnerCallable.call(TezTaskRunner.java:168)
        at
org.apache.tez.runtime.task.TezTaskRunner$TaskRunnerCallable.call(TezTaskRunner.java:163)
        at java.util.concurrent.FutureTask.run(FutureTask.java:262)
        at
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
        at
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
        at java.lang.Thread.run(Thread.java:745)
], TaskAttempt 1 failed, info=[Error: Failure while running
task:org.apache.pig.backend.executionengine.ExecException: ERROR 0: Input
from vertex scope-108 is missing
        at
org.apache.pig.backend.hadoop.executionengine.tez.plan.udf.ReadScalarsTez.attachInputs(ReadScalarsTez.java:77)
        at
org.apache.pig.backend.hadoop.executionengine.tez.runtime.PigProcessor.initializeInputs(PigProcessor.java:293)
        at
org.apache.pig.backend.hadoop.executionengine.tez.runtime.PigProcessor.run(PigProcessor.java:183)
        at
org.apache.tez.runtime.LogicalIOProcessorRuntimeTask.run(LogicalIOProcessorRuntimeTask.java:324)
        at
org.apache.tez.runtime.task.TezTaskRunner$TaskRunnerCallable$1.run(TezTaskRunner.java:176)
        at
org.apache.tez.runtime.task.TezTaskRunner$TaskRunnerCallable$1.run(TezTaskRunner.java:168)
        at java.security.AccessController.doPrivileged(Native Method)
        at javax.security.auth.Subject.doAs(Subject.java:415)
        at
org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1628)
        at
org.apache.tez.runtime.task.TezTaskRunner$TaskRunnerCallable.call(TezTaskRunner.java:168)
        at
org.apache.tez.runtime.task.TezTaskRunner$TaskRunnerCallable.call(TezTaskRunner.java:163)
        at java.util.concurrent.FutureTask.run(FutureTask.java:262)
        at
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
        at
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
        at java.lang.Thread.run(Thread.java:745)
], TaskAttempt 2 failed, info=[Error: Failure while running
task:org.apache.pig.backend.executionengine.ExecException: ERROR 0: Input
from vertex scope-108 is missing
        at
org.apache.pig.backend.hadoop.executionengine.tez.plan.udf.ReadScalarsTez.attachInputs(ReadScalarsTez.java:77)
        at
org.apache.pig.backend.hadoop.executionengine.tez.runtime.PigProcessor.initializeInputs(PigProcessor.java:293)
        at
org.apache.pig.backend.hadoop.executionengine.tez.runtime.PigProcessor.run(PigProcessor.java:183)
        at
org.apache.tez.runtime.LogicalIOProcessorRuntimeTask.run(LogicalIOProcessorRuntimeTask.java:324)
        at
org.apache.tez.runtime.task.TezTaskRunner$TaskRunnerCallable$1.run(TezTaskRunner.java:176)
        at
org.apache.tez.runtime.task.TezTaskRunner$TaskRunnerCallable$1.run(TezTaskRunner.java:168)
        at java.security.AccessController.doPrivileged(Native Method)
        at javax.security.auth.Subject.doAs(Subject.java:415)
        at
org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1628)
        at
org.apache.tez.runtime.task.TezTaskRunner$TaskRunnerCallable.call(TezTaskRunner.java:168)
        at
org.apache.tez.runtime.task.TezTaskRunner$TaskRunnerCallable.call(TezTaskRunner.java:163)
        at java.util.concurrent.FutureTask.run(FutureTask.java:262)
        at
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
        at
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
        at java.lang.Thread.run(Thread.java:745)
], TaskAttempt 3 failed, info=[Container
container_1427151599024_0195_01_000010 finished with diagnostics set to
[Container preempted internally]]], Vertex failed as one or more tasks
failed. failedTasks:1, Vertex vertex_1427151599024_0195_1_08 [scope-122]
killed/failed due to:null]
Vertex killed, vertexName=scope-129,
vertexId=vertex_1427151599024_0195_1_09, diagnostics=[Vertex received Kill
while in RUNNING state., Vertex killed as other vertex failed.
failedTasks:0, Vertex vertex_1427151599024_0195_1_09 [scope-129]
killed/failed due to:null]
Vertex killed, vertexName=scope-130,
vertexId=vertex_1427151599024_0195_1_10, diagnostics=[Vertex received Kill
while in RUNNING state., Vertex killed as other vertex failed.
failedTasks:0, Vertex vertex_1427151599024_0195_1_10 [scope-130]
killed/failed due to:null]
Vertex killed, vertexName=scope-128,
vertexId=vertex_1427151599024_0195_1_07, diagnostics=[Vertex received Kill
while in RUNNING state., Vertex killed as other vertex failed.
failedTasks:0, Vertex vertex_1427151599024_0195_1_07 [scope-128]
killed/failed due to:null]
Vertex killed, vertexName=scope-126,
vertexId=vertex_1427151599024_0195_1_05, diagnostics=[Vertex received Kill
while in RUNNING state., Vertex killed as other vertex failed.
failedTasks:0, Vertex vertex_1427151599024_0195_1_05 [scope-126]
killed/failed due to:null]
Vertex killed, vertexName=scope-125,
vertexId=vertex_1427151599024_0195_1_06, diagnostics=[Vertex received Kill
while in RUNNING state., Vertex killed as other vertex failed.
failedTasks:0, Vertex vertex_1427151599024_0195_1_06 [scope-125]
killed/failed due to:null]
Vertex killed, vertexName=scope-115,
vertexId=vertex_1427151599024_0195_1_04, diagnostics=[Vertex received Kill
while in RUNNING state., Vertex killed as other vertex failed.
failedTasks:0, Vertex vertex_1427151599024_0195_1_04 [scope-115]
killed/failed due to:null]
DAG failed due to vertex failure. failedVertices:1 killedVertices:6,
counters=Counters: 43
        org.apache.tez.common.counters.DAGCounter
                NUM_FAILED_TASKS=11
                NUM_KILLED_TASKS=36
                NUM_SUCCEEDED_TASKS=24
                TOTAL_LAUNCHED_TASKS=51
                OTHER_LOCAL_TASKS=1
                DATA_LOCAL_TASKS=21
        File System Counters
                FILE_BYTES_READ=187910630
                FILE_BYTES_WRITTEN=3859160973
                FILE_READ_OPS=0
                FILE_LARGE_READ_OPS=0
                FILE_WRITE_OPS=0
                HDFS_BYTES_READ=2619256965
                HDFS_BYTES_WRITTEN=0
                HDFS_READ_OPS=21
                HDFS_LARGE_READ_OPS=0
                HDFS_WRITE_OPS=0
        org.apache.tez.common.counters.TaskCounter
                SPILLED_RECORDS=45455352
                NUM_SHUFFLED_INPUTS=3
                NUM_FAILED_SHUFFLE_INPUTS=0
                GC_TIME_MILLIS=12756
                CPU_MILLISECONDS=1366940
                PHYSICAL_MEMORY_BYTES=15069728768
                VIRTUAL_MEMORY_BYTES=37774110720
                COMMITTED_HEAP_BYTES=18567659520
                INPUT_RECORDS_PROCESSED=36917387
                OUTPUT_RECORDS=83870869
                OUTPUT_LARGE_RECORDS=0
                OUTPUT_BYTES=8898320232
                OUTPUT_BYTES_WITH_OVERHEAD=8241464266
                OUTPUT_BYTES_PHYSICAL=3671993306
                ADDITIONAL_SPILLS_BYTES_WRITTEN=62138713
                ADDITIONAL_SPILLS_BYTES_READ=187158971
                ADDITIONAL_SPILL_COUNT=18
                SHUFFLE_BYTES=233406
                SHUFFLE_BYTES_DECOMPRESSED=493440
                SHUFFLE_BYTES_TO_MEM=233406
                SHUFFLE_BYTES_TO_DISK=0
                SHUFFLE_BYTES_DISK_DIRECT=0
                SHUFFLE_PHASE_TIME=689
                FIRST_EVENT_RECEIVED=228
                LAST_EVENT_RECEIVED=235
        org.apache.hadoop.mapreduce.TaskCounter
                COMBINE_INPUT_RECORDS=20829242
                COMBINE_OUTPUT_RECORDS=23476241
2015-03-27 08:49:07,439 [PigTezLauncher-0] INFO
 org.apache.hadoop.conf.Configuration.deprecation - fs.default.name is
deprecated. Instead, use fs.defaultFS
2015-03-27 08:49:07,597 [PigTezLauncher-0] INFO
 org.apache.pig.tools.pigstats.JobStats - using output size reader:
org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.FileBasedOutputSizeReader
2015-03-27 08:49:08,220 [main] INFO
 org.apache.pig.tools.pigstats.tez.TezPigScriptStats - Script Statistics:

       HadoopVersion: 2.5.0-cdh5.3.2
          PigVersion: 0.15.0-SNAPSHOT
          TezVersion: 0.6.0
              UserId: root
            FileName: foo.pig
           StartedAt: 2015-03-27 08:47:40
          FinishedAt: 2015-03-27 08:49:08
            Features: HASH_JOIN,GROUP_BY,FILTER,UNION

Failed!

DAG PigLatin:foo.pig-0_scope-0:
       ApplicationId: job_1427151599024_0195
  TotalLaunchedTasks: 51
       FileBytesRead: 187910630
    FileBytesWritten: 3859160973
       HdfsBytesRead: 2619256965
    HdfsBytesWritten: 0

Input(s):
Failed to read data from "pig.XXXX"
Successfully read 1000 records (171668 bytes) from: "pig.YYYY_sample"
Successfully read 13438146 records (483689478 bytes) from: "pig.ZZZZ"

Output(s):
Failed to produce result in "hdfs://XXXX.YYYY.com:8020/tmp/temp-498428122/tmp-576253744"

2015-03-27 08:49:08,302 [main] ERROR org.apache.pig.tools.grunt.Grunt -
ERROR 1066: Unable to open iterator for alias cntallclaims_patient
Details at logfile: /root/PIG/pig_1427460455726.log
2015-03-27 08:49:08,322 [main] INFO  org.apache.pig.Main - Pig script
completed in 1 minute, 32 seconds and 719 milliseconds (92719 ms)
2015-03-27 08:49:08,344 [main] INFO
 org.apache.pig.backend.hadoop.executionengine.tez.TezLauncher - Shutting
down thread pool
2015-03-27 08:49:08,394 [Thread-37] INFO
 org.apache.pig.backend.hadoop.executionengine.tez.TezSessionManager -
Shutting down Tez session org.apache.tez.client.TezClient@37bdce24
2015-03-27 08:49:08,395 [Thread-37] INFO  org.apache.tez.client.TezClient -
Shutting down Tez Session, sessionName=PigLatin:foo.pig,
applicationId=application_1427151599024_0195
[root@ludwig PIG]#
