This looks like PIG-3985-
https://issues.apache.org/jira/browse/PIG-3985


On Tue, Oct 7, 2014 at 3:35 PM, Matt Bossenbroek <
mbossenbr...@netflix.com.invalid> wrote:

> Played around with this some more. Got some interesting results.
>
> It turns out that having two STORE commands in the script is what is
> causing it to fail. If I comment out either of them, the script will run
> and produce the other result. Because of that, we know that the code used
> in both of those paths is ok.
>
> Also, if I copy the script & run it from the grunt prompt, both outputs
> work fine. I imagine this is because the prompt runs one output at a time.
>
> I'd say at this point this looks like a bug in pig. Especially with the
> NPE in the stack trace, I'd say this is not expected.
>
> From the line numbers in the version of pig that I'm running, it appears
> to be this bit of code (line 789). It looks like operationID is not present
> in the globalCounters map, and thus when you call iterator() you get the
> NPE.
>
>
>
> 787 (
> http://grepcode.com/file/repo1.maven.org/maven2/org.apache.pig/pig/0.11.1/org/apache/pig/backend/hadoop/executionengine/mapReduceLayer/JobControlCompiler.java?av=f#787
> )
>
>
>                  while(operationIDs.hasNext()) {
>
>
> 788 (
> http://grepcode.com/file/repo1.maven.org/maven2/org.apache.pig/pig/0.11.1/org/apache/pig/backend/hadoop/executionengine/mapReduceLayer/JobControlCompiler.java?av=f#788
> )
>
>
>                      String operationID = operationIDs.next();
>
>
> 789 (
> http://grepcode.com/file/repo1.maven.org/maven2/org.apache.pig/pig/0.11.1/org/apache/pig/backend/hadoop/executionengine/mapReduceLayer/JobControlCompiler.java?av=f#789
> )
>
>
>                      Iterator<Pair<String, Long>> itPairs =
> globalCounters.get(operationID).iterator();
>
>
> 790 (
> http://grepcode.com/file/repo1.maven.org/maven2/org.apache.pig/pig/0.11.1/org/apache/pig/backend/hadoop/executionengine/mapReduceLayer/JobControlCompiler.java?av=f#790
> )
>
>
>                      Pair<String,Long> pair = null;
>
>
> 791 (
> http://grepcode.com/file/repo1.maven.org/maven2/org.apache.pig/pig/0.11.1/org/apache/pig/backend/hadoop/executionengine/mapReduceLayer/JobControlCompiler.java?av=f#791
> )
>
>
>                      while(itPairs.hasNext()) {
>
>
> 792 (
> http://grepcode.com/file/repo1.maven.org/maven2/org.apache.pig/pig/0.11.1/org/apache/pig/backend/hadoop/executionengine/mapReduceLayer/JobControlCompiler.java?av=f#792
> )
>
>
>                          pair = itPairs.next();
>
>
> 793 (
> http://grepcode.com/file/repo1.maven.org/maven2/org.apache.pig/pig/0.11.1/org/apache/pig/backend/hadoop/executionengine/mapReduceLayer/JobControlCompiler.java?av=f#793
> )
>
>
>                          conf.setLong(pair.first, pair.second);
>
>
> 794 (
> http://grepcode.com/file/repo1.maven.org/maven2/org.apache.pig/pig/0.11.1/org/apache/pig/backend/hadoop/executionengine/mapReduceLayer/JobControlCompiler.java?av=f#794
> )
>
>
>                      }
>
>
> 795 (
> http://grepcode.com/file/repo1.maven.org/maven2/org.apache.pig/pig/0.11.1/org/apache/pig/backend/hadoop/executionengine/mapReduceLayer/JobControlCompiler.java?av=f#795
> )
>
>
>                  }
>
>
> 796 (
> http://grepcode.com/file/repo1.maven.org/maven2/org.apache.pig/pig/0.11.1/org/apache/pig/backend/hadoop/executionengine/mapReduceLayer/JobControlCompiler.java?av=f#796
> )
>
>
>
>
>
>
> -Matt
>
>
> On Monday, October 6, 2014 at 8:54 PM, Sunil S Nandihalli wrote:
>
> > The input file-directory tarred and gzipped is here (
> https://transfer.sh/Nmnkk/rawlogs.tgz) . The Jar file which contains all
> the udfs is here (https://transfer.sh/JpSKg/pigpen.jar)
> >
> > On Tue, Oct 7, 2014 at 9:07 AM, Sunil S Nandihalli <
> sunil.nandiha...@gmail.com (mailto:sunil.nandiha...@gmail.com)> wrote:
> > > Hi Everybody,
> > >  The pig script mba.pig (https://gist.github.com/97073ae7bf16d8be5532)
> is giving me the following error when run. This is a PigPen generated
> script. the log (https://gist.github.com/228a84351440f7b15e62) is here (
> https://gist.github.com/228a84351440f7b15e62). The last few lines of the
> stdout is
> > >
> > > 2014-10-07 03:18:14,252 [LocalJobRunner Map Task Executor #0] INFO
> org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.PigRecordReader
> - Current split being processed
> file:/tmp/temp-923128527/tmp204410789/part-r-00000:0+0
> > > 2014-10-07 03:18:14,259 [LocalJobRunner Map Task Executor #0] WARN
> org.apache.pig.data.SchemaTupleBackend - SchemaTupleBackend has already
> been initialized
> > > 2014-10-07 03:18:14,281 [LocalJobRunner Map Task Executor #0] INFO
> org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.PigMapOnly$Map
> - Aliases being processed per job phase (AliasName[line,offset]): M:
> generate6660[329,15],union6387[332,12],generate6661[336,15],generate6662[341,15],generate6663[349,15]
> C:  R:
> > > 2014-10-07 03:18:14,291 [LocalJobRunner Map Task Executor #0] INFO
> org.apache.hadoop.mapred.LocalJobRunner -
> > > 2014-10-07 03:18:14,291 [LocalJobRunner Map Task Executor #0] INFO
> org.apache.hadoop.mapred.Task - Task:attempt_local710497996_0012_m_000001_0
> is done. And is in the process of committing
> > > 2014-10-07 03:18:14,294 [LocalJobRunner Map Task Executor #0] INFO
> org.apache.hadoop.mapred.LocalJobRunner -
> > > 2014-10-07 03:18:14,294 [LocalJobRunner Map Task Executor #0] INFO
> org.apache.hadoop.mapred.Task - Task attempt_local710497996_0012_m_000001_0
> is allowed to commit now
> > > 2014-10-07 03:18:14,296 [LocalJobRunner Map Task Executor #0] INFO
> org.apache.hadoop.mapreduce.lib.output.FileOutputCommitter - Saved output
> of task 'attempt_local710497996_0012_m_000001_0' to
> file:/home/hdfs/sunil/mobster-knowledge-clj/hadoop-repl/sunil/output/mba/app-install.clj/_temporary/0/task_local710497996_0012_m_000001
> > > 2014-10-07 03:18:14,298 [LocalJobRunner Map Task Executor #0] INFO
> org.apache.hadoop.mapreduce.lib.output.FileOutputCommitter - Saved output
> of task 'attempt_local710497996_0012_m_000001_0' to
> file:/tmp/temp-923128527/tmp927324561/_temporary/0/task_local710497996_0012_m_000001
> > > 2014-10-07 03:18:14,299 [LocalJobRunner Map Task Executor #0] INFO
> org.apache.hadoop.mapred.LocalJobRunner - map
> > > 2014-10-07 03:18:14,299 [LocalJobRunner Map Task Executor #0] INFO
> org.apache.hadoop.mapred.Task - Task
> 'attempt_local710497996_0012_m_000001_0' done.
> > > 2014-10-07 03:18:14,299 [LocalJobRunner Map Task Executor #0] INFO
> org.apache.hadoop.mapred.LocalJobRunner - Finishing task:
> attempt_local710497996_0012_m_000001_0
> > > 2014-10-07 03:18:14,299 [Thread-147] INFO
> org.apache.hadoop.mapred.LocalJobRunner - map task executor complete.
> > > 2014-10-07 03:18:14,722 [main] INFO
> org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher
> - 63% complete
> > > 2014-10-07 03:18:14,724 [main] WARN
> org.apache.pig.tools.pigstats.PigStatsUtil - Failed to get RunningJob for
> job job_local710497996_0012
> > > 2014-10-07 03:18:14,728 [main] INFO
> org.apache.pig.tools.pigstats.JobStats - using output size reader:
> org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.FileBasedOutputSizeReader
> > > 2014-10-07 03:18:14,731 [main] INFO
> org.apache.pig.tools.pigstats.ScriptState - Pig script settings are added
> to the job
> > > 2014-10-07 03:20:16,529 [main] INFO
> org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.JobControlCompiler
> - mapred.job.reduce.markreset.buffer.percent is not set, set to default 0.3
> > > 2014-10-07 03:20:16,534 [main] INFO
> org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.JobControlCompiler
> - Reduce phase detected, estimating # of required reducers.
> > > 2014-10-07 03:20:16,535 [main] INFO
> org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.JobControlCompiler
> - Setting Parallelism to 1
> > > 2014-10-07 03:20:16,547 [main] INFO
> org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.JobControlCompiler
> - Setting up multi store job
> > > 2014-10-07 03:20:16,561 [main] ERROR org.apache.pig.tools.grunt.Grunt
> - ERROR 2017: Internal error creating job configuration.
> > >
> > >
> > > Can somebody help me figure out what is happening.
> > > Thanks,
> > > Sunil.
> > >
> > >
> >
> >
> >
> > --
> > You received this message because you are subscribed to the Google
> Groups "PigPen Support" group.
> > To unsubscribe from this group and stop receiving emails from it, send
> an email to pigpen-support+unsubscr...@googlegroups.com (mailto:
> pigpen-support+unsubscr...@googlegroups.com).
> > For more options, visit https://groups.google.com/d/optout.
>
>

Reply via email to