Yep - I would agree with that. It says it's fixed in pig 14 - is there an expected release date for that version?
Thanks Cheolsoo -Matt On Tuesday, October 7, 2014 at 7:55 PM, Cheolsoo Park wrote: > This looks like PIG-3985- > https://issues.apache.org/jira/browse/PIG-3985 > > > On Tue, Oct 7, 2014 at 3:35 PM, Matt Bossenbroek > <mbossenbr...@netflix.com.invalid (mailto:mbossenbr...@netflix.com.invalid)> > wrote: > > Played around with this some more. Got some interesting results. > > > > It turns out that having two STORE commands in the script is what is > > causing it to fail. If I comment out either of them, the script will run > > and produce the other result. Because of that, we know that the code used > > in both of those paths is ok. > > > > Also, if I copy the script & run it from the grunt prompt, both outputs > > work fine. I imagine this is because the prompt runs one output at a time. > > > > I'd say at this point this looks like a bug in pig. Especially with the NPE > > in the stack trace, I'd say this is not expected. > > > > From the line numbers in the version of pig that I'm running, it appears to > > be this bit of code (line 789). It looks like operationID is not present in > > the globalCounters map, and thus when you call iterator() you get the NPE. > > > > > > > > 787 > > (http://grepcode.com/file/repo1.maven.org/maven2/org.apache.pig/pig/0.11.1/org/apache/pig/backend/hadoop/executionengine/mapReduceLayer/JobControlCompiler.java?av=f#787) > > > > > > while(operationIDs.hasNext()) { > > > > > > 788 > > (http://grepcode.com/file/repo1.maven.org/maven2/org.apache.pig/pig/0.11.1/org/apache/pig/backend/hadoop/executionengine/mapReduceLayer/JobControlCompiler.java?av=f#788) > > > > > > String operationID = operationIDs.next(); > > > > > > 789 > > (http://grepcode.com/file/repo1.maven.org/maven2/org.apache.pig/pig/0.11.1/org/apache/pig/backend/hadoop/executionengine/mapReduceLayer/JobControlCompiler.java?av=f#789) > > > > > > Iterator<Pair<String, Long>> itPairs = > > globalCounters.get(operationID).iterator(); > > > > > > 790 > > (http://grepcode.com/file/repo1.maven.org/maven2/org.apache.pig/pig/0.11.1/org/apache/pig/backend/hadoop/executionengine/mapReduceLayer/JobControlCompiler.java?av=f#790) > > > > > > Pair<String,Long> pair = null; > > > > > > 791 > > (http://grepcode.com/file/repo1.maven.org/maven2/org.apache.pig/pig/0.11.1/org/apache/pig/backend/hadoop/executionengine/mapReduceLayer/JobControlCompiler.java?av=f#791) > > > > > > while(itPairs.hasNext()) { > > > > > > 792 > > (http://grepcode.com/file/repo1.maven.org/maven2/org.apache.pig/pig/0.11.1/org/apache/pig/backend/hadoop/executionengine/mapReduceLayer/JobControlCompiler.java?av=f#792) > > > > > > pair = itPairs.next(); > > > > > > 793 > > (http://grepcode.com/file/repo1.maven.org/maven2/org.apache.pig/pig/0.11.1/org/apache/pig/backend/hadoop/executionengine/mapReduceLayer/JobControlCompiler.java?av=f#793) > > > > > > conf.setLong(pair.first, pair.second); > > > > > > 794 > > (http://grepcode.com/file/repo1.maven.org/maven2/org.apache.pig/pig/0.11.1/org/apache/pig/backend/hadoop/executionengine/mapReduceLayer/JobControlCompiler.java?av=f#794) > > > > > > } > > > > > > 795 > > (http://grepcode.com/file/repo1.maven.org/maven2/org.apache.pig/pig/0.11.1/org/apache/pig/backend/hadoop/executionengine/mapReduceLayer/JobControlCompiler.java?av=f#795) > > > > > > } > > > > > > 796 > > (http://grepcode.com/file/repo1.maven.org/maven2/org.apache.pig/pig/0.11.1/org/apache/pig/backend/hadoop/executionengine/mapReduceLayer/JobControlCompiler.java?av=f#796) > > > > > > > > > > > > > > -Matt > > > > > > On Monday, October 6, 2014 at 8:54 PM, Sunil S Nandihalli wrote: > > > > > The input file-directory tarred and gzipped is here > > > (https://transfer.sh/Nmnkk/rawlogs.tgz) . The Jar file which contains all > > > the udfs is here (https://transfer.sh/JpSKg/pigpen.jar) > > > > > > On Tue, Oct 7, 2014 at 9:07 AM, Sunil S Nandihalli > > > <sunil.nandiha...@gmail.com (mailto:sunil.nandiha...@gmail.com) > > > (mailto:sunil.nandiha...@gmail.com)> wrote: > > > > Hi Everybody, > > > > The pig script mba.pig (https://gist.github.com/97073ae7bf16d8be5532) > > > > is giving me the following error when run. This is a PigPen generated > > > > script. the log (https://gist.github.com/228a84351440f7b15e62) is here > > > > (https://gist.github.com/228a84351440f7b15e62). The last few lines of > > > > the stdout is > > > > > > > > 2014-10-07 03:18:14,252 [LocalJobRunner Map Task Executor #0] INFO > > > > org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.PigRecordReader > > > > - Current split being processed > > > > file:/tmp/temp-923128527/tmp204410789/part-r-00000:0+0 > > > > 2014-10-07 03:18:14,259 [LocalJobRunner Map Task Executor #0] WARN > > > > org.apache.pig.data.SchemaTupleBackend - SchemaTupleBackend has already > > > > been initialized > > > > 2014-10-07 03:18:14,281 [LocalJobRunner Map Task Executor #0] INFO > > > > org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.PigMapOnly$Map > > > > - Aliases being processed per job phase (AliasName[line,offset]): M: > > > > generate6660[329,15],union6387[332,12],generate6661[336,15],generate6662[341,15],generate6663[349,15] > > > > C: R: > > > > 2014-10-07 03:18:14,291 [LocalJobRunner Map Task Executor #0] INFO > > > > org.apache.hadoop.mapred.LocalJobRunner - > > > > 2014-10-07 03:18:14,291 [LocalJobRunner Map Task Executor #0] INFO > > > > org.apache.hadoop.mapred.Task - > > > > Task:attempt_local710497996_0012_m_000001_0 is done. And is in the > > > > process of committing > > > > 2014-10-07 03:18:14,294 [LocalJobRunner Map Task Executor #0] INFO > > > > org.apache.hadoop.mapred.LocalJobRunner - > > > > 2014-10-07 03:18:14,294 [LocalJobRunner Map Task Executor #0] INFO > > > > org.apache.hadoop.mapred.Task - Task > > > > attempt_local710497996_0012_m_000001_0 is allowed to commit now > > > > 2014-10-07 03:18:14,296 [LocalJobRunner Map Task Executor #0] INFO > > > > org.apache.hadoop.mapreduce.lib.output.FileOutputCommitter - Saved > > > > output of task 'attempt_local710497996_0012_m_000001_0' to > > > > file:/home/hdfs/sunil/mobster-knowledge-clj/hadoop-repl/sunil/output/mba/app-install.clj/_temporary/0/task_local710497996_0012_m_000001 > > > > 2014-10-07 03:18:14,298 [LocalJobRunner Map Task Executor #0] INFO > > > > org.apache.hadoop.mapreduce.lib.output.FileOutputCommitter - Saved > > > > output of task 'attempt_local710497996_0012_m_000001_0' to > > > > file:/tmp/temp-923128527/tmp927324561/_temporary/0/task_local710497996_0012_m_000001 > > > > 2014-10-07 03:18:14,299 [LocalJobRunner Map Task Executor #0] INFO > > > > org.apache.hadoop.mapred.LocalJobRunner - map > > > > 2014-10-07 03:18:14,299 [LocalJobRunner Map Task Executor #0] INFO > > > > org.apache.hadoop.mapred.Task - Task > > > > 'attempt_local710497996_0012_m_000001_0' done. > > > > 2014-10-07 03:18:14,299 [LocalJobRunner Map Task Executor #0] INFO > > > > org.apache.hadoop.mapred.LocalJobRunner - Finishing task: > > > > attempt_local710497996_0012_m_000001_0 > > > > 2014-10-07 03:18:14,299 [Thread-147] INFO > > > > org.apache.hadoop.mapred.LocalJobRunner - map task executor complete. > > > > 2014-10-07 03:18:14,722 [main] INFO > > > > org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher > > > > - 63% complete > > > > 2014-10-07 03:18:14,724 [main] WARN > > > > org.apache.pig.tools.pigstats.PigStatsUtil - Failed to get RunningJob > > > > for job job_local710497996_0012 > > > > 2014-10-07 03:18:14,728 [main] INFO > > > > org.apache.pig.tools.pigstats.JobStats - using output size reader: > > > > org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.FileBasedOutputSizeReader > > > > 2014-10-07 03:18:14,731 [main] INFO > > > > org.apache.pig.tools.pigstats.ScriptState - Pig script settings are > > > > added to the job > > > > 2014-10-07 03:20:16,529 [main] INFO > > > > org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.JobControlCompiler > > > > - mapred.job.reduce.markreset.buffer.percent is not set, set to > > > > default 0.3 > > > > 2014-10-07 03:20:16,534 [main] INFO > > > > org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.JobControlCompiler > > > > - Reduce phase detected, estimating # of required reducers. > > > > 2014-10-07 03:20:16,535 [main] INFO > > > > org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.JobControlCompiler > > > > - Setting Parallelism to 1 > > > > 2014-10-07 03:20:16,547 [main] INFO > > > > org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.JobControlCompiler > > > > - Setting up multi store job > > > > 2014-10-07 03:20:16,561 [main] ERROR org.apache.pig.tools.grunt.Grunt - > > > > ERROR 2017: Internal error creating job configuration. > > > > > > > > > > > > Can somebody help me figure out what is happening. > > > > Thanks, > > > > Sunil. > > > > > > > > > > > > > > > > > > > > -- > > > You received this message because you are subscribed to the Google Groups > > > "PigPen Support" group. > > > To unsubscribe from this group and stop receiving emails from it, send an > > > email to pigpen-support+unsubscr...@googlegroups.com > > > (mailto:pigpen-support%2bunsubscr...@googlegroups.com) > > > (mailto:pigpen-support+unsubscr...@googlegroups.com > > > (mailto:pigpen-support%2bunsubscr...@googlegroups.com)). > > > For more options, visit https://groups.google.com/d/optout. > > > > -- > You received this message because you are subscribed to the Google Groups > "PigPen Support" group. > To unsubscribe from this group and stop receiving emails from it, send an > email to pigpen-support+unsubscr...@googlegroups.com > (mailto:pigpen-support+unsubscr...@googlegroups.com). > For more options, visit https://groups.google.com/d/optout.