The input file-directory tarred and gzipped is here <https://transfer.sh/Nmnkk/rawlogs.tgz> . The Jar file which contains all the udfs is here <https://transfer.sh/JpSKg/pigpen.jar>
On Tue, Oct 7, 2014 at 9:07 AM, Sunil S Nandihalli < sunil.nandiha...@gmail.com> wrote: > Hi Everybody, > The pig script mba.pig <https://gist.github.com/97073ae7bf16d8be5532> is > giving me the following error when run. This is a PigPen generated script. > the log <https://gist.github.com/228a84351440f7b15e62> is here > <https://gist.github.com/228a84351440f7b15e62>. The last few lines of the > stdout is > > 2014-10-07 03:18:14,252 [LocalJobRunner Map Task Executor #0] INFO > org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.PigRecordReader > - Current split being processed > file:/tmp/temp-923128527/tmp204410789/part-r-00000:0+0 > 2014-10-07 03:18:14,259 [LocalJobRunner Map Task Executor #0] WARN > org.apache.pig.data.SchemaTupleBackend - SchemaTupleBackend has already > been initialized > 2014-10-07 03:18:14,281 [LocalJobRunner Map Task Executor #0] INFO > org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.PigMapOnly$Map > - Aliases being processed per job phase (AliasName[line,offset]): M: > generate6660[329,15],union6387[332,12],generate6661[336,15],generate6662[341,15],generate6663[349,15] > C: R: > 2014-10-07 03:18:14,291 [LocalJobRunner Map Task Executor #0] INFO > org.apache.hadoop.mapred.LocalJobRunner - > 2014-10-07 03:18:14,291 [LocalJobRunner Map Task Executor #0] INFO > org.apache.hadoop.mapred.Task - > Task:attempt_local710497996_0012_m_000001_0 is done. And is in the process > of committing > 2014-10-07 03:18:14,294 [LocalJobRunner Map Task Executor #0] INFO > org.apache.hadoop.mapred.LocalJobRunner - > 2014-10-07 03:18:14,294 [LocalJobRunner Map Task Executor #0] INFO > org.apache.hadoop.mapred.Task - Task > attempt_local710497996_0012_m_000001_0 is allowed to commit now > 2014-10-07 03:18:14,296 [LocalJobRunner Map Task Executor #0] INFO > org.apache.hadoop.mapreduce.lib.output.FileOutputCommitter - Saved output > of task 'attempt_local710497996_0012_m_000001_0' to > file:/home/hdfs/sunil/mobster-knowledge-clj/hadoop-repl/sunil/output/mba/app-install.clj/_temporary/0/task_local710497996_0012_m_000001 > 2014-10-07 03:18:14,298 [LocalJobRunner Map Task Executor #0] INFO > org.apache.hadoop.mapreduce.lib.output.FileOutputCommitter - Saved output > of task 'attempt_local710497996_0012_m_000001_0' to > file:/tmp/temp-923128527/tmp927324561/_temporary/0/task_local710497996_0012_m_000001 > 2014-10-07 03:18:14,299 [LocalJobRunner Map Task Executor #0] INFO > org.apache.hadoop.mapred.LocalJobRunner - map > 2014-10-07 03:18:14,299 [LocalJobRunner Map Task Executor #0] INFO > org.apache.hadoop.mapred.Task - Task > 'attempt_local710497996_0012_m_000001_0' done. > 2014-10-07 03:18:14,299 [LocalJobRunner Map Task Executor #0] INFO > org.apache.hadoop.mapred.LocalJobRunner - Finishing task: > attempt_local710497996_0012_m_000001_0 > 2014-10-07 03:18:14,299 [Thread-147] INFO > org.apache.hadoop.mapred.LocalJobRunner - map task executor complete. > 2014-10-07 03:18:14,722 [main] INFO > > org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher > - 63% complete > 2014-10-07 03:18:14,724 [main] WARN > org.apache.pig.tools.pigstats.PigStatsUtil - Failed to get RunningJob for > job job_local710497996_0012 > 2014-10-07 03:18:14,728 [main] INFO > org.apache.pig.tools.pigstats.JobStats - using output size reader: > org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.FileBasedOutputSizeReader > 2014-10-07 03:18:14,731 [main] INFO > org.apache.pig.tools.pigstats.ScriptState - Pig script settings are added > to the job > 2014-10-07 03:20:16,529 [main] INFO > > org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.JobControlCompiler > - mapred.job.reduce.markreset.buffer.percent is not set, set to default 0.3 > 2014-10-07 03:20:16,534 [main] INFO > > org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.JobControlCompiler > - Reduce phase detected, estimating # of required reducers. > 2014-10-07 03:20:16,535 [main] INFO > > org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.JobControlCompiler > - Setting Parallelism to 1 > 2014-10-07 03:20:16,547 [main] INFO > > org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.JobControlCompiler > - Setting up multi store job > 2014-10-07 03:20:16,561 [main] ERROR org.apache.pig.tools.grunt.Grunt - > ERROR 2017: Internal error creating job configuration. > > Can somebody help me figure out what is happening. > Thanks, > Sunil. >