Randy Wallace created PIG-4322:
----------------------------------
Summary: Enabling SchemaTuple Feature Results in failed jobs
Key: PIG-4322
URL: https://issues.apache.org/jira/browse/PIG-4322
Project: Pig
Issue Type: Bug
Components: grunt, tools
Affects Versions: 0.12.0
Environment: Amazon AWS Elastic Mapreduce AMI 3.2.1
* Amazon 2.4.0
* Pig 0.12.0
* 1x m1.large Master, 40x m1.large Core, 20x m1.large Task
Reporter: Randy Wallace
Priority: Minor
My jobs fail with the following stack trace:
{code}
Error: java.io.FileNotFoundException: SchemaTuple_21$1.class (No such file or directory)
    at java.io.FileInputStream.open(Native Method)
    at java.io.FileInputStream.<init>(FileInputStream.java:146)
    at org.apache.pig.data.SchemaTupleBackend.copyAllFromDistributedCache(SchemaTupleBackend.java:187)
    at org.apache.pig.data.SchemaTupleBackend.copyAndResolve(SchemaTupleBackend.java:160)
    at org.apache.pig.data.SchemaTupleBackend.initialize(SchemaTupleBackend.java:278)
    at org.apache.pig.data.SchemaTupleBackend.initialize(SchemaTupleBackend.java:268)
    at org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.PigGenericMapBase.setup(PigGenericMapBase.java:174)
    at org.apache.hadoop.mapreduce.Mapper.run(Mapper.java:142)
    at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:775)
    at org.apache.hadoop.mapred.MapTask.run(MapTask.java:341)
    at org.apache.hadoop.mapred.YarnChild$2.run(YarnChild.java:167)
    at java.security.AccessController.doPrivileged(Native Method)
    at javax.security.auth.Subject.doAs(Subject.java:415)
    at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1548)
    at org.apache.hadoop.mapred.YarnChild.main(YarnChild.java:162)
{code}
Here are the lines from the Pig logs that refer to SchemaTuple_21:
{code}
2014-11-11 11:50:00,275 [main] INFO org.apache.pig.data.SchemaTupleClassGenerator - Compiling class SchemaTuple_21 for Schema: {{(long,long,int,datetime,long,chararray)}}, and appendability: false
2014-11-11 11:50:00,514 [main] INFO org.apache.pig.data.SchemaTupleClassGenerator - Successfully compiled class: SchemaTuple_21
2014-11-11 11:50:02,470 [main] INFO org.apache.pig.data.SchemaTupleFrontend - File successfully added to the distributed cache: SchemaTuple_21.class
2014-11-11 11:50:02,551 [main] INFO org.apache.pig.data.SchemaTupleFrontend - File successfully added to the distributed cache: SchemaTuple_21$1.class
2014-11-11 11:50:07,378 [main] INFO org.apache.pig.data.SchemaTupleFrontend - Setting key [pig.schematuple.classes] with classes to deserialize [SchemaTuple_16.class,SchemaTuple_12.class,SchemaTuple_2.class,SchemaTuple_51.class,SchemaTuple_20.class,SchemaTuple_21.class,SchemaTuple_26.class,SchemaTuple_39.class,SchemaTuple_21$1.class,SchemaTuple_19.class,SchemaTuple_53.class,SchemaTuple_10.class,SchemaTuple_27.class,SchemaTuple_25$1.class,SchemaTuple_5.class,SchemaTuple_35$1.class,SchemaTuple_45.class,SchemaTuple_32$1.class,SchemaTuple_50.class,SchemaTuple_33.class,SchemaTuple_64.class,SchemaTuple_54$1.class,SchemaTuple_57.class,SchemaTuple_57$1.class,SchemaTuple_47$1.class,SchemaTuple_35.class,SchemaTuple_7$1.class,SchemaTuple_56.class,SchemaTuple_29.class,SchemaTuple_52$1.class,SchemaTuple_40$1.class,SchemaTuple_55$1.class,SchemaTuple_48$1.class,SchemaTuple_61$1.class,SchemaTuple_0.class,SchemaTuple_46$1.class,SchemaTuple_2$1.class,SchemaTuple_3.class,SchemaTuple_15$1.class,SchemaTuple_28$1.class,SchemaTuple_49$1.class,SchemaTuple_16$1.class,SchemaTuple_60.class,SchemaTuple_7.class,SchemaTuple_9.class,SchemaTuple_44$1.class,SchemaTuple_11$1.class,SchemaTuple_52.class,SchemaTuple_1$1.class,SchemaTuple_13$1.class,SchemaTuple_19$1.class,SchemaTuple_9$1.class,SchemaTuple_56$1.class,SchemaTuple_17$1.class,SchemaTuple_72$1.class,SchemaTuple_25.class,SchemaTuple_55.class,SchemaTuple_30.class,SchemaTuple_69.class,SchemaTuple_62$1.class,SchemaTuple_71$1.class,SchemaTuple_41.class,SchemaTuple_68$1.class,SchemaTuple_72.class,SchemaTuple_49.class,SchemaTuple_26$1.class,SchemaTuple_69$1.class,SchemaTuple_3$1.class,SchemaTuple_65$1.class,SchemaTuple_61.class,SchemaTuple_30$1.class,SchemaTuple_59$1.class,SchemaTuple_66$1.class,SchemaTuple_20$1.class,SchemaTuple_53$1.class,SchemaTuple_24.class,SchemaTuple_70.class,SchemaTuple_66.class,SchemaTuple_60$1.class,SchemaTuple_42.class,SchemaTuple_59.class,SchemaTuple_40.class,SchemaTuple_47.class,SchemaTuple_63.class,SchemaTuple_67.class,SchemaTuple_36$1.class,SchemaTuple_50$1.class,SchemaTuple_71.class,SchemaTuple_38$1.class,SchemaTuple_58$1.class,SchemaTuple_51$1.class,SchemaTuple_41$1.class,SchemaTuple_64$1.class,SchemaTuple_58.class,SchemaTuple_43.class,SchemaTuple_44.class,SchemaTuple_28.class,SchemaTuple_13.class,SchemaTuple_63$1.class,SchemaTuple_29$1.class,SchemaTuple_37.class,SchemaTuple_37$1.class,SchemaTuple_6.class,SchemaTuple_31.class,SchemaTuple_4$1.class,SchemaTuple_68.class,SchemaTuple_14.class,SchemaTuple_32.class,SchemaTuple_14$1.class,SchemaTuple_62.class,SchemaTuple_18$1.class,SchemaTuple_65.class,SchemaTuple_38.class,SchemaTuple_42$1.class,SchemaTuple_33$1.class,SchemaTuple_4.class,SchemaTuple_34$1.class,SchemaTuple_23$1.class,SchemaTuple_34.class,SchemaTuple_6$1.class,SchemaTuple_1.class,SchemaTuple_39$1.class,SchemaTuple_23.class,SchemaTuple_12$1.class,SchemaTuple_17.class,SchemaTuple_8$1.class,SchemaTuple_10$1.class,SchemaTuple_31$1.class,SchemaTuple_67$1.class,SchemaTuple_11.class,SchemaTuple_22.class,SchemaTuple_45$1.class,SchemaTuple_15.class,SchemaTuple_0$1.class,SchemaTuple_24$1.class,SchemaTuple_36.class,SchemaTuple_43$1.class,SchemaTuple_18.class,SchemaTuple_70$1.class,SchemaTuple_46.class,SchemaTuple_54.class,SchemaTuple_22$1.class,SchemaTuple_5$1.class,SchemaTuple_27$1.class,SchemaTuple_48.class,SchemaTuple_8.class]
{code}
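One way to cross-check a complete (ungrepped) Pig client log is to compare the class files reported as added to the distributed cache with the class files listed under {{pig.schematuple.classes}}. The following is a minimal sketch in Python, not part of Pig: the function name {{check_schematuple_log}} is hypothetical, and the two regular expressions assume the message wording seen in the log excerpt above. It tolerates hard-wrapped log text, including a class name split across lines.

```python
import re

# Patterns assume the message wording in the log excerpt; they are not a Pig API.
ADDED_RE = re.compile(
    r"File successfully added to the distributed cache:\s*(\S+\.class)"
)
KEY_RE = re.compile(
    r"Setting key \[pig\.schematuple\.classes\] with classes to deserialize"
    r"\s*\[([^\]]*)\]"
)


def check_schematuple_log(log_text):
    """Return (registered_but_not_added, added_but_not_registered).

    Both results are sorted lists of class file names.
    """
    # Collapse line wraps so a message split across lines matches as one.
    normalized = " ".join(log_text.split())

    added = set(ADDED_RE.findall(normalized))

    registered = set()
    for class_list in KEY_RE.findall(normalized):
        # Drop any whitespace left by a wrap inside a class name.
        joined = "".join(class_list.split())
        registered.update(name for name in joined.split(",") if name)

    return sorted(registered - added), sorted(added - registered)
```

Run against the full client log (for example, {{check_schematuple_log(open(path).read())}} with a hypothetical {{path}}), an empty first list would suggest that every registered class was at least reported as uploaded, which would point the investigation at the task side rather than the frontend.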
I can't find anywhere in my script (~1k lines) where the schema
{{(long,long,int,datetime,long,chararray)}} referred to above could arise,
including within a nested foreach, after a join, or as part of a group by,
although I do use all of those data types. Apart from the stack trace thrown by
the map tasks, which causes everything to fail, the Pig logging looks good, so
I don't know what else I can provide that would help.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)