[ https://issues.apache.org/jira/browse/PIG-1211?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12864120#action_12864120 ]
Hadoop QA commented on PIG-1211:
--------------------------------

-1 overall.  Here are the results of testing the latest attachment
  http://issues.apache.org/jira/secure/attachment/12443635/PIG-1211.patch
  against trunk revision 941005.

    +1 @author.  The patch does not contain any @author tags.

    +1 tests included.  The patch appears to include 6 new or modified tests.

    +1 javadoc.  The javadoc tool did not generate any warning messages.

    +1 javac.  The applied patch does not increase the total number of javac compiler warnings.

    +1 findbugs.  The patch does not introduce any new Findbugs warnings.

    -1 release audit.  The applied patch generated 530 release audit warnings (more than the trunk's current 529 warnings).

    -1 core tests.  The patch failed core unit tests.

    +1 contrib tests.  The patch passed contrib unit tests.

Test results: http://hudson.zones.apache.org/hudson/job/Pig-Patch-h8.grid.sp2.yahoo.net/308/testReport/
Release audit warnings: http://hudson.zones.apache.org/hudson/job/Pig-Patch-h8.grid.sp2.yahoo.net/308/artifact/trunk/patchprocess/releaseAuditDiffWarnings.txt
Findbugs warnings: http://hudson.zones.apache.org/hudson/job/Pig-Patch-h8.grid.sp2.yahoo.net/308/artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
Console output: http://hudson.zones.apache.org/hudson/job/Pig-Patch-h8.grid.sp2.yahoo.net/308/console

This message is automatically generated.

> Pig script runs half way after which it reports syntax error
> ------------------------------------------------------------
>
>                 Key: PIG-1211
>                 URL: https://issues.apache.org/jira/browse/PIG-1211
>             Project: Pig
>          Issue Type: Improvement
>          Components: impl
>    Affects Versions: 0.6.0
>            Reporter: Viraj Bhat
>             Fix For: 0.8.0
>
>         Attachments: PIG-1211.patch
>
>
> I have a Pig script which is structured in the following way:
> {code}
> register cp.jar
> dataset = load '/data/dataset/' using PigStorage('\u0001') as (col1, col2, col3, col4, col5);
> filtered_dataset = filter dataset by (col1 == 1);
> proj_filtered_dataset = foreach filtered_dataset generate col2, col3;
> rmf $output1;
> store proj_filtered_dataset into '$output1' using PigStorage();
> second_stream = foreach filtered_dataset generate col2, col4, col5;
> group_second_stream = group second_stream by col4;
> output2 = foreach group_second_stream {
>  a = second_stream.col2;
>  b = distinct second_stream.col5;
>  c = order b by $0;
>  generate 1 as key, group as keyword, MYUDF(c, 100) as finalcalc;
> }
> rmf $output2;
> --syntax error here
> store output2 to '$output2' using PigStorage();
> {code}
> When I run this script with the multi-query option, it runs successfully until the
> first store but later fails with a syntax error.
> Using the HDFS command "rmf" causes the first store to execute.
> The only options I have are to run an explain before running this script:
> grunt> explain -script myscript.pig -out explain.out
> or to move the rmf statements to the top of the script.
> Here are some questions:
> a) Can we have an option to do something like "checkscript" instead of
> explain to get the same syntax error? That way I can ensure that I do not
> run for 3-4 hours before encountering a syntax error.
> b) Can Pig not figure out a way to re-order the rmf statements, since all the
> store directories are variables?
> Thanks
> Viraj

-- 
This message is automatically generated by JIRA.
- You can reply to this email to add a comment to the issue online.
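
For reference, a minimal sketch of the second workaround named in the issue description (moving the rmf statements to the top of the script). It assumes, as the report describes, that under multi-query an rmf forces any store statements already seen to execute; with both rmf calls placed first, no store should run before the rest of the script has been parsed. It also writes the last store with "into", the keyword the failing line had replaced with "to". cp.jar, MYUDF, $output1 and $output2 are the reporter's own names; nothing here is taken from the attached patch.

{code}
register cp.jar;

-- Clear both output directories up front, before any store is defined,
-- so that rmf has no pending store to trigger (assumption based on the report).
rmf $output1;
rmf $output2;

dataset = load '/data/dataset/' using PigStorage('\u0001') as (col1, col2, col3, col4, col5);
filtered_dataset = filter dataset by (col1 == 1);

-- First output: projection of the filtered data.
proj_filtered_dataset = foreach filtered_dataset generate col2, col3;
store proj_filtered_dataset into '$output1' using PigStorage();

-- Second output: per-col4 aggregation through the reporter's UDF.
second_stream = foreach filtered_dataset generate col2, col4, col5;
group_second_stream = group second_stream by col4;
output2 = foreach group_second_stream {
    b = distinct second_stream.col5;
    c = order b by $0;
    generate 1 as key, group as keyword, MYUDF(c, 100) as finalcalc;
}
store output2 into '$output2' using PigStorage();
{code}

The unused alias "a" from the original nested block is dropped here for brevity. Running the explain command quoted above against the script remains the way, per the report, to surface a syntax error without executing any store.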