[jira] Updated: (PIG-366) PigPen - Eclipse plugin for a graphical PigLatin editor

2010-09-16 Thread Robert Gibbon (JIRA)

 [ 
https://issues.apache.org/jira/browse/PIG-366?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Robert Gibbon updated PIG-366:
--

Attachment: (was: org.apache.pig.pigpen_0.7.4.jar)

 PigPen - Eclipse plugin for a graphical PigLatin editor
 ---

 Key: PIG-366
 URL: https://issues.apache.org/jira/browse/PIG-366
 Project: Pig
  Issue Type: New Feature
Reporter: Shubham Chopra
Assignee: Robert Gibbon
Priority: Minor
 Attachments: org.apache.pig.pigpen-0.7.0.tar.gz, 
 org.apache.pig.pigpen-0.7.2.tar.gz, org.apache.pig.pigpen-0.7.4.tar.gz, 
 org.apache.pig.pigpen_0.0.1.jar, org.apache.pig.pigpen_0.0.1.tgz, 
 org.apache.pig.pigpen_0.0.4.jar, org.apache.pig.pigpen_0.7.2.jar, 
 org.apache.pig.pigpen_0.7.4.jar, pigpen.patch, pigPen.patch, PigPen.tgz


 This is an Eclipse plugin that provides a GUI that can help users create 
 PigLatin scripts and see the example generator outputs on the fly and submit 
 the jobs to hadoop clusters.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Updated: (PIG-366) PigPen - Eclipse plugin for a graphical PigLatin editor

2010-09-16 Thread Robert Gibbon (JIRA)

 [ 
https://issues.apache.org/jira/browse/PIG-366?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Robert Gibbon updated PIG-366:
--

Attachment: org.apache.pig.pigpen_0.7.4.jar

 PigPen - Eclipse plugin for a graphical PigLatin editor
 ---

 Key: PIG-366
 URL: https://issues.apache.org/jira/browse/PIG-366
 Project: Pig
  Issue Type: New Feature
Reporter: Shubham Chopra
Assignee: Robert Gibbon
Priority: Minor
 Attachments: org.apache.pig.pigpen-0.7.0.tar.gz, 
 org.apache.pig.pigpen-0.7.2.tar.gz, org.apache.pig.pigpen-0.7.4.tar.gz, 
 org.apache.pig.pigpen_0.0.1.jar, org.apache.pig.pigpen_0.0.1.tgz, 
 org.apache.pig.pigpen_0.0.4.jar, org.apache.pig.pigpen_0.7.2.jar, 
 org.apache.pig.pigpen_0.7.4.jar, pigpen.patch, pigPen.patch, PigPen.tgz


 This is an Eclipse plugin that provides a GUI that can help users create 
 PigLatin scripts and see the example generator outputs on the fly and submit 
 the jobs to hadoop clusters.




[jira] Updated: (PIG-366) PigPen - Eclipse plugin for a graphical PigLatin editor

2010-09-16 Thread Robert Gibbon (JIRA)

 [ 
https://issues.apache.org/jira/browse/PIG-366?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Robert Gibbon updated PIG-366:
--

Attachment: org.apache.pig.pigpen-0.7.4.tar.gz

 PigPen - Eclipse plugin for a graphical PigLatin editor
 ---

 Key: PIG-366
 URL: https://issues.apache.org/jira/browse/PIG-366
 Project: Pig
  Issue Type: New Feature
Reporter: Shubham Chopra
Assignee: Robert Gibbon
Priority: Minor
 Attachments: org.apache.pig.pigpen-0.7.0.tar.gz, 
 org.apache.pig.pigpen-0.7.2.tar.gz, org.apache.pig.pigpen-0.7.4.tar.gz, 
 org.apache.pig.pigpen_0.0.1.jar, org.apache.pig.pigpen_0.0.1.tgz, 
 org.apache.pig.pigpen_0.0.4.jar, org.apache.pig.pigpen_0.7.2.jar, 
 org.apache.pig.pigpen_0.7.4.jar, pigpen.patch, pigPen.patch, PigPen.tgz


 This is an Eclipse plugin that provides a GUI that can help users create 
 PigLatin scripts and see the example generator outputs on the fly and submit 
 the jobs to hadoop clusters.




[jira] Updated: (PIG-366) PigPen - Eclipse plugin for a graphical PigLatin editor

2010-09-16 Thread Robert Gibbon (JIRA)

 [ 
https://issues.apache.org/jira/browse/PIG-366?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Robert Gibbon updated PIG-366:
--

Attachment: (was: org.apache.pig.pigpen-0.7.4.tar.gz)

 PigPen - Eclipse plugin for a graphical PigLatin editor
 ---

 Key: PIG-366
 URL: https://issues.apache.org/jira/browse/PIG-366
 Project: Pig
  Issue Type: New Feature
Reporter: Shubham Chopra
Assignee: Robert Gibbon
Priority: Minor
 Attachments: org.apache.pig.pigpen-0.7.0.tar.gz, 
 org.apache.pig.pigpen-0.7.2.tar.gz, org.apache.pig.pigpen-0.7.4.tar.gz, 
 org.apache.pig.pigpen_0.0.1.jar, org.apache.pig.pigpen_0.0.1.tgz, 
 org.apache.pig.pigpen_0.0.4.jar, org.apache.pig.pigpen_0.7.2.jar, 
 org.apache.pig.pigpen_0.7.4.jar, pigpen.patch, pigPen.patch, PigPen.tgz


 This is an Eclipse plugin that provides a GUI that can help users create 
 PigLatin scripts and see the example generator outputs on the fly and submit 
 the jobs to hadoop clusters.




[jira] Commented: (PIG-366) PigPen - Eclipse plugin for a graphical PigLatin editor

2010-09-16 Thread Robert Gibbon (JIRA)

[ 
https://issues.apache.org/jira/browse/PIG-366?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12910118#action_12910118
 ] 

Robert Gibbon commented on PIG-366:
---

Added support for Windows and environments where only a JRE has been installed 
(no need for JDK)

 PigPen - Eclipse plugin for a graphical PigLatin editor
 ---

 Key: PIG-366
 URL: https://issues.apache.org/jira/browse/PIG-366
 Project: Pig
  Issue Type: New Feature
Reporter: Shubham Chopra
Assignee: Robert Gibbon
Priority: Minor
 Attachments: org.apache.pig.pigpen-0.7.0.tar.gz, 
 org.apache.pig.pigpen-0.7.2.tar.gz, org.apache.pig.pigpen-0.7.4.tar.gz, 
 org.apache.pig.pigpen_0.0.1.jar, org.apache.pig.pigpen_0.0.1.tgz, 
 org.apache.pig.pigpen_0.0.4.jar, org.apache.pig.pigpen_0.7.2.jar, 
 org.apache.pig.pigpen_0.7.4.jar, pigpen.patch, pigPen.patch, PigPen.tgz


 This is an Eclipse plugin that provides a GUI that can help users create 
 PigLatin scripts and see the example generator outputs on the fly and submit 
 the jobs to hadoop clusters.




[jira] Updated: (PIG-366) PigPen - Eclipse plugin for a graphical PigLatin editor

2010-09-16 Thread Robert Gibbon (JIRA)

 [ 
https://issues.apache.org/jira/browse/PIG-366?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Robert Gibbon updated PIG-366:
--

Attachment: org.apache.pig.pigpen-0.7.4.tar.gz

 PigPen - Eclipse plugin for a graphical PigLatin editor
 ---

 Key: PIG-366
 URL: https://issues.apache.org/jira/browse/PIG-366
 Project: Pig
  Issue Type: New Feature
Reporter: Shubham Chopra
Assignee: Robert Gibbon
Priority: Minor
 Attachments: org.apache.pig.pigpen-0.7.0.tar.gz, 
 org.apache.pig.pigpen-0.7.2.tar.gz, org.apache.pig.pigpen-0.7.4.tar.gz, 
 org.apache.pig.pigpen_0.0.1.jar, org.apache.pig.pigpen_0.0.1.tgz, 
 org.apache.pig.pigpen_0.0.4.jar, org.apache.pig.pigpen_0.7.2.jar, 
 org.apache.pig.pigpen_0.7.4.jar, pigpen.patch, pigPen.patch, PigPen.tgz


 This is an Eclipse plugin that provides a GUI that can help users create 
 PigLatin scripts and see the example generator outputs on the fly and submit 
 the jobs to hadoop clusters.




[jira] Updated: (PIG-366) PigPen - Eclipse plugin for a graphical PigLatin editor

2010-09-16 Thread Robert Gibbon (JIRA)

 [ 
https://issues.apache.org/jira/browse/PIG-366?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Robert Gibbon updated PIG-366:
--

Attachment: (was: org.apache.pig.pigpen-0.7.4.tar.gz)

 PigPen - Eclipse plugin for a graphical PigLatin editor
 ---

 Key: PIG-366
 URL: https://issues.apache.org/jira/browse/PIG-366
 Project: Pig
  Issue Type: New Feature
Reporter: Shubham Chopra
Assignee: Robert Gibbon
Priority: Minor
 Attachments: org.apache.pig.pigpen-0.7.0.tar.gz, 
 org.apache.pig.pigpen-0.7.2.tar.gz, org.apache.pig.pigpen-0.7.4.tar.gz, 
 org.apache.pig.pigpen_0.0.1.jar, org.apache.pig.pigpen_0.0.1.tgz, 
 org.apache.pig.pigpen_0.0.4.jar, org.apache.pig.pigpen_0.7.2.jar, 
 org.apache.pig.pigpen_0.7.4.jar, pigpen.patch, pigPen.patch, PigPen.tgz


 This is an Eclipse plugin that provides a GUI that can help users create 
 PigLatin scripts and see the example generator outputs on the fly and submit 
 the jobs to hadoop clusters.




[jira] Updated: (PIG-366) PigPen - Eclipse plugin for a graphical PigLatin editor

2010-09-16 Thread Robert Gibbon (JIRA)

 [ 
https://issues.apache.org/jira/browse/PIG-366?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Robert Gibbon updated PIG-366:
--

Attachment: (was: org.apache.pig.pigpen_0.7.4.jar)

 PigPen - Eclipse plugin for a graphical PigLatin editor
 ---

 Key: PIG-366
 URL: https://issues.apache.org/jira/browse/PIG-366
 Project: Pig
  Issue Type: New Feature
Reporter: Shubham Chopra
Assignee: Robert Gibbon
Priority: Minor
 Attachments: org.apache.pig.pigpen-0.7.0.tar.gz, 
 org.apache.pig.pigpen-0.7.2.tar.gz, org.apache.pig.pigpen-0.7.4.tar.gz, 
 org.apache.pig.pigpen_0.0.1.jar, org.apache.pig.pigpen_0.0.1.tgz, 
 org.apache.pig.pigpen_0.0.4.jar, org.apache.pig.pigpen_0.7.2.jar, 
 org.apache.pig.pigpen_0.7.4.jar, pigpen.patch, pigPen.patch, PigPen.tgz


 This is an Eclipse plugin that provides a GUI that can help users create 
 PigLatin scripts and see the example generator outputs on the fly and submit 
 the jobs to hadoop clusters.




[jira] Updated: (PIG-366) PigPen - Eclipse plugin for a graphical PigLatin editor

2010-09-16 Thread Robert Gibbon (JIRA)

 [ 
https://issues.apache.org/jira/browse/PIG-366?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Robert Gibbon updated PIG-366:
--

Attachment: org.apache.pig.pigpen_0.7.4.jar

 PigPen - Eclipse plugin for a graphical PigLatin editor
 ---

 Key: PIG-366
 URL: https://issues.apache.org/jira/browse/PIG-366
 Project: Pig
  Issue Type: New Feature
Reporter: Shubham Chopra
Assignee: Robert Gibbon
Priority: Minor
 Attachments: org.apache.pig.pigpen-0.7.0.tar.gz, 
 org.apache.pig.pigpen-0.7.2.tar.gz, org.apache.pig.pigpen-0.7.4.tar.gz, 
 org.apache.pig.pigpen_0.0.1.jar, org.apache.pig.pigpen_0.0.1.tgz, 
 org.apache.pig.pigpen_0.0.4.jar, org.apache.pig.pigpen_0.7.2.jar, 
 org.apache.pig.pigpen_0.7.4.jar, pigpen.patch, pigPen.patch, PigPen.tgz


 This is an Eclipse plugin that provides a GUI that can help users create 
 PigLatin scripts and see the example generator outputs on the fly and submit 
 the jobs to hadoop clusters.




[jira] Updated: (PIG-1610) 'union onschema' does handle some cases involving 'namespaced' column names in schema

2010-09-16 Thread Thejas M Nair (JIRA)

 [ 
https://issues.apache.org/jira/browse/PIG-1610?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Thejas M Nair updated PIG-1610:
---

Attachment: PIG-1610.2.patch

PIG-1610.2.patch fixes the issues mentioned in the previous comment.
It passes unit tests and test-patch.


 'union onschema' does handle some cases involving 'namespaced' column names 
 in schema
 -

 Key: PIG-1610
 URL: https://issues.apache.org/jira/browse/PIG-1610
 Project: Pig
  Issue Type: Bug
Affects Versions: 0.8.0
Reporter: Thejas M Nair
Assignee: Thejas M Nair
 Fix For: 0.8.0

 Attachments: PIG-1610.1.patch, PIG-1610.2.patch


 case 1:
 grunt> describe f;
 f: {l1::a: bytearray,l1::b: bytearray}
 grunt> describe l1;
 l1: {a: bytearray,b: bytearray}
 grunt> dump f;
 (1,11)
 (2,22)
 (3,33)
 grunt> dump l1;
 (1,11)
 (2,22)
 (3,33)
 grunt> u = union onschema f, l1;
 grunt> describe u;
 u: {l1::a: bytearray,l1::b: bytearray}
 -- the dump u gives incorrect results
 grunt> dump u;
 (,)
 (,)
 (,)
 (1,11)
 (2,22)
 (3,33)
 case 2:
 grunt> u = union onschema l1, f;
 grunt> describe u;
 2010-09-13 15:11:13,877 [main] ERROR org.apache.pig.tools.grunt.Grunt - ERROR 1108: Duplicate schema alias: l1::a
 Details at logfile: /Users/tejas/pig_unions_err2/trunk/pig_1284410413970.log




[jira] Commented: (PIG-1229) allow pig to write output into a JDBC db

2010-09-16 Thread Sandesh Devaraju (JIRA)

[ 
https://issues.apache.org/jira/browse/PIG-1229?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12910331#action_12910331
 ] 

Sandesh Devaraju commented on PIG-1229:
---

I upgraded to 0.7 and tried the updated patch. However, I don't see any entries 
in the database.
Upon further investigation, I noticed that in my particular case, the batch 
size was 100 and the number of output records that ended up at every reducer 
was below this threshold.
I added a debug statement to the OutputCommitter's commitTask method and found 
that count was 0.
Any ideas why this might be happening?

 allow pig to write output into a JDBC db
 

 Key: PIG-1229
 URL: https://issues.apache.org/jira/browse/PIG-1229
 Project: Pig
  Issue Type: New Feature
  Components: impl
Reporter: Ian Holsman
Assignee: Ankur
Priority: Minor
 Fix For: 0.8.0

 Attachments: jira-1229-final.patch, jira-1229-final.test-fix.patch, 
 jira-1229-v2.patch, jira-1229-v3.patch, pig-1229.2.patch, pig-1229.patch


 UDF to store data into a DB




[jira] Created: (PIG-1615) Return code from Pig is 0 even if the job fails when using -M flag

2010-09-16 Thread Viraj Bhat (JIRA)
Return code from Pig is 0 even if the job fails when using -M flag
--

 Key: PIG-1615
 URL: https://issues.apache.org/jira/browse/PIG-1615
 Project: Pig
  Issue Type: Bug
Affects Versions: 0.7.0, 0.6.0
Reporter: Viraj Bhat
 Fix For: 0.8.0


I have a Pig script of this form, which I used inside a workflow system such as 
Oozie.
{code}
A = load  '$INPUT' using PigStorage();
store A into '$OUTPUT';
{code}

I run this with multi-query optimization turned off:
{quote}
$ java -cp ~/pig-svn/trunk/pig.jar:$HADOOP_CONF_DIR org.apache.pig.Main -p 
INPUT=/user/viraj/junk1 -M -p OUTPUT=/user/viraj/junk2 loadpigstorage.pig
{quote}

The directory /user/viraj/junk1 does not exist.

I get the following results:
{quote}
Input(s):
Failed to read data from /user/viraj/junk1
Output(s):
Failed to produce result in /user/viraj/junk2
{quote}

This is expected, but the return code is still 0:
{code}
$ echo $?
0
{code}

If I run this script with multi-query optimization turned on, it gives a 
return code of 2, which is correct.

{code}
$ java -cp ~/pig-svn/trunk/pig.jar:$HADOOP_CONF_DIR org.apache.pig.Main -p 
INPUT=/user/viraj/junk1 -p OUTPUT=/user/viraj/junk2 loadpigstorage.pig
...
$ echo $?
2
{code}

I believe the wrong return code from Pig is causing Oozie to believe that the 
Pig script succeeded.

Viraj
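
The impact on a workflow system can be sketched as follows. This is a minimal illustration, assuming the caller judges success purely by the process exit status; it is not a description of Oozie's internals. The `step_succeeded` helper and the two stand-in commands are hypothetical and simply replace a real Pig invocation with processes that exit 2 and 0.

```python
import subprocess
import sys


def step_succeeded(command):
    """Return True when the command exits with status 0 (hypothetical helper)."""
    completed = subprocess.run(command)
    return completed.returncode == 0


# Stand-ins for a Pig run: one process that exits with 2, one that exits with 0.
failing = [sys.executable, "-c", "import sys; sys.exit(2)"]
passing = [sys.executable, "-c", "import sys; sys.exit(0)"]

if __name__ == "__main__":
    print("exit 2 treated as success:", step_succeeded(failing))
    print("exit 0 treated as success:", step_succeeded(passing))
```

Under this assumption, a failed job that still exits with 0 is indistinguishable from a successful one, which matches the behaviour reported above.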




[jira] Commented: (PIG-1615) Return code from Pig is 0 even if the job fails when using -M flag

2010-09-16 Thread Richard Ding (JIRA)

[ 
https://issues.apache.org/jira/browse/PIG-1615?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12910407#action_12910407
 ] 

Richard Ding commented on PIG-1615:
---

This problem exists in Pig 0.7 and is fixed in Pig 0.8.

 Return code from Pig is 0 even if the job fails when using -M flag
 --

 Key: PIG-1615
 URL: https://issues.apache.org/jira/browse/PIG-1615
 Project: Pig
  Issue Type: Bug
Affects Versions: 0.6.0, 0.7.0
Reporter: Viraj Bhat
 Fix For: 0.8.0






[jira] Created: (PIG-1616) 'union onschema' does not use create output with correct schema when udfs are involved

2010-09-16 Thread Thejas M Nair (JIRA)
'union onschema' does not use create output with correct schema when udfs are 
involved
--

 Key: PIG-1616
 URL: https://issues.apache.org/jira/browse/PIG-1616
 Project: Pig
  Issue Type: Bug
Affects Versions: 0.8.0
Reporter: Thejas M Nair
Assignee: Thejas M Nair
 Fix For: 0.8.0


'union onschema' creates a merged schema based on the input schemas. It does 
that in the query parser, and at that stage the UDF return type used is the 
default return type. The actual return type for the UDF is determined later in 
the TypeCheckingVisitor using EvalFunc.getArgToFuncMapping().
'union onschema' should use the final type for its input relation to create the 
merged schema.
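
The ordering problem can be illustrated with a small model. This is only a sketch under stated assumptions: schemas are modelled as plain alias-to-type dictionaries, the UDF default type is assumed to be `bytearray`, and `merge_by_name` and `resolve_udf_types` are hypothetical helpers, not Pig classes or methods. The rule that the first-seen type wins on a name clash is a simplification chosen for the example.

```python
DEFAULT_UDF_TYPE = "bytearray"  # assumption: placeholder type before type checking


def merge_by_name(*schemas):
    """Union of aliases in first-seen order; the first-seen type wins (simplification)."""
    merged = {}
    for schema in schemas:
        for alias, type_name in schema.items():
            merged.setdefault(alias, type_name)
    return merged


def resolve_udf_types(schema, final_types):
    """Replace placeholder types with the types known after type checking."""
    return {alias: final_types.get(alias, t) for alias, t in schema.items()}


# 'score' is produced by a UDF; at parse time only the default type is known.
left_at_parse = {"name": "chararray", "score": DEFAULT_UDF_TYPE}
right = {"name": "chararray", "city": "chararray"}
final_types = {"score": "double"}

# Merging at parse time freezes the placeholder type into the merged schema.
early = merge_by_name(left_at_parse, right)
# Merging after type resolution carries the final type instead.
late = merge_by_name(resolve_udf_types(left_at_parse, final_types), right)

if __name__ == "__main__":
    print(early)
    print(late)
```

In this model the only difference between `early` and `late` is the type recorded for the UDF column, which is the discrepancy this issue describes.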






[jira] Commented: (PIG-1610) 'union onschema' does handle some cases involving 'namespaced' column names in schema

2010-09-16 Thread Thejas M Nair (JIRA)

[ 
https://issues.apache.org/jira/browse/PIG-1610?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12910408#action_12910408
 ] 

Thejas M Nair commented on PIG-1610:


There is a problem with the 'union onschema' implementation that is not specific 
to this jira. I have created a new jira to address it: PIG-1616.


 'union onschema' does handle some cases involving 'namespaced' column names 
 in schema
 -

 Key: PIG-1610
 URL: https://issues.apache.org/jira/browse/PIG-1610
 Project: Pig
  Issue Type: Bug
Affects Versions: 0.8.0
Reporter: Thejas M Nair
Assignee: Thejas M Nair
 Fix For: 0.8.0

 Attachments: PIG-1610.1.patch, PIG-1610.2.patch






[jira] Resolved: (PIG-1615) Return code from Pig is 0 even if the job fails when using -M flag

2010-09-16 Thread Olga Natkovich (JIRA)

 [ 
https://issues.apache.org/jira/browse/PIG-1615?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Olga Natkovich resolved PIG-1615.
-

Resolution: Fixed

 Return code from Pig is 0 even if the job fails when using -M flag
 --

 Key: PIG-1615
 URL: https://issues.apache.org/jira/browse/PIG-1615
 Project: Pig
  Issue Type: Bug
Affects Versions: 0.6.0, 0.7.0
Reporter: Viraj Bhat
 Fix For: 0.8.0






[jira] Commented: (PIG-1610) 'union onschema' does handle some cases involving 'namespaced' column names in schema

2010-09-16 Thread Richard Ding (JIRA)

[ 
https://issues.apache.org/jira/browse/PIG-1610?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12910409#action_12910409
 ] 

Richard Ding commented on PIG-1610:
---

+1

 'union onschema' does handle some cases involving 'namespaced' column names 
 in schema
 -

 Key: PIG-1610
 URL: https://issues.apache.org/jira/browse/PIG-1610
 Project: Pig
  Issue Type: Bug
Affects Versions: 0.8.0
Reporter: Thejas M Nair
Assignee: Thejas M Nair
 Fix For: 0.8.0

 Attachments: PIG-1610.1.patch, PIG-1610.2.patch






[jira] Created: (PIG-1617) 'group all' should always use one reducer

2010-09-16 Thread Thejas M Nair (JIRA)
'group all' should always use one reducer
-

 Key: PIG-1617
 URL: https://issues.apache.org/jira/browse/PIG-1617
 Project: Pig
  Issue Type: Improvement
Affects Versions: 0.8.0
Reporter: Thejas M Nair
Assignee: Thejas M Nair
 Fix For: 0.8.0


'group all' sends all rows to a single reducer, so it does not make sense to 
spawn more than one reducer for it. But if a higher value of parallelism is 
specified, or if the input is large enough that the changes in PIG-1249 result 
in a larger value being set, additional reducers are spawned that don't do 
anything useful.
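
The proposed rule can be sketched as a small decision function. This is a hypothetical illustration, not Pig's planner code: `reducer_count`, its parameters, and the idea that the size-based estimate arrives as a single number are all assumptions made for the example.

```python
def reducer_count(is_group_all, requested_parallel=None, estimated=1):
    """Pick the number of reducers for a job (hypothetical sketch).

    is_group_all:       True when the job implements 'group all'.
    requested_parallel: value of an explicit parallelism setting, if any.
    estimated:          size-based estimate used when nothing is requested.
    """
    if is_group_all:
        # Every row goes to the same key, so extra reducers would sit idle.
        return 1
    if requested_parallel is not None:
        return requested_parallel
    return estimated


if __name__ == "__main__":
    print(reducer_count(True, requested_parallel=20))
    print(reducer_count(False, requested_parallel=20))
    print(reducer_count(False, estimated=7))
```

The point of the sketch is only the precedence: the 'group all' check comes before both the explicit setting and the estimate.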





[jira] Updated: (PIG-1610) 'union onschema' does handle some cases involving 'namespaced' column names in schema

2010-09-16 Thread Thejas M Nair (JIRA)

 [ 
https://issues.apache.org/jira/browse/PIG-1610?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Thejas M Nair updated PIG-1610:
---

  Status: Resolved  (was: Patch Available)
Hadoop Flags: [Reviewed]
  Resolution: Fixed

Patch committed to trunk and 0.8 branch.


 'union onschema' does handle some cases involving 'namespaced' column names 
 in schema
 -

 Key: PIG-1610
 URL: https://issues.apache.org/jira/browse/PIG-1610
 Project: Pig
  Issue Type: Bug
Affects Versions: 0.8.0
Reporter: Thejas M Nair
Assignee: Thejas M Nair
 Fix For: 0.8.0

 Attachments: PIG-1610.1.patch, PIG-1610.2.patch






[jira] Commented: (PIG-1615) Return code from Pig is 0 even if the job fails when using -M flag

2010-09-16 Thread Viraj Bhat (JIRA)

[ 
https://issues.apache.org/jira/browse/PIG-1615?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12910414#action_12910414
 ] 

Viraj Bhat commented on PIG-1615:
-

I tested this on Pig 0.8, but with a downloaded version that was a little old.

I re-downloaded the latest source, and it seems to be fixed.

Viraj

 Return code from Pig is 0 even if the job fails when using -M flag
 --

 Key: PIG-1615
 URL: https://issues.apache.org/jira/browse/PIG-1615
 Project: Pig
  Issue Type: Bug
Affects Versions: 0.6.0, 0.7.0
Reporter: Viraj Bhat
 Fix For: 0.8.0






[jira] Updated: (PIG-1565) additional piggybank datetime and string UDFs

2010-09-16 Thread Andrew Hitchcock (JIRA)

 [ 
https://issues.apache.org/jira/browse/PIG-1565?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andrew Hitchcock updated PIG-1565:
--

Attachment: PIG-1565-2.patch

Made changes to LAST_INDEX_OF, INDEXOF, and SPLIT_ON_REGEX as per request. Also 
fixed the test case bug, which was caused by a missing change (this patch now 
extends SUBSTRING with more functionality).

 additional piggybank datetime and string UDFs
 -

 Key: PIG-1565
 URL: https://issues.apache.org/jira/browse/PIG-1565
 Project: Pig
  Issue Type: Improvement
Reporter: Andrew Hitchcock
Assignee: Andrew Hitchcock
 Fix For: 0.8.0

 Attachments: PIG-1565-1.patch, PIG-1565-2.patch


 Pig is missing a variety of UDFs that might be helpful for users implementing 
 Pig scripts.




[jira] Updated: (PIG-1565) additional piggybank datetime and string UDFs

2010-09-16 Thread Andrew Hitchcock (JIRA)

 [ 
https://issues.apache.org/jira/browse/PIG-1565?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andrew Hitchcock updated PIG-1565:
--

Status: Patch Available  (was: Open)

 additional piggybank datetime and string UDFs
 -

 Key: PIG-1565
 URL: https://issues.apache.org/jira/browse/PIG-1565
 Project: Pig
  Issue Type: Improvement
Reporter: Andrew Hitchcock
Assignee: Andrew Hitchcock
 Fix For: 0.8.0

 Attachments: PIG-1565-1.patch, PIG-1565-2.patch


 Pig is missing a variety of UDFs that might be helpful for users implementing 
 Pig scripts.




[jira] Commented: (PIG-1229) allow pig to write output into a JDBC db

2010-09-16 Thread Ankur (JIRA)

[ 
https://issues.apache.org/jira/browse/PIG-1229?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12910441#action_12910441
 ] 

Ankur commented on PIG-1229:


In the putNext() method, count is reset to 0 every time the number of tuples 
added to the batch exceeds 'batchSize'. The batch is then executed and its 
parameters cleared. There is currently an ExecException in the putNext() method 
that is being ignored. Can you try adding some debugging System.outs and check 
the stdout/stderr of your reducers to see if that is the problem?
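
The batching behaviour described above can be modelled in a few lines. This is a simplified sketch, not the code from the attached patches: `BatchedWriter`, its fields, and the `flush_remainder` switch are hypothetical, and a plain list stands in for the database. It only shows what happens when a writer receives fewer records than the batch size.

```python
class BatchedWriter:
    """Toy model of a batched store function (hypothetical, not the patch)."""

    def __init__(self, batch_size):
        self.batch_size = batch_size
        self.count = 0
        self.pending = []
        self.written = []  # stands in for rows that reached the database

    def put_next(self, record):
        self.pending.append(record)
        self.count += 1
        # Execute the batch once the count exceeds the batch size.
        if self.count > self.batch_size:
            self._execute_batch()

    def _execute_batch(self):
        self.written.extend(self.pending)
        self.pending.clear()
        self.count = 0

    def commit_task(self, flush_remainder):
        # Without a final flush, a partially filled batch is never written.
        if flush_remainder and self.pending:
            self._execute_batch()


if __name__ == "__main__":
    writer = BatchedWriter(batch_size=100)
    for i in range(5):
        writer.put_next(i)
    print("written before commit:", len(writer.written))
    writer.commit_task(flush_remainder=True)
    print("written after commit:", len(writer.written))
```

In this model nothing is written until the commit step flushes the partial batch, so it is worth checking in the real code whether the remainder is executed on commit and whether an ignored exception prevents it. The model does not by itself explain a count of 0 at commit time.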

 allow pig to write output into a JDBC db
 

 Key: PIG-1229
 URL: https://issues.apache.org/jira/browse/PIG-1229
 Project: Pig
  Issue Type: New Feature
  Components: impl
Reporter: Ian Holsman
Assignee: Ankur
Priority: Minor
 Fix For: 0.8.0

 Attachments: jira-1229-final.patch, jira-1229-final.test-fix.patch, 
 jira-1229-v2.patch, jira-1229-v3.patch, pig-1229.2.patch, pig-1229.patch


 UDF to store data into a DB
