Re: Hadoop 2.3 and pig

2013-09-04 Thread Prashant Kommireddi
Thanks Rohini, good to know. On Tuesday, September 3, 2013, Rohini Palaniswamy wrote: I know many of you are trying out Hadoop 2.x. Just FYI for those to save time if they hit the following issue when they are building directly off the branch. pig joins (replication, skewed and merge

[jira] [Updated] (PIG-3450) error when pigrunner.run embedded python stored in hdfs

2013-09-04 Thread dmitry d (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3450?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] dmitry d updated PIG-3450: -- Description: I try to run embedded python from java program using PigRunner. Script file store in hdfs. And I

[jira] [Updated] (PIG-2417) Streaming UDFs - allow users to easily write UDFs in scripting languages with no JVM implementation.

2013-09-04 Thread Jeremy Karn (JIRA)
[ https://issues.apache.org/jira/browse/PIG-2417?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeremy Karn updated PIG-2417: - Fix Version/s: 0.12 Streaming UDFs - allow users to easily write UDFs in scripting languages with

[jira] [Updated] (PIG-3426) Add support for removing s3 files

2013-09-04 Thread Jeremy Karn (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3426?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeremy Karn updated PIG-3426: - Fix Version/s: 0.12 Add support for removing s3 files -

Re: Are we ready for Pig 0.12.0 release?

2013-09-04 Thread Jeremy Karn
I have one JIRA https://issues.apache.org/jira/browse/PIG-2417 that I would like to get into 0.12 because we've had a number of people ask us about getting it committed back to Apache. However, if it looks like too much to review and get committed in the next week or two it could probably be

[jira] [Updated] (PIG-3388) No support for Regex for row filter in org.apache.pig.backend.hadoop.hbase.HBaseStorage

2013-09-04 Thread Lorand Bendig (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3388?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lorand Bendig updated PIG-3388: --- Attachment: PIG-3388.patch When using -regex scan will use RegexStringComparator on RowFilter to match

[jira] [Updated] (PIG-3388) No support for Regex for row filter in org.apache.pig.backend.hadoop.hbase.HBaseStorage

2013-09-04 Thread Lorand Bendig (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3388?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lorand Bendig updated PIG-3388: --- Assignee: Lorand Bendig Status: Patch Available (was: Open) No support for Regex for row

Re: Propose UDF

2013-09-04 Thread Alan Gates
A few questions: 1) Why did you try to use RANK? I don't see how rank is part of this. 2) The semantics here aren't clear to me. record_id appears to be crossed with name and id but name and id appear to be chosen in order. If this is join semantics I'd have expected two more entries in B,

[jira] [Commented] (PIG-3430) Add xml format for explaining MapReduce Plan.

2013-09-04 Thread Jeremy Karn (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3430?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13758041#comment-13758041 ] Jeremy Karn commented on PIG-3430: -- New patch that applies cleanly after PIG-3419.

[jira] [Updated] (PIG-3430) Add xml format for explaining MapReduce Plan.

2013-09-04 Thread Jeremy Karn (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3430?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeremy Karn updated PIG-3430: - Attachment: PIG-3430-2.patch Add xml format for explaining MapReduce Plan.

[jira] [Commented] (PIG-2417) Streaming UDFs - allow users to easily write UDFs in scripting languages with no JVM implementation.

2013-09-04 Thread Jeremy Karn (JIRA)
[ https://issues.apache.org/jira/browse/PIG-2417?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13758198#comment-13758198 ] Jeremy Karn commented on PIG-2417: -- Latest patch just has a couple small changes so that it

[jira] [Updated] (PIG-2417) Streaming UDFs - allow users to easily write UDFs in scripting languages with no JVM implementation.

2013-09-04 Thread Jeremy Karn (JIRA)
[ https://issues.apache.org/jira/browse/PIG-2417?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeremy Karn updated PIG-2417: - Attachment: PIG-2417-6.patch Streaming UDFs - allow users to easily write UDFs in scripting

[jira] [Commented] (PIG-3431) Return more information for parsing related exceptions.

2013-09-04 Thread Jeremy Karn (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3431?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13758203#comment-13758203 ] Jeremy Karn commented on PIG-3431: -- Latest patch just has a couple of changes so that it

[jira] [Updated] (PIG-3431) Return more information for parsing related exceptions.

2013-09-04 Thread Jeremy Karn (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3431?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeremy Karn updated PIG-3431: - Attachment: PIG-3431-2.patch Return more information for parsing related exceptions.

[jira] [Commented] (PIG-3430) Add xml format for explaining MapReduce Plan.

2013-09-04 Thread Jeremy Karn (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3430?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13758283#comment-13758283 ] Jeremy Karn commented on PIG-3430: -- Fix two small bugs: * Not closing the plan tag when

[jira] [Updated] (PIG-3430) Add xml format for explaining MapReduce Plan.

2013-09-04 Thread Jeremy Karn (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3430?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeremy Karn updated PIG-3430: - Attachment: PIG-3430-3.patch Add xml format for explaining MapReduce Plan.

[jira] [Commented] (PIG-3295) Casting from bytearray failing after Union (even when each field is from a single Loader)

2013-09-04 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3295?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13758291#comment-13758291 ] Daniel Dai commented on PIG-3295: - How about doing more aggressively by checking LoadCaster?

[jira] [Commented] (PIG-3430) Add xml format for explaining MapReduce Plan.

2013-09-04 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3430?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13758340#comment-13758340 ] Daniel Dai commented on PIG-3430: - Thanks Jeremy. I am not worrying about not supporting

[jira] [Commented] (PIG-3426) Add support for removing s3 files

2013-09-04 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3426?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13758425#comment-13758425 ] Daniel Dai commented on PIG-3426: - +1. Will commit shortly. Add support

[jira] [Commented] (PIG-3431) Return more information for parsing related exceptions.

2013-09-04 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3431?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13758505#comment-13758505 ] Daniel Dai commented on PIG-3431: - I didn't find any test case verify the exception and

Re: Review Request 13781: Changes to add support for streaming_python udfs.

2013-09-04 Thread Julien Le Dem
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/13781/#review25913 --- src/org/apache/pig/builtin/StreamUDFToPig.java

[jira] Subscription: PIG patch available

2013-09-04 Thread jira
Issue Subscription Filter: PIG patch available (18 issues) Subscriber: pigdaily Key Summary PIG-3449Move JobCreationException to org.apache.pig.backend.hadoop.executionengine https://issues.apache.org/jira/browse/PIG-3449 PIG-3448Tez backend layout

[jira] [Commented] (PIG-3431) Return more information for parsing related exceptions.

2013-09-04 Thread Jeremy Karn (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3431?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13758701#comment-13758701 ] Jeremy Karn commented on PIG-3431: -- Here's a new patch with two test cases to check for the

[jira] [Updated] (PIG-3431) Return more information for parsing related exceptions.

2013-09-04 Thread Jeremy Karn (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3431?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeremy Karn updated PIG-3431: - Attachment: PIG-3431-3.patch Return more information for parsing related exceptions.

[jira] [Updated] (PIG-3430) Add xml format for explaining MapReduce Plan.

2013-09-04 Thread Jeremy Karn (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3430?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeremy Karn updated PIG-3430: - Release Note: Pig now supports printing out the MapReduce plan in an xml format. To run: pig -e 'explain