[jira] Subscription: PIG patch available

2014-10-15 Thread jira
Issue Subscription Filter: PIG patch available (18 issues) Subscriber: pigdaily Key Summary PIG-4184UDF backward compatibility issue after POStatus.STATUS_NULL refactory https://issues.apache.org/jira/browse/PIG-4184 PIG-4160-forcelocaljars / -j flag when using a

[jira] [Commented] (PIG-4227) Streaming Python UDF handles bag outputs incorrectly

2014-10-15 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4227?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14172107#comment-14172107 ] Daniel Dai commented on PIG-4227: - [~cheolsoo], looked at scriptingudf.complexTypes, python

[jira] [Created] (PIG-4236) Avoid packaging spark specific jars into pig fat jar

2014-10-15 Thread Praveen Rachabattuni (JIRA)
Praveen Rachabattuni created PIG-4236: - Summary: Avoid packaging spark specific jars into pig fat jar Key: PIG-4236 URL: https://issues.apache.org/jira/browse/PIG-4236 Project: Pig Issue

[jira] [Commented] (PIG-4233) Package pig along with dependencies into a fat jar while job submission to Spark cluster

2014-10-15 Thread Praveen Rachabattuni (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4233?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14172212#comment-14172212 ] Praveen Rachabattuni commented on PIG-4233: --- [~rohini] So, the idea would be to

[jira] [Assigned] (PIG-4237) Error when there is a bag inside an RDD

2014-10-15 Thread Carlos Balduz (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4237?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Carlos Balduz reassigned PIG-4237: -- Assignee: Carlos Balduz Error when there is a bag inside an RDD

[jira] [Created] (PIG-4237) Error when there is a bag inside an RDD

2014-10-15 Thread Carlos Balduz (JIRA)
Carlos Balduz created PIG-4237: -- Summary: Error when there is a bag inside an RDD Key: PIG-4237 URL: https://issues.apache.org/jira/browse/PIG-4237 Project: Pig Issue Type: Bug

[jira] [Commented] (PIG-4237) Error when there is a bag inside an RDD

2014-10-15 Thread Carlos Balduz (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4237?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14172256#comment-14172256 ] Carlos Balduz commented on PIG-4237: How should I proceed with this issue

[jira] [Commented] (PIG-4234) Order By error after Group By in Spark

2014-10-15 Thread Praveen Rachabattuni (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4234?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14172263#comment-14172263 ] Praveen Rachabattuni commented on PIG-4234: --- I have below script to be working on

[jira] [Commented] (PIG-4234) Order By error after Group By in Spark

2014-10-15 Thread Carlos Balduz (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4234?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14172270#comment-14172270 ] Carlos Balduz commented on PIG-4234: This is my script: A = LOAD 'movies_data.csv'

[jira] [Commented] (PIG-4233) Package pig along with dependencies into a fat jar while job submission to Spark cluster

2014-10-15 Thread Rohini Palaniswamy (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4233?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14172432#comment-14172432 ] Rohini Palaniswamy commented on PIG-4233: - You should create a spark directory under

[jira] [Comment Edited] (PIG-4233) Package pig along with dependencies into a fat jar while job submission to Spark cluster

2014-10-15 Thread Rohini Palaniswamy (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4233?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14172432#comment-14172432 ] Rohini Palaniswamy edited comment on PIG-4233 at 10/15/14 2:51 PM:

[jira] [Commented] (PIG-4160) -forcelocaljars / -j flag when using a remote url for a script

2014-10-15 Thread Andrew C. Oliver (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4160?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14172451#comment-14172451 ] Andrew C. Oliver commented on PIG-4160: --- will do. -forcelocaljars / -j flag when

[jira] [Commented] (PIG-4160) -forcelocaljars / -j flag when using a remote url for a script

2014-10-15 Thread Andrew C. Oliver (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4160?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14172452#comment-14172452 ] Andrew C. Oliver commented on PIG-4160: --- oh you beat me to it, even better :)

[jira] [Commented] (PIG-4124) Command for Python streaming udf should be configurable

2014-10-15 Thread Mike Sukmanowsky (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4124?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14172502#comment-14172502 ] Mike Sukmanowsky commented on PIG-4124: --- Awesome, thanks [~cheolsoo]! So we'll be able

[jira] [Commented] (PIG-4227) Streaming Python UDF handles bag outputs incorrectly

2014-10-15 Thread Cheolsoo Park (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4227?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14172620#comment-14172620 ] Cheolsoo Park commented on PIG-4227: [~daijy], sorry for breaking unit tests. {quote} I

[jira] [Updated] (PIG-4231) Make rank work with Spark

2014-10-15 Thread Carlos Balduz (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4231?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Carlos Balduz updated PIG-4231: --- Status: Open (was: Patch Available) Make rank work with Spark -

[jira] [Updated] (PIG-4231) Make rank work with Spark

2014-10-15 Thread Carlos Balduz (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4231?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Carlos Balduz updated PIG-4231: --- Status: Patch Available (was: Open) Make rank work with Spark -

[jira] [Work started] (PIG-4231) Make rank work with Spark

2014-10-15 Thread Carlos Balduz (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4231?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Work on PIG-4231 started by Carlos Balduz. -- Make rank work with Spark - Key: PIG-4231

[jira] [Commented] (PIG-4227) Streaming Python UDF handles bag outputs incorrectly

2014-10-15 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4227?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14172634#comment-14172634 ] Daniel Dai commented on PIG-4227: - Shall we make python udf insert tuple automatically?

[jira] [Updated] (PIG-4231) Make rank work with Spark

2014-10-15 Thread Carlos Balduz (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4231?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Carlos Balduz updated PIG-4231: --- Status: Patch Available (was: In Progress) diff --git

[jira] [Commented] (PIG-4231) Make rank work with Spark

2014-10-15 Thread Carlos Balduz (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14172646#comment-14172646 ] Carlos Balduz commented on PIG-4231: [~praveenr019] I am trying to push to git the 2 new

[jira] [Work started] (PIG-4231) Make rank work with Spark

2014-10-15 Thread Carlos Balduz (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4231?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Work on PIG-4231 started by Carlos Balduz. -- Make rank work with Spark - Key: PIG-4231

[jira] [Updated] (PIG-4231) Make rank work with Spark

2014-10-15 Thread Carlos Balduz (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4231?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Carlos Balduz updated PIG-4231: --- Attachment: PIG-4231.patch Make rank work with Spark - Key:

[jira] [Commented] (PIG-4227) Streaming Python UDF handles bag outputs incorrectly

2014-10-15 Thread Cheolsoo Park (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4227?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14172658#comment-14172658 ] Cheolsoo Park commented on PIG-4227: {quote} Otherwise we break python udf which do

[jira] [Updated] (PIG-4231) Make rank work with Spark

2014-10-15 Thread Carlos Balduz (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4231?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Carlos Balduz updated PIG-4231: --- Status: Open (was: Patch Available) Make rank work with Spark -

[jira] [Commented] (PIG-4227) Streaming Python UDF handles bag outputs incorrectly

2014-10-15 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4227?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14172688#comment-14172688 ] Daniel Dai commented on PIG-4227: - This is a corner case if a tuple contains a single item,

[jira] [Commented] (PIG-4227) Streaming Python UDF handles bag outputs incorrectly

2014-10-15 Thread Cheolsoo Park (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4227?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14172704#comment-14172704 ] Cheolsoo Park commented on PIG-4227: Yes, you're right. Streaming Python UDF handles

[jira] [Updated] (PIG-4166) Collected group drops last record when combined with merge join

2014-10-15 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4166?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-4166: Fix Version/s: (was: 0.14.0) 0.15.0 Collected group drops last record when combined

[jira] [Updated] (PIG-4227) Streaming Python UDF handles bag outputs incorrectly

2014-10-15 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4227?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-4227: Attachment: PIG-4227-2.patch Attach a patch to add tuple automatically. [~cheolsoo], can you check if that

[jira] [Updated] (PIG-4224) Upload Tez payload history string to timeline server

2014-10-15 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4224?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-4224: Attachment: PIG-4224-1.patch Upload configuration as xml since it is easier. Upload Tez payload history

[jira] [Updated] (PIG-4224) Upload Tez payload history string to timeline server

2014-10-15 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4224?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-4224: Status: Patch Available (was: Open) Upload Tez payload history string to timeline server

[jira] [Commented] (PIG-3979) group all performance, garbage collection, and incremental aggregation

2014-10-15 Thread Rohini Palaniswamy (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14173038#comment-14173038 ] Rohini Palaniswamy commented on PIG-3979: - This patch makes almost all spill related

[jira] [Commented] (PIG-3979) group all performance, garbage collection, and incremental aggregation

2014-10-15 Thread Rohini Palaniswamy (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14173053#comment-14173053 ] Rohini Palaniswamy commented on PIG-3979: - Also I see issues and performance impact

[jira] [Commented] (PIG-4012) java.lang.IllegalArgumentException: Comparison method violates its general contract! SpillableMemoryManager

2014-10-15 Thread Rohini Palaniswamy (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4012?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14173060#comment-14173060 ] Rohini Palaniswamy commented on PIG-4012: - This should just be a temporary exception

[jira] [Commented] (PIG-3979) group all performance, garbage collection, and incremental aggregation

2014-10-15 Thread Rohini Palaniswamy (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14173099#comment-14173099 ] Rohini Palaniswamy commented on PIG-3979: - Another comment. The tuplesize is

[jira] [Created] (PIG-4238) Property 'pig.job.converted.fetch' should be unset when fetch finishes

2014-10-15 Thread Lorand Bendig (JIRA)
Lorand Bendig created PIG-4238: -- Summary: Property 'pig.job.converted.fetch' should be unset when fetch finishes Key: PIG-4238 URL: https://issues.apache.org/jira/browse/PIG-4238 Project: Pig

[jira] [Commented] (PIG-4227) Streaming Python UDF handles bag outputs incorrectly

2014-10-15 Thread Cheolsoo Park (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4227?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14173238#comment-14173238 ] Cheolsoo Park commented on PIG-4227: Yes, it works! I haven't verified the broken tests,

[jira] [Resolved] (PIG-4227) Streaming Python UDF handles bag outputs incorrectly

2014-10-15 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4227?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai resolved PIG-4227. - Resolution: Fixed Hadoop Flags: Reviewed Yes, test pass. Committed to trunk and 0.14 branch. Thanks