Review Request 18832: PIG-3603: Add counters to TezStats

2014-03-06 Thread Cheolsoo Park
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/18832/ --- Review request for pig, Mark Wagner and Rohini Palaniswamy. Bugs: PIG-3603

Re: Review Request 18832: PIG-3603: Add counters to TezStats

2014-03-06 Thread Cheolsoo Park
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/18832/ --- (Updated March 6, 2014, 8 a.m.) Review request for pig, Mark Wagner and Rohini

[jira] [Updated] (PIG-3603) Add counters to TezStats

2014-03-06 Thread Cheolsoo Park (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3603?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheolsoo Park updated PIG-3603: --- Attachment: PIG-3603-1.patch RB link- https://reviews.apache.org/r/18832/ Add counters to TezStats

Re: Review Request 15881: PIG-3591: Refactor POPackage

2014-03-06 Thread Rohini Palaniswamy
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/15881/#review36273 --- I am still going over the main classes POPackage and Packager

[jira] [Created] (PIG-3798) registered jar in pig script are appended to the classpath multiple times

2014-03-06 Thread Dotan Patrich (JIRA)
Dotan Patrich created PIG-3798: -- Summary: registered jar in pig script are appended to the classpath multiple times Key: PIG-3798 URL: https://issues.apache.org/jira/browse/PIG-3798 Project: Pig

[jira] [Updated] (PIG-3446) Umbrella jira for Pig on Tez

2014-03-06 Thread Rohit Laddha (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3446?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rohit Laddha updated PIG-3446: -- Description: This is a umbrella jira for Pig on Tez. More detailed subtasks will be added. More

[jira] [Updated] (PIG-3797) Fix some memory leaks affecting container reuse

2014-03-06 Thread Rohini Palaniswamy (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3797?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rohini Palaniswamy updated PIG-3797: Attachment: PIG-3797-1.patch Fix some memory leaks affecting container reuse

[jira] [Updated] (PIG-3797) Fix some memory leaks affecting container reuse

2014-03-06 Thread Rohini Palaniswamy (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3797?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rohini Palaniswamy updated PIG-3797: Attachment: (was: PIG-3797-1.patch) Fix some memory leaks affecting container reuse

[jira] [Commented] (PIG-3793) Provide info on number of LogicalRelationalOperator(s) used in the script through LogicalPlanData

2014-03-06 Thread Kyungho Jeon (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3793?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13922703#comment-13922703 ] Kyungho Jeon commented on PIG-3793: --- Thank you for the response. I found that

[jira] [Resolved] (PIG-3786) POReservoirSample should handle endOfAllInput flag

2014-03-06 Thread Cheolsoo Park (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3786?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheolsoo Park resolved PIG-3786. Resolution: Fixed Committed to tez branch. Thanks Daniel! POReservoirSample should handle

[jira] [Resolved] (PIG-3788) NPE when POStream is not in the leaf vertex

2014-03-06 Thread Cheolsoo Park (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3788?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheolsoo Park resolved PIG-3788. Resolution: Fixed Committed to tez branch. NPE when POStream is not in the leaf vertex

[jira] [Assigned] (PIG-3743) Use VertexGroup and Alias vertex for union

2014-03-06 Thread Cheolsoo Park (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3743?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheolsoo Park reassigned PIG-3743: -- Assignee: Cheolsoo Park Use VertexGroup and Alias vertex for union

[jira] [Created] (PIG-3799) TestCustomPartitioner is broken in tez branch

2014-03-06 Thread Cheolsoo Park (JIRA)
Cheolsoo Park created PIG-3799: -- Summary: TestCustomPartitioner is broken in tez branch Key: PIG-3799 URL: https://issues.apache.org/jira/browse/PIG-3799 Project: Pig Issue Type: Sub-task

[jira] [Updated] (PIG-3799) TestCustomPartitioner is broken in tez branch

2014-03-06 Thread Cheolsoo Park (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3799?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheolsoo Park updated PIG-3799: --- Attachment: PIG-3799-1.patch The parallelism used to be set in POForeach, and that value used to

[jira] [Updated] (PIG-3799) TestCustomPartitioner is broken in tez branch

2014-03-06 Thread Cheolsoo Park (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3799?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheolsoo Park updated PIG-3799: --- Status: Patch Available (was: Open) TestCustomPartitioner is broken in tez branch

[jira] [Comment Edited] (PIG-3799) TestCustomPartitioner is broken in tez branch

2014-03-06 Thread Cheolsoo Park (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3799?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13922937#comment-13922937 ] Cheolsoo Park edited comment on PIG-3799 at 3/6/14 7:31 PM: The

[jira] [Commented] (PIG-3793) Provide info on number of LogicalRelationalOperator(s) used in the script through LogicalPlanData

2014-03-06 Thread Prashant Kommireddi (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3793?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13922950#comment-13922950 ] Prashant Kommireddi commented on PIG-3793: -- It's cool, that's how the project

[jira] [Commented] (PIG-3790) Several changes in Tez e2e

2014-03-06 Thread Rohini Palaniswamy (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3790?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13922969#comment-13922969 ] Rohini Palaniswamy commented on PIG-3790: - I am noticing that even when specifying

[jira] [Commented] (PIG-3797) Fix some memory leaks affecting container reuse

2014-03-06 Thread Cheolsoo Park (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3797?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13923054#comment-13923054 ] Cheolsoo Park commented on PIG-3797: Why not use

[jira] [Updated] (PIG-3754) InputSizeReducerEstimator.getTotalInputFileSize reports incorrect size

2014-03-06 Thread Aniket Mokashi (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3754?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Aniket Mokashi updated PIG-3754: Description: If you have more than one input, InputSizeReducerEstimator.getTotalInputFileSize can

[jira] [Updated] (PIG-3754) InputSizeReducerEstimator.getTotalInputFileSize reports incorrect size

2014-03-06 Thread Aniket Mokashi (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3754?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Aniket Mokashi updated PIG-3754: Status: Patch Available (was: Open) InputSizeReducerEstimator.getTotalInputFileSize reports

[jira] [Updated] (PIG-3754) InputSizeReducerEstimator.getTotalInputFileSize reports incorrect size

2014-03-06 Thread Aniket Mokashi (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3754?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Aniket Mokashi updated PIG-3754: Attachment: PIG-3754-1.patch InputSizeReducerEstimator.getTotalInputFileSize reports incorrect size

[jira] [Commented] (PIG-3765) Ability to disable Pig commands and operators

2014-03-06 Thread Cheolsoo Park (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3765?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13923129#comment-13923129 ] Cheolsoo Park commented on PIG-3765: I made some comments in RB. Ability to disable

[jira] [Commented] (PIG-3754) InputSizeReducerEstimator.getTotalInputFileSize reports incorrect size

2014-03-06 Thread Cheolsoo Park (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3754?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13923158#comment-13923158 ] Cheolsoo Park commented on PIG-3754: Looks good. Would you mind adding a test case for

[jira] [Commented] (PIG-3731) Ability to specify local-mode specific configuration (useful for local/auto-local mode)

2014-03-06 Thread Aniket Mokashi (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3731?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13923182#comment-13923182 ] Aniket Mokashi commented on PIG-3731: - Will do. Thanks for reviewing! Ability to

[jira] [Commented] (PIG-3731) Ability to specify local-mode specific configuration (useful for local/auto-local mode)

2014-03-06 Thread Cheolsoo Park (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3731?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13923177#comment-13923177 ] Cheolsoo Park commented on PIG-3731: +1. When you commit it, can you remove this line?

[jira] [Commented] (PIG-3745) Document auto local mode for pig

2014-03-06 Thread Cheolsoo Park (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3745?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13923185#comment-13923185 ] Cheolsoo Park commented on PIG-3745: Just a reminder. Let's include local-mode specific

[jira] [Commented] (PIG-3754) InputSizeReducerEstimator.getTotalInputFileSize reports incorrect size

2014-03-06 Thread Aniket Mokashi (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3754?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13923166#comment-13923166 ] Aniket Mokashi commented on PIG-3754: - Will do.

[jira] [Commented] (PIG-3731) Ability to specify local-mode specific configuration (useful for local/auto-local mode)

2014-03-06 Thread Aniket Mokashi (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3731?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13923206#comment-13923206 ] Aniket Mokashi commented on PIG-3731: - Committed to trunk. Thanks again [~cheolsoo] for

pig12 job stuck in infinite loop

2014-03-06 Thread Suhas Satish
Hi I launched the attached pig job on pig-12 with hadoop MRv1 with the attached data, but the FILTER function causes the job to get stuck in an infinite loop. pig -p pPeriod=201312 -f test.pig The thread in question seems to be stuck forever inside while loop of runPipeline method. stack trace:

Re: pig12 job stuck in infinite loop

2014-03-06 Thread Suhas Satish
This is the pig script - %default previousPeriod $pPeriod tWeek = LOAD '/tmp/test_data.txt' USING PigStorage ('|') AS (WEEK:int, DESCRIPTION:chararray, END_DATE:chararray, PERIOD:int); gTWeek = FOREACH tWeek GENERATE WEEK AS WEEK, PERIOD AS PERIOD; *pWeek = FILTER gTWeek BY PERIOD ==

[jira] Subscription: PIG patch available

2014-03-06 Thread jira
Issue Subscription Filter: PIG patch available (20 issues) Subscriber: pigdaily Key Summary PIG-3799TestCustomPartitioner is broken in tez branch https://issues.apache.org/jira/browse/PIG-3799 PIG-3797Fix some memory leaks affecting container reuse

[jira] [Updated] (PIG-3765) Ability to disable Pig commands and operators

2014-03-06 Thread Prashant Kommireddi (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3765?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Prashant Kommireddi updated PIG-3765: - Attachment: PIG-3765_4.patch Hi [~cheolsoo], I have uploaded a new patch to RB. Ability

[jira] [Commented] (PIG-3765) Ability to disable Pig commands and operators

2014-03-06 Thread Cheolsoo Park (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3765?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13923347#comment-13923347 ] Cheolsoo Park commented on PIG-3765: +1 assuming unit tests pass. Ability to disable

[jira] [Commented] (PIG-3765) Ability to disable Pig commands and operators

2014-03-06 Thread Cheolsoo Park (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3765?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13923348#comment-13923348 ] Cheolsoo Park commented on PIG-3765: Please open a jira to document this feature. I

[jira] [Comment Edited] (PIG-3799) TestCustomPartitioner is broken in tez branch

2014-03-06 Thread Cheolsoo Park (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3799?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13922937#comment-13922937 ] Cheolsoo Park edited comment on PIG-3799 at 3/7/14 1:10 AM: The

[jira] [Created] (PIG-3800) Documentation for Pig whitelist and blacklist features

2014-03-06 Thread Prashant Kommireddi (JIRA)
Prashant Kommireddi created PIG-3800: Summary: Documentation for Pig whitelist and blacklist features Key: PIG-3800 URL: https://issues.apache.org/jira/browse/PIG-3800 Project: Pig Issue

[jira] [Updated] (PIG-3800) Documentation for Pig whitelist and blacklist features

2014-03-06 Thread Cheolsoo Park (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3800?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheolsoo Park updated PIG-3800: --- Fix Version/s: 0.13.0 Setting FixVersion to 0.13 so we will do this before 0.13 release.

[jira] [Commented] (PIG-3754) InputSizeReducerEstimator.getTotalInputFileSize reports incorrect size

2014-03-06 Thread Julien Le Dem (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3754?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13923365#comment-13923365 ] Julien Le Dem commented on PIG-3754: LGTM too

Re: pig12 job stuck in infinite loop

2014-03-06 Thread Suhas Satish
I seem to be hitting this issue in pig-0.12 although it claims to be fixed in pig-0.12 https://issues.apache.org/jira/browse/PIG-3395 Large filter expression makes Pig hang Cheers, Suhas. On Thu, Mar 6, 2014 at 4:26 PM, Suhas Satish suhas.sat...@gmail.com wrote: This is the pig script -

Re: pig12 job stuck in infinite loop

2014-03-06 Thread Cheolsoo Park
Hi Suhas, No. The issue with PIG-3461 is that Pig hangs at the query compilation with a big filter expression before the job is submitted. In addition, the filter extractor was totally rewritten in 0.12. https://issues.apache.org/jira/browse/PIG-3461 Where exactly is your job hanging? Backend or

Re: pig12 job stuck in infinite loop

2014-03-06 Thread Suhas Satish
Hi Cheolsoo, This is where its hanging - *pWeek = FILTER gTWeek BY PERIOD == $previousPeriod;* org/apache/pig/backend/hadoop/executionengine/mapReduceLayer/ PigGenericMapBase.java: protected void *runPipeline*(PhysicalOperator leaf) throws IOException, InterruptedException { while(true){

Re: pig12 job stuck in infinite loop

2014-03-06 Thread Cheolsoo Park
So that's backend. It has nothing to do with the filter extractor. The filter extractor is for predicate push down on the frontend. The code that you're showing is the entry point where Pig mapper begins. So it doesn't tell us much. The mapper is given a segment of physical plan (pipeline), and

Re: pig12 job stuck in infinite loop

2014-03-06 Thread Suhas Satish
The example that reproduces the issue along with data is attached in the very first email on this thread On Thursday, March 6, 2014, Cheolsoo Park piaozhe...@gmail.com wrote: So that's backend. It has nothing to do with the filter extractor. The filter extractor is for predicate push down on

[jira] [Commented] (PIG-3793) Provide info on number of LogicalRelationalOperator(s) used in the script through LogicalPlanData

2014-03-06 Thread Kyungho Jeon (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3793?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13923575#comment-13923575 ] Kyungho Jeon commented on PIG-3793: --- Okay, I was confused. As {{PigServer}} is the only

[jira] [Updated] (PIG-3765) Ability to disable Pig commands and operators

2014-03-06 Thread Prashant Kommireddi (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3765?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Prashant Kommireddi updated PIG-3765: - Attachment: PIG-3765_5.patch Verified test-commit passes with the changes. However

[jira] [Commented] (PIG-3754) InputSizeReducerEstimator.getTotalInputFileSize reports incorrect size

2014-03-06 Thread Aniket Mokashi (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3754?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13923639#comment-13923639 ] Aniket Mokashi commented on PIG-3754: - Attached PIG-3754-2.patch with tests. Will commit

[jira] [Updated] (PIG-3754) InputSizeReducerEstimator.getTotalInputFileSize reports incorrect size

2014-03-06 Thread Aniket Mokashi (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3754?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Aniket Mokashi updated PIG-3754: Attachment: PIG-3754-2.patch InputSizeReducerEstimator.getTotalInputFileSize reports incorrect size

[jira] [Commented] (PIG-3754) InputSizeReducerEstimator.getTotalInputFileSize reports incorrect size

2014-03-06 Thread Aniket Mokashi (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3754?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13923641#comment-13923641 ] Aniket Mokashi commented on PIG-3754: - Committed to trunk. Thanks [~cheolsoo] and

[jira] [Updated] (PIG-3754) InputSizeReducerEstimator.getTotalInputFileSize reports incorrect size

2014-03-06 Thread Aniket Mokashi (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3754?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Aniket Mokashi updated PIG-3754: Resolution: Fixed Status: Resolved (was: Patch Available)

[jira] [Updated] (PIG-3731) Ability to specify local-mode specific configuration (useful for local/auto-local mode)

2014-03-06 Thread Aniket Mokashi (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3731?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Aniket Mokashi updated PIG-3731: Resolution: Fixed Fix Version/s: 0.13.0 Status: Resolved (was: Patch Available)

[jira] [Commented] (PIG-3765) Ability to disable Pig commands and operators

2014-03-06 Thread Prashant Kommireddi (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3765?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13923648#comment-13923648 ] Prashant Kommireddi commented on PIG-3765: -- Committed to trunk. Thanks [~cheolsoo]

[jira] [Updated] (PIG-3765) Ability to disable Pig commands and operators

2014-03-06 Thread Prashant Kommireddi (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3765?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Prashant Kommireddi updated PIG-3765: - Resolution: Fixed Status: Resolved (was: Patch Available) Ability to disable Pig

[jira] [Updated] (PIG-3765) Ability to disable Pig commands and operators

2014-03-06 Thread Prashant Kommireddi (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3765?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Prashant Kommireddi updated PIG-3765: - Fix Version/s: 0.13.0 Ability to disable Pig commands and operators