[jira] [Commented] (PIG-3047) Check the size of a relation before adding it to distributed cache in Replicated join

2012-12-03 Thread Prashant Kommireddi (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3047?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13508600#comment-13508600 ] Prashant Kommireddi commented on PIG-3047: -- Julien, Jon - what do you guys think

[jira] [Updated] (PIG-2645) PigSplit does not handle the case where SerializationFactory returns null

2012-12-03 Thread Shami B (JIRA)
[ https://issues.apache.org/jira/browse/PIG-2645?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shami B updated PIG-2645: - Attachment: PIG-2645.patch Please find the attached file containing the fix for the issue.

[jira] [Updated] (PIG-2645) PigSplit does not handle the case where SerializationFactory returns null

2012-12-03 Thread Shami B (JIRA)
[ https://issues.apache.org/jira/browse/PIG-2645?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shami B updated PIG-2645: - Labels: patch (was: ) Affects Version/s: 0.10.0 Release Note: Added a NULL check for

[jira] [Commented] (PIG-2134) ReadScalars message scalar has more than one row in the output does not provide enough information to help programmer find and fix script syntax error.

2012-12-03 Thread Peter Lubell-Doughtie (JIRA)
[ https://issues.apache.org/jira/browse/PIG-2134?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13508778#comment-13508778 ] Peter Lubell-Doughtie commented on PIG-2134: I also encountered the same issue,

Re: Review Request: PIG-3015 Rewrite of AvroStorage

2012-12-03 Thread Cheolsoo Park
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/8104/#review13962 --- Overall looks great! I haven't gone through the test cases yet, but

[jira] [Created] (PIG-3072) Pig job reporting negative progress

2012-12-03 Thread Koji Noguchi (JIRA)
Koji Noguchi created PIG-3072: - Summary: Pig job reporting negative progress Key: PIG-3072 URL: https://issues.apache.org/jira/browse/PIG-3072 Project: Pig Issue Type: Bug Components:

[jira] [Commented] (PIG-2653) Precompile option in PIG (Ability to store the plan for queries which are run multiple times)

2012-12-03 Thread JIRA
[ https://issues.apache.org/jira/browse/PIG-2653?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13509022#comment-13509022 ] Minh Lê commented on PIG-2653: -- Isn't it better to store the compiled jar file? Eventually,

[jira] [Updated] (PIG-3072) Pig job reporting negative progress

2012-12-03 Thread Koji Noguchi (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Koji Noguchi updated PIG-3072: -- Attachment: pig-3072-v01.txt 'pos' inside TFileRecordReader was never updated. Using fileIn.getPos()
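
A minimal sketch of how a position that is never updated could lead to negative progress. It assumes progress is computed as (pos - start) / (end - start), a common pattern in Hadoop record readers; the class and method names are hypothetical and are not taken from the attached patch.

```java
// Hypothetical illustration; this is not the actual TFileRecordReader code.
public class ProgressSketch {

    // Assumed formula: fraction of the split [start, end) consumed so far.
    static float progress(long pos, long start, long end) {
        if (start == end) {
            return 0.0f; // empty split: avoid division by zero
        }
        return Math.min(1.0f, (pos - start) / (float) (end - start));
    }

    public static void main(String[] args) {
        long start = 100L; // the split begins at byte 100 of the file
        long end = 200L;

        // A 'pos' field that is never updated stays at 0, which is below 'start',
        // so the computed fraction is negative.
        long stalePos = 0L;
        System.out.println(progress(stalePos, start, end));

        // Reading the current position from the stream on each call keeps the
        // value inside [start, end], so the fraction stays between 0 and 1.
        long livePos = 150L;
        System.out.println(progress(livePos, start, end));
    }
}
```

Under this assumption, only splits that begin at a non-zero offset would report negative values; a split starting at byte 0 would report zero progress throughout.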

[jira] [Commented] (PIG-2645) PigSplit does not handle the case where SerializationFactory returns null

2012-12-03 Thread Alex Levenson (JIRA)
[ https://issues.apache.org/jira/browse/PIG-2645?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13509101#comment-13509101 ] Alex Levenson commented on PIG-2645: Style nitpicks: Don't use a javadoc, use a regular

Build failed in Jenkins: Pig-trunk #1370

2012-12-03 Thread Apache Jenkins Server
See https://builds.apache.org/job/Pig-trunk/1370/changes Changes: [cheolsoo] PIG-3058: Upgrade junit to at least 4.8 (fang fang chen via cheolsoo) -- [...truncated 6644 lines...] [findbugs] jline.History [findbugs]

[jira] [Created] (PIG-3073) POUserFunc creating log spam for large scripts

2012-12-03 Thread Jonathan Coveney (JIRA)
Jonathan Coveney created PIG-3073: - Summary: POUserFunc creating log spam for large scripts Key: PIG-3073 URL: https://issues.apache.org/jira/browse/PIG-3073 Project: Pig Issue Type: Bug

[jira] [Updated] (PIG-3073) POUserFunc creating log spam for large scripts

2012-12-03 Thread Jonathan Coveney (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Coveney updated PIG-3073: -- Attachment: PIG-3073-0.patch Here is a fix for it... I'm open to other opinions on how to fix

[jira] [Updated] (PIG-3073) POUserFunc creating log spam for large scripts

2012-12-03 Thread Jonathan Coveney (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Coveney updated PIG-3073: -- Affects Version/s: 0.12 Status: Patch Available (was: Open) POUserFunc

Re: Review Request: PIG-3015 Rewrite of AvroStorage

2012-12-03 Thread Joseph Adler
On Dec. 3, 2012, 7:22 p.m., Cheolsoo Park wrote: src/org/apache/pig/builtin/AvroStorage.java, lines 171-172 https://reviews.apache.org/r/8104/diff/1/?file=191564#file191564line171 Same problem as above. Fixing this one within getAvroSchema On Dec. 3, 2012, 7:22 p.m., Cheolsoo

[jira] [Commented] (PIG-3015) Rewrite of AvroStorage

2012-12-03 Thread Joseph Adler (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3015?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13509296#comment-13509296 ] Joseph Adler commented on PIG-3015: --- I made most of the recommended changes (thanks for

[jira] [Updated] (PIG-3072) Pig job reporting negative progress

2012-12-03 Thread Rohini Palaniswamy (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rohini Palaniswamy updated PIG-3072: Affects Version/s: 0.10.0 Fix Version/s: 0.12 Just a minor comment. Can we create

[jira] [Commented] (PIG-2614) AvroStorage crashes on LOADING a single bad error

2012-12-03 Thread Joseph Adler (JIRA)
[ https://issues.apache.org/jira/browse/PIG-2614?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13509325#comment-13509325 ] Joseph Adler commented on PIG-2614: --- Could I propose an alternative? I like this

[jira] [Updated] (PIG-3072) Pig job reporting negative progress

2012-12-03 Thread Koji Noguchi (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Koji Noguchi updated PIG-3072: -- Attachment: pig-3072-v02.txt bq. Can we create the basic.tfile using Util.createTempFileDelOnExit().

[jira] [Updated] (PIG-3072) Pig job reporting negative progress

2012-12-03 Thread Koji Noguchi (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Koji Noguchi updated PIG-3072: -- Status: Patch Available (was: Open) Pig job reporting negative progress

[jira] [Commented] (PIG-2614) AvroStorage crashes on LOADING a single bad error

2012-12-03 Thread Jonathan Coveney (JIRA)
[ https://issues.apache.org/jira/browse/PIG-2614?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13509342#comment-13509342 ] Jonathan Coveney commented on PIG-2614: --- I imagine we could make this work for any

[jira] [Commented] (PIG-3072) Pig job reporting negative progress

2012-12-03 Thread Rohini Palaniswamy (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3072?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13509345#comment-13509345 ] Rohini Palaniswamy commented on PIG-3072: - +1. Will commit soon.

[jira] [Created] (PIG-3074) Should check that any UDF returns a Schema of only one Field

2012-12-03 Thread Jonathan Coveney (JIRA)
Jonathan Coveney created PIG-3074: - Summary: Should check that any UDF returns a Schema of only one Field Key: PIG-3074 URL: https://issues.apache.org/jira/browse/PIG-3074 Project: Pig

[jira] Subscription: PIG patch available

2012-12-03 Thread jira
Issue Subscription Filter: PIG patch available (33 issues) Subscriber: pigdaily Key Summary PIG-3073 POUserFunc creating log spam for large scripts https://issues.apache.org/jira/browse/PIG-3073 PIG-3072 Pig job reporting negative progress

[jira] [Updated] (PIG-3015) Rewrite of AvroStorage

2012-12-03 Thread Joseph Adler (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3015?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph Adler updated PIG-3015: -- Status: Open (was: Patch Available) replacing with revised patch Rewrite of

[jira] [Updated] (PIG-3015) Rewrite of AvroStorage

2012-12-03 Thread Joseph Adler (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3015?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph Adler updated PIG-3015: -- Attachment: (was: PIG-3015.patch) Rewrite of AvroStorage --

[jira] [Updated] (PIG-3015) Rewrite of AvroStorage

2012-12-03 Thread Joseph Adler (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3015?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph Adler updated PIG-3015: -- Status: Patch Available (was: Open) Revised patch; reflects comments and suggestions from review board

[jira] [Updated] (PIG-3015) Rewrite of AvroStorage

2012-12-03 Thread Joseph Adler (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3015?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph Adler updated PIG-3015: -- Attachment: PIG-3015.patch Revised patch (compiles together all changes) Rewrite of

Re: PIG Reference Project for Learning Purposes

2012-12-03 Thread Jonathan Coveney
Hey Adarsh! I would recommend going through the open but uncompleted JIRAs for Pig. If one looks like something you can tackle, we're all willing to help out. Jon 2012/12/2 Adarsh Sastry adarsh.sas...@yahoo.com Hello: I am new to Hadoop. I had gone through and practiced various topics and

[jira] [Created] (PIG-3075) Allow AvroStorage STORE Operations To Use Schema Specified By URI

2012-12-03 Thread Nick White (JIRA)
Nick White created PIG-3075: --- Summary: Allow AvroStorage STORE Operations To Use Schema Specified By URI Key: PIG-3075 URL: https://issues.apache.org/jira/browse/PIG-3075 Project: Pig Issue Type:

[jira] [Commented] (PIG-3047) Check the size of a relation before adding it to distributed cache in Replicated join

2012-12-03 Thread Jonathan Coveney (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3047?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13509411#comment-13509411 ] Jonathan Coveney commented on PIG-3047: --- Prashant: that sounds good to me. Just make

[jira] [Updated] (PIG-3075) Allow AvroStorage STORE Operations To Use Schema Specified By URI

2012-12-03 Thread Nick White (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3075?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nick White updated PIG-3075: Attachment: PIG-3075.patch Patch generated by: git diff --cached --binary > PIG-3075.patch

[jira] [Updated] (PIG-3075) Allow AvroStorage STORE Operations To Use Schema Specified By URI

2012-12-03 Thread Nick White (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3075?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nick White updated PIG-3075: Fix Version/s: 0.12 Affects Version/s: 0.10.0 Status: Patch Available (was: Open)

[jira] [Updated] (PIG-3072) Pig job reporting negative progress

2012-12-03 Thread Koji Noguchi (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Koji Noguchi updated PIG-3072: -- Attachment: pig-3072-v03.txt (Minor changes to previous v02 patch.) * Added comment in

Re: Reducer estimation

2012-12-03 Thread Bill Graham
We do basically what you're describing. Each of our scripts has a logical name which defines the workflow. For each job in the workflow we persist the job stats, counters and conf in HBase via an implementation of PigProgressNotificationListener. We can then correlate jobs in a run of the workflow

Re: Reducer estimation

2012-12-03 Thread Prashant Kommireddi
Awesome! It would be good to have a flat-file based impl, as there will probably be a lot of Pig users who do not have an HBase instance set up for stats persistence. Let me know if I can help in any way. Is there a timeframe you are looking at for open-sourcing this? On Dec 4, 2012, at 12:32 PM, Bill

[jira] [Updated] (PIG-3033) test-patch failed with javadoc warnings

2012-12-03 Thread fang fang chen (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3033?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] fang fang chen updated PIG-3033: Status: Patch Available (was: Open) test-patch failed with javadoc warnings

[jira] [Updated] (PIG-3033) test-patch failed with javadoc warnings

2012-12-03 Thread fang fang chen (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3033?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] fang fang chen updated PIG-3033: Status: Open (was: Patch Available) test-patch failed with javadoc warnings

[jira] [Updated] (PIG-3033) test-patch failed with javadoc warnings

2012-12-03 Thread fang fang chen (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3033?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] fang fang chen updated PIG-3033: Attachment: PIG-3033-trunk-2.patch test-patch failed with javadoc warnings

[jira] [Updated] (PIG-3033) test-patch failed with javadoc warnings

2012-12-03 Thread fang fang chen (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3033?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] fang fang chen updated PIG-3033: Attachment: (was: PIG-3033-trunk.patch) test-patch failed with javadoc warnings

[jira] [Commented] (PIG-3033) test-patch failed with javadoc warnings

2012-12-03 Thread fang fang chen (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3033?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13509559#comment-13509559 ] fang fang chen commented on PIG-3033: - Hi Cheolsoo, Updated the patch per our