[jira] [Updated] (PIG-2712) Pig does not call OutputCommitter.abortJob() on the underlying OutputFormat

2012-09-05 Thread Rohini Palaniswamy (JIRA)
[ https://issues.apache.org/jira/browse/PIG-2712?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rohini Palaniswamy updated PIG-2712: Fix Version/s: 0.10.1 0.11 Assignee: Rohini Palaniswamy > Pig

[jira] [Updated] (PIG-2712) Pig does not call OutputCommitter.abortJob() on the underlying OutputFormat

2012-09-05 Thread Rohini Palaniswamy (JIRA)
[ https://issues.apache.org/jira/browse/PIG-2712?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rohini Palaniswamy updated PIG-2712: Status: Patch Available (was: Open) > Pig does not call OutputCommitter.abortJob() on th

[jira] [Updated] (PIG-2712) Pig does not call OutputCommitter.abortJob() on the underlying OutputFormat

2012-09-05 Thread Rohini Palaniswamy (JIRA)
[ https://issues.apache.org/jira/browse/PIG-2712?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rohini Palaniswamy updated PIG-2712: Attachment: PIG-2712-1-trunk.patch PIG-2712-1-branch10.patch Tested with 20.2

[jira] [Updated] (PIG-2909) Add a new option for ignoring corrupted files to AvroStorage load func

2012-09-05 Thread Cheolsoo Park (JIRA)
[ https://issues.apache.org/jira/browse/PIG-2909?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheolsoo Park updated PIG-2909: --- Status: Patch Available (was: Open) > Add a new option for ignoring corrupted files to AvroStorage

[jira] [Commented] (PIG-2909) Add a new option for ignoring corrupted files to AvroStorage load func

2012-09-05 Thread Cheolsoo Park (JIRA)
[ https://issues.apache.org/jira/browse/PIG-2909?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13449437#comment-13449437 ] Cheolsoo Park commented on PIG-2909: Review board: https://reviews.apache.org/r/6940/

[jira] [Updated] (PIG-2909) Add a new option for ignoring corrupted files to AvroStorage load func

2012-09-05 Thread Cheolsoo Park (JIRA)
[ https://issues.apache.org/jira/browse/PIG-2909?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheolsoo Park updated PIG-2909: --- Attachment: PIG-2909.patch PIG-2909-avro_test_files.tar.gz Attached is a patch that add

[jira] [Created] (PIG-2909) Add a new option for ignoring corrupted files to AvroStorage load func

2012-09-05 Thread Cheolsoo Park (JIRA)
Cheolsoo Park created PIG-2909: -- Summary: Add a new option for ignoring corrupted files to AvroStorage load func Key: PIG-2909 URL: https://issues.apache.org/jira/browse/PIG-2909 Project: Pig I

[jira] [Commented] (PIG-2829) Use partial aggregation more aggresively

2012-09-05 Thread Jie Li (JIRA)
[ https://issues.apache.org/jira/browse/PIG-2829?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13449424#comment-13449424 ] Jie Li commented on PIG-2829: - No problem Dmitriy. I'll see if I can find some time over the wee

Re: Review Request: PIG-2831: MR-Cube implementation (Distributed cubing for holistic measures)

2012-09-05 Thread j . prasanth . j
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/6651/ --- (Updated Sept. 6, 2012, 4:51 a.m.) Review request for pig and Dmitriy Ryaboy.

[jira] [Updated] (PIG-2831) MR-Cube implementation (Distributed cubing for holistic measures)

2012-09-05 Thread Prasanth J (JIRA)
[ https://issues.apache.org/jira/browse/PIG-2831?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Prasanth J updated PIG-2831: Attachment: PIG-2831.6.git.patch > MR-Cube implementation (Distributed cubing for holistic measures) > --

[jira] [Commented] (PIG-2831) MR-Cube implementation (Distributed cubing for holistic measures)

2012-09-05 Thread Prasanth J (JIRA)
[ https://issues.apache.org/jira/browse/PIG-2831?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13449413#comment-13449413 ] Prasanth J commented on PIG-2831: - Attaching a patch that fixed GC errors in PostProcessCube

Re: Modifying databag on the fly

2012-09-05 Thread Alan Gates
On Sep 5, 2012, at 6:30 PM, Prasanth J wrote: > Ahh.. Now it makes more sense. > > I think I got the solution. I was adding to List and then finally > creating a DataBag with that list.. Instead I should create a bag and keep > adding to it..!! Is that correct? Yes. Alan. > Thanks Alan. >

Re: Modifying databag on the fly

2012-09-05 Thread Prasanth J
Ahh.. Now it makes more sense. I think I got the solution. I was adding to List and then finally creating a DataBag with that list.. Instead I should create a bag and keep adding to it..!! Is that correct? Thanks Alan. Thanks -- Prasanth On Sep 5, 2012, at 9:24 PM, Alan Gates wrote: > You

Re: Modifying databag on the fly

2012-09-05 Thread Alan Gates
You cannot modify a bag once it is written. The implementation is written around the assumption that bags are immutable after they are written. Creating a new bag should not create an OOM exception, as bags are built to spill when they grow too large. In fact it's this spilling feature that

Modifying databag on the fly

2012-09-05 Thread Prasanth J
Hello devs I have specific case where I need to modify the contents (remove a field from each tuples) of Databag but I want to do it in-place and do not want to create another databag with new set of tuples. The situation is, say I have the following input tuple for an UDF {(111,222,3,121), (1

[jira] [Created] (PIG-2908) Fix unit tests to work with jdk7

2012-09-05 Thread Rohini Palaniswamy (JIRA)
Rohini Palaniswamy created PIG-2908: --- Summary: Fix unit tests to work with jdk7 Key: PIG-2908 URL: https://issues.apache.org/jira/browse/PIG-2908 Project: Pig Issue Type: Test R

[jira] [Commented] (PIG-2712) Pig does not call OutputCommitter.abortJob() on the underlying OutputFormat

2012-09-05 Thread Francis Liu (JIRA)
[ https://issues.apache.org/jira/browse/PIG-2712?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13449173#comment-13449173 ] Francis Liu commented on PIG-2712: -- Just spoke with Rohini offline, she has a patch for thi

[jira] [Created] (PIG-2907) Publish pig 0.23 jars to maven

2012-09-05 Thread Francis Liu (JIRA)
Francis Liu created PIG-2907: Summary: Publish pig 0.23 jars to maven Key: PIG-2907 URL: https://issues.apache.org/jira/browse/PIG-2907 Project: Pig Issue Type: New Feature Reporter:

[jira] [Commented] (PIG-2712) Pig does not call OutputCommitter.abortJob() on the underlying OutputFormat

2012-09-05 Thread Francis Liu (JIRA)
[ https://issues.apache.org/jira/browse/PIG-2712?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13449148#comment-13449148 ] Francis Liu commented on PIG-2712: -- I'm going to take a crack at this patch.

[jira] [Commented] (PIG-2846) Can we skip hcat related e2e when hcat is not installed?

2012-09-05 Thread Koji Noguchi (JIRA)
[ https://issues.apache.org/jira/browse/PIG-2846?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13449039#comment-13449039 ] Koji Noguchi commented on PIG-2846: --- Thanks Alan for taking a look at my patch! bq. I tr

[jira] [Updated] (PIG-717) to make docs target portable b/w windows and unix flavours

2012-09-05 Thread Bill Graham (JIRA)
[ https://issues.apache.org/jira/browse/PIG-717?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bill Graham updated PIG-717: Status: Open (was: Patch Available) This patch seems to have been overlooked for quite some time. It no longe

Build failed in Jenkins: Pig-trunk #1311

2012-09-05 Thread Apache Jenkins Server
See Changes: [billgraham] PIG-2905: Improve documentation around REPLACE (cheolsoo via billgraham) -- [...truncated 6860 lines...] [findbugs] com.google.common.util.concurrent.CheckedFuture [findb

[jira] [Updated] (PIG-2853) PiggyBank XMLLoaderBufferedPositionedInputStream incorrectly inherits from BufferedPositionedInputStream and so uses unbuffered io reading byte-by-byte

2012-09-05 Thread Vitalii Tymchyshyn (JIRA)
[ https://issues.apache.org/jira/browse/PIG-2853?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vitalii Tymchyshyn updated PIG-2853: Affects Version/s: 0.9.1 > PiggyBank XMLLoaderBufferedPositionedInputStream incorrectly i

[jira] [Commented] (PIG-483) PERFORMANCE: different strategies for large and small order bys

2012-09-05 Thread Dmitriy V. Ryaboy (JIRA)
[ https://issues.apache.org/jira/browse/PIG-483?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13448536#comment-13448536 ] Dmitriy V. Ryaboy commented on PIG-483: --- I think Bill's suggestion is cleaner, but in t

[jira] [Updated] (PIG-483) PERFORMANCE: different strategies for large and small order bys

2012-09-05 Thread Dmitriy V. Ryaboy (JIRA)
[ https://issues.apache.org/jira/browse/PIG-483?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dmitriy V. Ryaboy updated PIG-483: -- Assignee: Jie Li Status: Patch Available (was: Open) > PERFORMANCE: different strategie

[jira] [Commented] (PIG-2829) Use partial aggregation more aggresively

2012-09-05 Thread Dmitriy V. Ryaboy (JIRA)
[ https://issues.apache.org/jira/browse/PIG-2829?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13448528#comment-13448528 ] Dmitriy V. Ryaboy commented on PIG-2829: Jie, sorry I missed this ticket before. As