[jira] [Updated] (PIG-483) PERFORMANCE: different strategies for large and small order bys

2012-09-05 Thread Dmitriy V. Ryaboy (JIRA)
[ https://issues.apache.org/jira/browse/PIG-483?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dmitriy V. Ryaboy updated PIG-483: -- Assignee: Jie Li Status: Patch Available (was: Open) PERFORMANCE: different

[jira] [Commented] (PIG-483) PERFORMANCE: different strategies for large and small order bys

2012-09-05 Thread Dmitriy V. Ryaboy (JIRA)
[ https://issues.apache.org/jira/browse/PIG-483?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13448536#comment-13448536 ] Dmitriy V. Ryaboy commented on PIG-483: --- I think Bill's suggestion is cleaner, but in

[jira] [Updated] (PIG-2853) PiggyBank XMLLoaderBufferedPositionedInputStream incorrectly inherits from BufferedPositionedInputStream and so uses unbuffered io reading byte-by-byte

2012-09-05 Thread Vitalii Tymchyshyn (JIRA)
[ https://issues.apache.org/jira/browse/PIG-2853?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vitalii Tymchyshyn updated PIG-2853: Affects Version/s: 0.9.1 PiggyBank XMLLoaderBufferedPositionedInputStream incorrectly

Build failed in Jenkins: Pig-trunk #1311

2012-09-05 Thread Apache Jenkins Server
See https://builds.apache.org/job/Pig-trunk/1311/changes Changes: [billgraham] PIG-2905: Improve documentation around REPLACE (cheolsoo via billgraham) -- [...truncated 6860 lines...] [findbugs] com.google.common.util.concurrent.CheckedFuture

[jira] [Updated] (PIG-717) to make docs target portable b/w windows and unix flavours

2012-09-05 Thread Bill Graham (JIRA)
[ https://issues.apache.org/jira/browse/PIG-717?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bill Graham updated PIG-717: Status: Open (was: Patch Available) This patch seems to have been overlooked for quite some time. It no

[jira] [Commented] (PIG-2846) Can we skip hcat related e2e when hcat is not installed?

2012-09-05 Thread Koji Noguchi (JIRA)
[ https://issues.apache.org/jira/browse/PIG-2846?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13449039#comment-13449039 ] Koji Noguchi commented on PIG-2846: --- Thanks Alan for taking a look at my patch! bq. I

[jira] [Commented] (PIG-2712) Pig does not call OutputCommitter.abortJob() on the underlying OutputFormat

2012-09-05 Thread Francis Liu (JIRA)
[ https://issues.apache.org/jira/browse/PIG-2712?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13449148#comment-13449148 ] Francis Liu commented on PIG-2712: -- I'm going to take a crack at this patch.

[jira] [Created] (PIG-2907) Publish pig 0.23 jars to maven

2012-09-05 Thread Francis Liu (JIRA)
Francis Liu created PIG-2907: Summary: Publish pig 0.23 jars to maven Key: PIG-2907 URL: https://issues.apache.org/jira/browse/PIG-2907 Project: Pig Issue Type: New Feature Reporter:

[jira] [Commented] (PIG-2712) Pig does not call OutputCommitter.abortJob() on the underlying OutputFormat

2012-09-05 Thread Francis Liu (JIRA)
[ https://issues.apache.org/jira/browse/PIG-2712?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13449173#comment-13449173 ] Francis Liu commented on PIG-2712: -- Just spoke with Rohini offline, she has a patch for

[jira] [Created] (PIG-2908) Fix unit tests to work with jdk7

2012-09-05 Thread Rohini Palaniswamy (JIRA)
Rohini Palaniswamy created PIG-2908: --- Summary: Fix unit tests to work with jdk7 Key: PIG-2908 URL: https://issues.apache.org/jira/browse/PIG-2908 Project: Pig Issue Type: Test

Modifying databag on the fly

2012-09-05 Thread Prasanth J
Hello devs I have specific case where I need to modify the contents (remove a field from each tuples) of Databag but I want to do it in-place and do not want to create another databag with new set of tuples. The situation is, say I have the following input tuple for an UDF {(111,222,3,121),

Re: Modifying databag on the fly

2012-09-05 Thread Alan Gates
You cannot modify a bag once it is written. The implementation is written around the assumption that bags are immutable after they are written. Creating a new bag should not create an OOM exception, as bags are built to spill when they grow too large. In fact it's this spilling feature that

Re: Modifying databag on the fly

2012-09-05 Thread Prasanth J
Ahh.. Now it makes more sense. I think I got the solution. I was adding to ListTuple and then finally creating a DataBag with that list.. Instead I should create a bag and keep adding to it..!! Is that correct? Thanks Alan. Thanks -- Prasanth On Sep 5, 2012, at 9:24 PM, Alan Gates

Re: Modifying databag on the fly

2012-09-05 Thread Alan Gates
On Sep 5, 2012, at 6:30 PM, Prasanth J wrote: Ahh.. Now it makes more sense. I think I got the solution. I was adding to ListTuple and then finally creating a DataBag with that list.. Instead I should create a bag and keep adding to it..!! Is that correct? Yes. Alan. Thanks Alan.

[jira] [Commented] (PIG-2831) MR-Cube implementation (Distributed cubing for holistic measures)

2012-09-05 Thread Prasanth J (JIRA)
[ https://issues.apache.org/jira/browse/PIG-2831?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13449413#comment-13449413 ] Prasanth J commented on PIG-2831: - Attaching a patch that fixed GC errors in PostProcessCube

[jira] [Commented] (PIG-2829) Use partial aggregation more aggresively

2012-09-05 Thread Jie Li (JIRA)
[ https://issues.apache.org/jira/browse/PIG-2829?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13449424#comment-13449424 ] Jie Li commented on PIG-2829: - No problem Dmitriy. I'll see if I can find some time over the

[jira] [Created] (PIG-2909) Add a new option for ignoring corrupted files to AvroStorage load func

2012-09-05 Thread Cheolsoo Park (JIRA)
Cheolsoo Park created PIG-2909: -- Summary: Add a new option for ignoring corrupted files to AvroStorage load func Key: PIG-2909 URL: https://issues.apache.org/jira/browse/PIG-2909 Project: Pig

[jira] [Updated] (PIG-2909) Add a new option for ignoring corrupted files to AvroStorage load func

2012-09-05 Thread Cheolsoo Park (JIRA)
[ https://issues.apache.org/jira/browse/PIG-2909?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheolsoo Park updated PIG-2909: --- Attachment: PIG-2909.patch PIG-2909-avro_test_files.tar.gz Attached is a patch that

[jira] [Updated] (PIG-2909) Add a new option for ignoring corrupted files to AvroStorage load func

2012-09-05 Thread Cheolsoo Park (JIRA)
[ https://issues.apache.org/jira/browse/PIG-2909?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheolsoo Park updated PIG-2909: --- Status: Patch Available (was: Open) Add a new option for ignoring corrupted files to AvroStorage

[jira] [Commented] (PIG-2909) Add a new option for ignoring corrupted files to AvroStorage load func

2012-09-05 Thread Cheolsoo Park (JIRA)
[ https://issues.apache.org/jira/browse/PIG-2909?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13449437#comment-13449437 ] Cheolsoo Park commented on PIG-2909: Review board: https://reviews.apache.org/r/6940/