[jira] Commented: (PIG-872) use distributed cache for the replicated data set in FR join

2009-11-23 Thread Hadoop QA (JIRA)
[ https://issues.apache.org/jira/browse/PIG-872?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12781415#action_12781415 ] Hadoop QA commented on PIG-872: --- -1 overall. Here are the results of testing the latest attachm

[jira] Updated: (PIG-872) use distributed cache for the replicated data set in FR join

2009-11-23 Thread Olga Natkovich (JIRA)
[ https://issues.apache.org/jira/browse/PIG-872?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Olga Natkovich updated PIG-872: --- Status: Open (was: Patch Available) > use distributed cache for the replicated data set in FR join > --

[jira] Updated: (PIG-872) use distributed cache for the replicated data set in FR join

2009-11-23 Thread Olga Natkovich (JIRA)
[ https://issues.apache.org/jira/browse/PIG-872?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Olga Natkovich updated PIG-872: --- Status: Patch Available (was: Open) resubmitting the patch. looks like we had problems running tests >

[jira] Updated: (PIG-1091) [zebra] Exception when load with projection of map keys on a map column that is not map split

2009-11-23 Thread Yan Zhou (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1091?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yan Zhou updated PIG-1091: -- Fix Version/s: 0.6.0 > [zebra] Exception when load with projection of map keys on a map column that > is not map

[jira] Resolved: (PIG-524) ORDER (x,y) gives syntax error

2009-11-23 Thread Olga Natkovich (JIRA)
[ https://issues.apache.org/jira/browse/PIG-524?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Olga Natkovich resolved PIG-524. Resolution: Duplicate This is a duplicate of PIG-900 > ORDER (x,y) gives syntax error > --

[jira] Resolved: (PIG-807) PERFORMANCE: Provide a way for UDFs to use read-once bags (backed by the Hadoop values iterator)

2009-11-23 Thread Olga Natkovich (JIRA)
[ https://issues.apache.org/jira/browse/PIG-807?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Olga Natkovich resolved PIG-807. Resolution: Won't Fix accumulator interface has been introduced for UDFs to solve this issue > PERFOR

[jira] Updated: (PIG-1088) change merge join and merge join indexer to work with new LoadFunc interface

2009-11-23 Thread Thejas M Nair (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1088?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Thejas M Nair updated PIG-1088: --- Attachment: PIG-1088.1.patch Changes address Pradeep's comments. All mergejoin test cases pass. Also r

[jira] Resolved: (PIG-843) PERFORMANCE: improvements in memory management

2009-11-23 Thread Olga Natkovich (JIRA)
[ https://issues.apache.org/jira/browse/PIG-843?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Olga Natkovich resolved PIG-843. Resolution: Fixed I believe memory issue has been sufficiently addressed. > PERFORMANCE: improvement

[jira] Updated: (PIG-1078) [zebra] merge join with empty table failed

2009-11-23 Thread Yan Zhou (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1078?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yan Zhou updated PIG-1078: -- Fix Version/s: 0.7.0 > [zebra] merge join with empty table failed > -- >

[jira] Updated: (PIG-1074) Zebra store function should allow '::' in column names in output schema

2009-11-23 Thread Yan Zhou (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1074?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yan Zhou updated PIG-1074: -- Fix Version/s: 0.7.0 0.6.0 > Zebra store function should allow '::' in column names in output

[jira] Updated: (PIG-1098) [zebra] Zebra Performance Optimizations

2009-11-23 Thread Yan Zhou (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1098?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yan Zhou updated PIG-1098: -- Fix Version/s: 0.7.0 0.6.0 > [zebra] Zebra Performance Optimizations > ---

[jira] Updated: (PIG-1095) [zebra] Schema support of anonymous fields in COLECTION fails

2009-11-23 Thread Yan Zhou (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1095?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yan Zhou updated PIG-1095: -- Fix Version/s: 0.7.0 0.6.0 > [zebra] Schema support of anonymous fields in COLECTION fails > -

Re: TPC-H benchmark

2009-11-23 Thread Alan Gates
I don't know of any. Officially Pig cannot publish a TPC-H number because it is not a transaction based store. But I still think it would be very interesting to see the results if someone took the time to translate the queries. Alan. On Nov 22, 2009, at 6:20 PM, RichardGUO Fei wrote:

[jira] Resolved: (PIG-844) PERFORMANCE: streaming data to the UDFs in foreach

2009-11-23 Thread Olga Natkovich (JIRA)
[ https://issues.apache.org/jira/browse/PIG-844?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Olga Natkovich resolved PIG-844. accumulate interface took care of this. > PERFORMANCE: streaming data to the UDFs in foreach > --

[jira] Resolved: (PIG-856) PERFORMANCE: reduce number of replicas

2009-11-23 Thread Olga Natkovich (JIRA)
[ https://issues.apache.org/jira/browse/PIG-856?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Olga Natkovich resolved PIG-856. Resolution: Won't Fix We tried reducing the number of replicas and the performance actually degraded

[jira] Updated: (PIG-598) Parameter substitution ($PARAMETER) should not be performed in comments

2009-11-23 Thread Thejas M Nair (JIRA)
[ https://issues.apache.org/jira/browse/PIG-598?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Thejas M Nair updated PIG-598: -- Status: Open (was: Patch Available) > Parameter substitution ($PARAMETER) should not be performed in comm

[jira] Updated: (PIG-598) Parameter substitution ($PARAMETER) should not be performed in comments

2009-11-23 Thread Thejas M Nair (JIRA)
[ https://issues.apache.org/jira/browse/PIG-598?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Thejas M Nair updated PIG-598: -- Patch Info: (was: [Patch Available]) > Parameter substitution ($PARAMETER) should not be performed in co

[jira] Commented: (PIG-598) Parameter substitution ($PARAMETER) should not be performed in comments

2009-11-23 Thread Thejas M Nair (JIRA)
[ https://issues.apache.org/jira/browse/PIG-598?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12781510#action_12781510 ] Thejas M Nair commented on PIG-598: --- bq. One issue I faced while working on PIG-928 was when

[jira] Reopened: (PIG-1090) Update sources to reflect recent changes in load-store interfaces

2009-11-23 Thread Pradeep Kamath (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1090?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pradeep Kamath reopened PIG-1090: - Reopening since we need to implement LoadMetadata interface in BinStorage so as to implement the getSc

[jira] Updated: (PIG-598) Parameter substitution ($PARAMETER) should not be performed in comments

2009-11-23 Thread Thejas M Nair (JIRA)
[ https://issues.apache.org/jira/browse/PIG-598?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Thejas M Nair updated PIG-598: -- Attachment: PIG-598.1.patch Additional changes in this patch- * Fixed parsing in PigFileParser.jj * Modi

[jira] Updated: (PIG-598) Parameter substitution ($PARAMETER) should not be performed in comments

2009-11-23 Thread Thejas M Nair (JIRA)
[ https://issues.apache.org/jira/browse/PIG-598?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Thejas M Nair updated PIG-598: -- Status: Patch Available (was: Open) > Parameter substitution ($PARAMETER) should not be performed in comm

[jira] Commented: (PIG-598) Parameter substitution ($PARAMETER) should not be performed in comments

2009-11-23 Thread Ashutosh Chauhan (JIRA)
[ https://issues.apache.org/jira/browse/PIG-598?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12781546#action_12781546 ] Ashutosh Chauhan commented on PIG-598: -- I guess my question is what should be the behavio

[jira] Commented: (PIG-1091) [zebra] Exception when load with projection of map keys on a map column that is not map split

2009-11-23 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1091?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12781551#action_12781551 ] Alan Gates commented on PIG-1091: - Patch applied to 0.6 branch. > [zebra] Exception when lo

[jira] Commented: (PIG-872) use distributed cache for the replicated data set in FR join

2009-11-23 Thread Hadoop QA (JIRA)
[ https://issues.apache.org/jira/browse/PIG-872?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12781560#action_12781560 ] Hadoop QA commented on PIG-872: --- -1 overall. Here are the results of testing the latest attachm

[jira] Commented: (PIG-598) Parameter substitution ($PARAMETER) should not be performed in comments

2009-11-23 Thread Thejas M Nair (JIRA)
[ https://issues.apache.org/jira/browse/PIG-598?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12781562#action_12781562 ] Thejas M Nair commented on PIG-598: --- I prefer option a. It is just a matter of putting a "\"

[jira] Assigned: (PIG-1078) [zebra] merge join with empty table failed

2009-11-23 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1078?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates reassigned PIG-1078: --- Assignee: Yan Zhou > [zebra] merge join with empty table failed > -

[jira] Commented: (PIG-598) Parameter substitution ($PARAMETER) should not be performed in comments

2009-11-23 Thread Ashutosh Chauhan (JIRA)
[ https://issues.apache.org/jira/browse/PIG-598?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12781602#action_12781602 ] Ashutosh Chauhan commented on PIG-598: -- bq. compared to the cost of time spending debugg

[jira] Commented: (PIG-1088) change merge join and merge join indexer to work with new LoadFunc interface

2009-11-23 Thread Pradeep Kamath (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1088?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12781608#action_12781608 ] Pradeep Kamath commented on PIG-1088: - The patch did not include tests since there were e

[jira] Updated: (PIG-1088) change merge join and merge join indexer to work with new LoadFunc interface

2009-11-23 Thread Pradeep Kamath (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1088?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pradeep Kamath updated PIG-1088: Resolution: Fixed Hadoop Flags: [Incompatible change, Reviewed] Status: Resolved (was

[jira] Created: (PIG-1102) Collect number of spills per job

2009-11-23 Thread Olga Natkovich (JIRA)
Collect number of spills per job Key: PIG-1102 URL: https://issues.apache.org/jira/browse/PIG-1102 Project: Pig Issue Type: Improvement Reporter: Olga Natkovich Assignee: Sriranjan Man

[jira] Created: (PIG-1103) refactor test-commit

2009-11-23 Thread Olga Natkovich (JIRA)
refactor test-commit Key: PIG-1103 URL: https://issues.apache.org/jira/browse/PIG-1103 Project: Pig Issue Type: Task Reporter: Olga Natkovich Assignee: Olga Natkovich Due to the changes to the l

[jira] Updated: (PIG-872) use distributed cache for the replicated data set in FR join

2009-11-23 Thread Olga Natkovich (JIRA)
[ https://issues.apache.org/jira/browse/PIG-872?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Olga Natkovich updated PIG-872: --- Resolution: Fixed Fix Version/s: 0.7.0 Status: Resolved (was: Patch Available) Patch

Build failed in Hudson: Pig-trunk #627

2009-11-23 Thread Apache Hudson Server
See Changes: [olga] PIG-872: use distributed cache for the replicated data set in FR join (sriranjan via olgan) -- [...truncated 2679 lines...] ivy-init-dirs: ivy-probe-antlib: ivy-init-a

[jira] Commented: (PIG-966) Proposed rework for LoadFunc, StoreFunc, and Slice/r interfaces

2009-11-23 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-966?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12781652#action_12781652 ] Alan Gates commented on PIG-966: Updated http://wiki.apache.org/pig/LoadStoreRedesignProposal

[jira] Updated: (PIG-1078) [zebra] merge join with empty table failed

2009-11-23 Thread Chao Wang (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1078?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chao Wang updated PIG-1078: --- Patch reviewed. +1 > [zebra] merge join with empty table failed > -- > >

[jira] Commented: (PIG-1101) Pig parser does not recognize its own data type in LIMIT statement

2009-11-23 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1101?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12781674#action_12781674 ] Alan Gates commented on PIG-1101: - I'll review this patch. > Pig parser does not recognize i

[jira] Created: (PIG-1104) [zebra] Provide streaming support in Zebra.

2009-11-23 Thread Chao Wang (JIRA)
[zebra] Provide streaming support in Zebra. --- Key: PIG-1104 URL: https://issues.apache.org/jira/browse/PIG-1104 Project: Pig Issue Type: New Feature Affects Versions: 0.4.0 Reporter:

[jira] Updated: (PIG-1095) [zebra] Schema support of anonymous fields in COLECTION fails

2009-11-23 Thread Yan Zhou (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1095?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yan Zhou updated PIG-1095: -- Attachment: PIG-1095.patch > [zebra] Schema support of anonymous fields in COLECTION fails >

[jira] Updated: (PIG-1095) [zebra] Schema support of anonymous fields in COLECTION fails

2009-11-23 Thread Yan Zhou (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1095?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yan Zhou updated PIG-1095: -- Status: Patch Available (was: Open) > [zebra] Schema support of anonymous fields in COLECTION fails > --

[jira] Commented: (PIG-966) Proposed rework for LoadFunc, StoreFunc, and Slice/r interfaces

2009-11-23 Thread Dmitriy V. Ryaboy (JIRA)
[ https://issues.apache.org/jira/browse/PIG-966?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12781692#action_12781692 ] Dmitriy V. Ryaboy commented on PIG-966: --- LoadFunc has a method called determineSchema, n

[jira] Commented: (PIG-966) Proposed rework for LoadFunc, StoreFunc, and Slice/r interfaces

2009-11-23 Thread Dmitriy V. Ryaboy (JIRA)
[ https://issues.apache.org/jira/browse/PIG-966?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12781695#action_12781695 ] Dmitriy V. Ryaboy commented on PIG-966: --- Regarding Streaming: We should support "Typed

[jira] Commented: (PIG-598) Parameter substitution ($PARAMETER) should not be performed in comments

2009-11-23 Thread Hadoop QA (JIRA)
[ https://issues.apache.org/jira/browse/PIG-598?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12781705#action_12781705 ] Hadoop QA commented on PIG-598: --- -1 overall. Here are the results of testing the latest attachm

[jira] Commented: (PIG-598) Parameter substitution ($PARAMETER) should not be performed in comments

2009-11-23 Thread Thejas M Nair (JIRA)
[ https://issues.apache.org/jira/browse/PIG-598?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12781711#action_12781711 ] Thejas M Nair commented on PIG-598: --- bq. -1 javac. The applied patch generated 213 javac com

[jira] Commented: (PIG-1016) Reading in map data seems broken

2009-11-23 Thread Thejas M Nair (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1016?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12781727#action_12781727 ] Thejas M Nair commented on PIG-1016: I agree with hc busy that PigStorage in current stat

[jira] Commented: (PIG-1095) [zebra] Schema support of anonymous fields in COLECTION fails

2009-11-23 Thread Hadoop QA (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1095?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12781781#action_12781781 ] Hadoop QA commented on PIG-1095: +1 overall. Here are the results of testing the latest atta

Re: TPC-H benchmark

2009-11-23 Thread Jeff Hammerbacher
Hey, It's not Pig, but if you're looking for TPC-H on Hadoop, the Hive team has run the TPC-H benchmarks: http://issues.apache.org/jira/browse/HIVE-600. Regards, Jeff 2009/11/23 Alan Gates > I don't know of any. Officially Pig cannot publish a TPC-H number because > it is not a transaction ba