[jira] Updated: (PIG-893) support cast of chararray to other simple types

2009-08-03 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/PIG-893?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated PIG-893: --- Fix Version/s: 0.4.0 Affects Version/s: 0.4.0 Status: Patch Available (was: Open) > supp

[jira] Updated: (PIG-893) support cast of chararray to other simple types

2009-08-03 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/PIG-893?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated PIG-893: --- Attachment: Pig_893_Patch.txt Attached the patch, including the TestCase. I extracted the bytesTo* method from Utf8S

[jira] Assigned: (PIG-893) support cast of chararray to other simple types

2009-08-03 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/PIG-893?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang reassigned PIG-893: -- Assignee: Jeff Zhang > support cast of chararray to other simple types > -

[jira] Commented: (PIG-592) schema inferred incorrectly

2009-08-03 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-592?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12738472#action_12738472 ] Daniel Dai commented on PIG-592: Also, the following script produces the wrong schema: a = load

[jira] Created: (PIG-900) ORDER BY syntax wrt parentheses is somewhat different than GROUP BY and FILTER BY

2009-08-03 Thread David Ciemiewicz (JIRA)
ORDER BY syntax wrt parentheses is somewhat different than GROUP BY and FILTER BY - Key: PIG-900 URL: https://issues.apache.org/jira/browse/PIG-900 Project: Pig

[jira] Updated: (PIG-900) ORDER BY syntax wrt parentheses is somewhat different than GROUP BY and FILTER BY

2009-08-03 Thread David Ciemiewicz (JIRA)
[ https://issues.apache.org/jira/browse/PIG-900?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] David Ciemiewicz updated PIG-900: - Description: With GROUP BY, you must put parentheses around the aliases in the BY clause: {code} B

[jira] Updated: (PIG-900) ORDER BY syntax wrt parentheses is somewhat different than GROUP BY and FILTER BY

2009-08-03 Thread David Ciemiewicz (JIRA)
[ https://issues.apache.org/jira/browse/PIG-900?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] David Ciemiewicz updated PIG-900: - Description: With GROUP BY, you must put parentheses around the aliases in the BY clause: {code} B

[jira] Created: (PIG-901) InputSplit (SliceWrapper) created by Pig is big in size due to serialized PigContext

2009-08-03 Thread Pradeep Kamath (JIRA)
InputSplit (SliceWrapper) created by Pig is big in size due to serialized PigContext Key: PIG-901 URL: https://issues.apache.org/jira/browse/PIG-901 Project: Pig

[jira] Created: (PIG-902) Allow schema matching for UDF with variable length arguments

2009-08-03 Thread Daniel Dai (JIRA)
Allow schema matching for UDF with variable length arguments Key: PIG-902 URL: https://issues.apache.org/jira/browse/PIG-902 Project: Pig Issue Type: Improvement Componen

[jira] Commented: (PIG-901) InputSplit (SliceWrapper) created by Pig is big in size due to serialized PigContext

2009-08-03 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-901?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12738496#action_12738496 ] Daniel Dai commented on PIG-901: PigContext.packageImportList needs to be serialized as well.

[jira] Updated: (PIG-882) log level not propagated to loggers

2009-08-03 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-882?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-882: --- Resolution: Fixed Status: Resolved (was: Patch Available) All unit tests passed. Hudson does not respond,

PushUpProject optimization design

2009-08-03 Thread Daniel Dai
Hi, Pig needs to prune unused columns as early as possible. Here is a design for how to implement it in Pig. Presumption: * Prune columns of loader, save time for record parsing a = load 'a' as (n1:chararray, n2:chararray, n3:chararray); b = foreach a generate n1, n2; => a = load 'a'
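
The idea of pruning unused columns at load time can be illustrated with a minimal sketch. This is not Pig's implementation: the function names (required_columns, load_pruned), the tab delimiter, and the sample rows are all hypothetical, chosen only to mirror the n1/n2/n3 example in the message above.

```python
# Hypothetical sketch of loader-side column pruning; none of these
# names are Pig APIs, and the tab delimiter is an assumption.

def required_columns(schema, projected):
    """Return the indices of the schema columns that a later projection uses."""
    wanted = set(projected)
    return [i for i, name in enumerate(schema) if name in wanted]


def load_pruned(lines, schema, projected, delimiter="\t"):
    """Yield one tuple per input line, keeping only the projected columns."""
    keep = required_columns(schema, projected)
    for line in lines:
        fields = line.rstrip("\n").split(delimiter)
        yield tuple(fields[i] for i in keep)


# Mirrors: a = load 'a' as (n1, n2, n3); b = foreach a generate n1, n2;
schema = ["n1", "n2", "n3"]
rows = list(load_pruned(["a\tb\tc", "d\te\tf"], schema, ["n1", "n2"]))
```

Here the projection is known before any record is read, so the unused column n3 never reaches downstream operators; a real loader could go further and avoid parsing it at all.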

[jira] Updated: (PIG-200) Pig Performance Benchmarks

2009-08-03 Thread Ying He (JIRA)
[ https://issues.apache.org/jira/browse/PIG-200?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ying He updated PIG-200: Attachment: perf.hadoop.patch perf.hadoop.patch is used to support running DataGenerator in hadoop mode. It should be

[jira] Resolved: (PIG-67) FileLocalizer doesn't work on reduce side

2009-08-03 Thread Olga Natkovich (JIRA)
[ https://issues.apache.org/jira/browse/PIG-67?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Olga Natkovich resolved PIG-67. --- Resolution: Fixed This issue has been resolved by PIG https://issues.apache.org/jira/browse/PIG-792 >

[jira] Created: (PIG-903) ILLUSTRATE fails on 'Distinct' operator

2009-08-03 Thread Dmitriy V. Ryaboy (JIRA)
ILLUSTRATE fails on 'Distinct' operator --- Key: PIG-903 URL: https://issues.apache.org/jira/browse/PIG-903 Project: Pig Issue Type: Bug Reporter: Dmitriy V. Ryaboy Using the latest Pig from

[jira] Issue Comment Edited: (PIG-200) Pig Performance Benchmarks

2009-08-03 Thread Olga Natkovich (JIRA)
[ https://issues.apache.org/jira/browse/PIG-200?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12738556#action_12738556 ] Olga Natkovich edited comment on PIG-200 at 8/3/09 2:01 PM: perf.h

[jira] Commented: (PIG-200) Pig Performance Benchmarks

2009-08-03 Thread Ying He (JIRA)
[ https://issues.apache.org/jira/browse/PIG-200?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12738609#action_12738609 ] Ying He commented on PIG-200: - doc for DataGenerator in hadoop mode is here: http://wiki.apache.o

[jira] Updated: (PIG-660) Integration with Hadoop 0.20

2009-08-03 Thread Pradeep Kamath (JIRA)
[ https://issues.apache.org/jira/browse/PIG-660?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pradeep Kamath updated PIG-660: --- Attachment: PIG-660-for-branch-0.3.patch Attached a patch for "branch-0.3" based on PIG-660_5.patch. The

[jira] Updated: (PIG-901) InputSplit (SliceWrapper) created by Pig is big in size due to serialized PigContext

2009-08-03 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-901?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-901: --- Attachment: PIG-901-1.patch Add a unit test to make sure this change will not affect udf.import.list > InputSpl

[jira] Created: (PIG-904) Conversion from double to chararray for udf input arguments does not occur

2009-08-03 Thread Pradeep Kamath (JIRA)
Conversion from double to chararray for udf input arguments does not occur -- Key: PIG-904 URL: https://issues.apache.org/jira/browse/PIG-904 Project: Pig Issue Type: Bu

[jira] Updated: (PIG-901) InputSplit (SliceWrapper) created by Pig is big in size due to serialized PigContext

2009-08-03 Thread Pradeep Kamath (JIRA)
[ https://issues.apache.org/jira/browse/PIG-901?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pradeep Kamath updated PIG-901: --- Attachment: PIG-901-branch-0.3.patch Patch for 0.3 branch > InputSplit (SliceWrapper) created by Pig is

[jira] Commented: (PIG-901) InputSplit (SliceWrapper) created by Pig is big in size due to serialized PigContext

2009-08-03 Thread Olga Natkovich (JIRA)
[ https://issues.apache.org/jira/browse/PIG-901?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12738734#action_12738734 ] Olga Natkovich commented on PIG-901: +1 on the patch for the 0.3 branch. Please, commit >

[jira] Commented: (PIG-901) InputSplit (SliceWrapper) created by Pig is big in size due to serialized PigContext

2009-08-03 Thread Arun C Murthy (JIRA)
[ https://issues.apache.org/jira/browse/PIG-901?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12738740#action_12738740 ] Arun C Murthy commented on PIG-901: --- It would be nice to add a test case which (for now) che

Re: Is it possible to access Configuration in UDF ?

2009-08-03 Thread zhang jianfeng
Dmitriy, Thank you for your help. I find this way of using the API not very intuitive; I recommend that the base class of UDF implement the Configurable interface. Then each UDF can use getConf() to get the Configuration object. Because UDF is part of MapReduce, it makes sense to make it Confi

Re: Is it possible to access Configuration in UDF ?

2009-08-03 Thread Daniel Dai
Hi, Jeff, This is not an API at all; this is a hack to make things work. We do lack several features for UDFs: 1. reporter and counter (PIG-889) 2. access global properties 3. ability to maintain states across different UDF invocations 4. input schema 5. variable length arguments (PIG-902) Your

[jira] Assigned: (PIG-891) Fixing dfs statement for Pig

2009-08-03 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/PIG-891?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang reassigned PIG-891: -- Assignee: Jeff Zhang > Fixing dfs statement for Pig > > > Key