[jira] Created: (PIG-1238) Dump does not respect the schema

2010-02-16 Thread Ankur (JIRA)
Dump does not respect the schema Key: PIG-1238 URL: https://issues.apache.org/jira/browse/PIG-1238 Project: Pig Issue Type: Bug Affects Versions: 0.6.0 Reporter: Ankur For complex data type

[jira] Commented: (PIG-1238) Dump does not respect the schema

2010-02-16 Thread Ankur (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1238?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12834151#action_12834151 ] Ankur commented on PIG-1238: Here is a script to reproduce the issue:- A = LOAD 'two.txt' USING

[jira] Commented: (PIG-1188) Padding nulls to the input tuple according to input schema

2010-02-16 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1188?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12834362#action_12834362 ] Richard Ding commented on PIG-1188: Actually, Pig is already padding nulls to the input

[jira] Updated: (PIG-1115) [zebra] temp files are not cleaned.

2010-02-16 Thread Gaurav Jain (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1115?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gaurav Jain updated PIG-1115: Attachment: PIG-1115.patch Patch for the fix. We rely on application to call BTOF.close() for successful

[jira] Issue Comment Edited: (PIG-1238) Dump does not respect the schema

2010-02-16 Thread Olga Natkovich (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1238?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12834151#action_12834151 ] Olga Natkovich edited comment on PIG-1238 at 2/16/10 6:30 PM:

[jira] Created: (PIG-1239) PigContext.connect() should not create a jobClient and jobClient should be created on demand when needed

2010-02-16 Thread Pradeep Kamath (JIRA)
PigContext.connect() should not create a jobClient and jobClient should be created on demand when needed Key: PIG-1239 URL:

[jira] Commented: (PIG-1115) [zebra] temp files are not cleaned.

2010-02-16 Thread Hong Tang (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1115?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12834373#action_12834373 ] Hong Tang commented on PIG-1115: Why not requesting the patch to be back ported to Hadoop

[jira] Updated: (PIG-1229) allow pig to write output into a JDBC db

2010-02-16 Thread Olga Natkovich (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1229?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Olga Natkovich updated PIG-1229: Fix Version/s: (was: 0.6.0) 0.7.0 Updated version number since it is not a

[jira] Commented: (PIG-1233) NullPointerException in AVG

2010-02-16 Thread Olga Natkovich (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1233?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12834377#action_12834377 ] Olga Natkovich commented on PIG-1233: Hi Ankur, We see this kind of issue with tests

[jira] Commented: (PIG-1234) Unable to create input slice for har:// files

2010-02-16 Thread Tsz Wo (Nicholas), SZE (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1234?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12834378#action_12834378 ] Tsz Wo (Nicholas), SZE commented on PIG-1234: 1) '\n' is not a valid argument

[jira] Updated: (PIG-1233) NullPointerException in AVG

2010-02-16 Thread Olga Natkovich (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1233?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Olga Natkovich updated PIG-1233: Fix Version/s: (was: 0.6.0) 0.7.0 This does not seem like a candidate for

[jira] Commented: (PIG-1115) [zebra] temp files are not cleaned.

2010-02-16 Thread Gaurav Jain (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1115?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12834380#action_12834380 ] Gaurav Jain commented on PIG-1115: We discussed the backport with M/R team ( patch

[jira] Updated: (PIG-1213) Schema serialization is broken

2010-02-16 Thread Pradeep Kamath (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1213?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pradeep Kamath updated PIG-1213: Resolution: Fixed Hadoop Flags: [Reviewed] Status: Resolved (was: Patch Available)

[jira] Created: (PIG-1240) [Zebra] suggestion to have zebra manifest file contain version and svn-revision etc.

2010-02-16 Thread Gaurav Jain (JIRA)
[Zebra] suggestion to have zebra manifest file contain version and svn-revision etc. Key: PIG-1240 URL: https://issues.apache.org/jira/browse/PIG-1240 Project:

Plan to merge load-store-redesign branch to trunk

2010-02-16 Thread Pradeep Kamath
Hi, We would like to merge the load-store-redesign branch to trunk tentatively on Thursday. To do this, I would like to request all committers to not commit anything to load-store-redesign branch or trunk during the period of the merge. I will send out a mail to indicate begin and end of this

[jira] Updated: (PIG-1239) PigContext.connect() should not create a jobClient and jobClient should be created on demand when needed

2010-02-16 Thread Pradeep Kamath (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1239?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pradeep Kamath updated PIG-1239: Attachment: PIG-1239-load-store-redesign-branch.patch PIG-1239-branch-0.6.patch

[jira] Commented: (PIG-1239) PigContext.connect() should not create a jobClient and jobClient should be created on demand when needed

2010-02-16 Thread Olga Natkovich (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1239?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12834408#action_12834408 ] Olga Natkovich commented on PIG-1239: +1 on both patches PigContext.connect() should

[jira] Commented: (PIG-1115) [zebra] temp files are not cleaned.

2010-02-16 Thread Yan Zhou (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1115?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12834409#action_12834409 ] Yan Zhou commented on PIG-1115: patch reviewed +1 [zebra] temp files are not cleaned.

[jira] Commented: (PIG-1216) New load store design does not allow Pig to validate inputs and outputs up front

2010-02-16 Thread Pradeep Kamath (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1216?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12834411#action_12834411 ] Pradeep Kamath commented on PIG-1216: Review comments: * Is it ok to call outputSpecs

[jira] Commented: (PIG-1238) Dump does not respect the schema

2010-02-16 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1238?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12834422#action_12834422 ] Daniel Dai commented on PIG-1238: Hi, Ankur, I encounter syntax error in B = FOREACH A

[jira] Updated: (PIG-1240) [Zebra] suggestion to have zebra manifest file contain version and svn-revision etc.

2010-02-16 Thread Gaurav Jain (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1240?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gaurav Jain updated PIG-1240: Attachment: PIG-1240.patch Old looked like:

[jira] Commented: (PIG-1239) PigContext.connect() should not create a jobClient and jobClient should be created on demand when needed

2010-02-16 Thread Pradeep Kamath (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1239?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12834443#action_12834443 ] Pradeep Kamath commented on PIG-1239: * No unit tests are included in both patches since

[jira] Commented: (PIG-1115) [zebra] temp files are not cleaned.

2010-02-16 Thread Yan Zhou (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1115?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12834445#action_12834445 ] Yan Zhou commented on PIG-1115: Hudson results on the load-store-redesign branch: +1 overall.

[jira] Resolved: (PIG-1239) PigContext.connect() should not create a jobClient and jobClient should be created on demand when needed

2010-02-16 Thread Pradeep Kamath (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1239?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pradeep Kamath resolved PIG-1239. Resolution: Fixed Hadoop Flags: [Reviewed] Patch committed to branch-0.6 and

[jira] Commented: (PIG-1215) Make Hadoop jobId more prominent in the client log

2010-02-16 Thread Olga Natkovich (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1215?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12834458#action_12834458 ] Olga Natkovich commented on PIG-1215: The log part of the change is not quite right. I

[jira] Commented: (PIG-1234) Unable to create input slice for har:// files

2010-02-16 Thread Tsz Wo (Nicholas), SZE (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1234?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12834483#action_12834483 ] Tsz Wo (Nicholas), SZE commented on PIG-1234: The patch worked fine: The

[jira] Commented: (PIG-1226) Need to be able to register jars on the command line

2010-02-16 Thread Olga Natkovich (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1226?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12834488#action_12834488 ] Olga Natkovich commented on PIG-1226: +1, changes look good. Thejas, can you make sure

[jira] Commented: (PIG-1234) Unable to create input slice for har:// files

2010-02-16 Thread Tsz Wo (Nicholas), SZE (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1234?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12834491#action_12834491 ] Tsz Wo (Nicholas), SZE commented on PIG-1234: Pig is amazing! The pig wordcount

[jira] Updated: (PIG-1169) Top-N queries produce incorrect results when a store statement is added between order by and limit statement

2010-02-16 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1169?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Richard Ding updated PIG-1169: Attachment: PIG-1169.patch Top-N queries produce incorrect results when a store statement is added

[jira] Updated: (PIG-1169) Top-N queries produce incorrect results when a store statement is added between order by and limit statement

2010-02-16 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1169?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Richard Ding updated PIG-1169: Status: Patch Available (was: Open) Top-N queries produce incorrect results when a store statement is

[jira] Created: (PIG-1241) Accumulator is turned on when a map is used with a non-accumulative UDF

2010-02-16 Thread Ying He (JIRA)
Accumulator is turned on when a map is used with a non-accumulative UDF Key: PIG-1241 URL: https://issues.apache.org/jira/browse/PIG-1241 Project: Pig Issue Type: Bug

[jira] Updated: (PIG-1241) Accumulator is turned on when a map is used with a non-accumulative UDF

2010-02-16 Thread Ying He (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1241?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ying He updated PIG-1241: Attachment: accum.patch patch to check UDF when it's with map operation Accumulator is turned on when a map is

[jira] Updated: (PIG-1241) Accumulator is turned on when a map is used with a non-accumulative UDF

2010-02-16 Thread Ying He (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1241?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ying He updated PIG-1241: Status: Patch Available (was: Open) Accumulator is turned on when a map is used with a non-accumulative UDF

[jira] Resolved: (PIG-1115) [zebra] temp files are not cleaned.

2010-02-16 Thread Yan Zhou (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1115?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yan Zhou resolved PIG-1115. Resolution: Fixed Patch committed to the load-store-redesign branch. [zebra] temp files are not cleaned.

[jira] Updated: (PIG-1216) New load store design does not allow Pig to validate inputs and outputs up front

2010-02-16 Thread Ashutosh Chauhan (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1216?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ashutosh Chauhan updated PIG-1216: Attachment: pig-1216_1.patch bq. Is it ok to call outputSpecs multiple times [...] Talked with

[jira] Updated: (PIG-1216) New load store design does not allow Pig to validate inputs and outputs up front

2010-02-16 Thread Ashutosh Chauhan (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1216?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ashutosh Chauhan updated PIG-1216: Status: Patch Available (was: Open) New load store design does not allow Pig to validate

[jira] Reopened: (PIG-965) PERFORMANCE: optimize common case in matches (PORegex)

2010-02-16 Thread Ankit Modi (JIRA)
[ https://issues.apache.org/jira/browse/PIG-965?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ankit Modi reopened PIG-965: Assignee: (was: Ankit Modi) I couldn't see the poregex2.patch patch applied in the code. automaton.jar

[jira] Commented: (PIG-1240) [Zebra] suggestion to have zebra manifest file contain version and svn-revision etc.

2010-02-16 Thread Yan Zhou (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1240?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12834587#action_12834587 ] Yan Zhou commented on PIG-1240: +1 [Zebra] suggestion to have zebra manifest file contain

[jira] Updated: (PIG-1218) Use distributed cache to store samples

2010-02-16 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1218?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Richard Ding updated PIG-1218: Attachment: PIG-1218_2.patch The second patch is for LSR branch and ready for review. Use distributed

[jira] Commented: (PIG-1216) New load store design does not allow Pig to validate inputs and outputs up front

2010-02-16 Thread Ashutosh Chauhan (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1216?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12834608#action_12834608 ] Ashutosh Chauhan commented on PIG-1216: Thinking more about this. We don't do

[jira] Commented: (PIG-1216) New load store design does not allow Pig to validate inputs and outputs up front

2010-02-16 Thread Hadoop QA (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1216?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12834637#action_12834637 ] Hadoop QA commented on PIG-1216: -1 overall. Here are the results of testing the latest

[jira] Commented: (PIG-1238) Dump does not respect the schema

2010-02-16 Thread Ankur (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1238?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12834642#action_12834642 ] Ankur commented on PIG-1238: Daniel the correct syntax is - ['b'#['c'#'12']] as mapFields. Dump

[jira] Commented: (PIG-1238) Dump does not respect the schema

2010-02-16 Thread Ankur (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1238?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12834643#action_12834643 ] Ankur commented on PIG-1238: Seems like inner [] are making parts of it appear underlined.

[jira] Commented: (PIG-1238) Dump does not respect the schema

2010-02-16 Thread Ankur (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1238?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12834644#action_12834644 ] Ankur commented on PIG-1238: Sigh Enclose 'c'#'12' in a square bracket and then enclose 'b'#

[jira] Commented: (PIG-1233) NullPointerException in AVG

2010-02-16 Thread Ankur (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1233?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12834645#action_12834645 ] Ankur commented on PIG-1233: Olga, All queries that use AVG(), have null values for

[jira] Updated: (PIG-1233) NullPointerException in AVG

2010-02-16 Thread Ankur (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1233?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ankur updated PIG-1233: Status: In Progress (was: Patch Available) NullPointerException in AVG

[jira] Updated: (PIG-1233) NullPointerException in AVG

2010-02-16 Thread Ankur (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1233?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ankur updated PIG-1233: Status: Patch Available (was: In Progress) Retrying as suggested by Olga NullPointerException in AVG

[jira] Commented: (PIG-1240) [Zebra] suggestion to have zebra manifest file contain version and svn-revision etc.

2010-02-16 Thread Yan Zhou (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1240?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12834650#action_12834650 ] Yan Zhou commented on PIG-1240: Results of Hudson run on the load-store-redesign branch: -1

[jira] Resolved: (PIG-1240) [Zebra] suggestion to have zebra manifest file contain version and svn-revision etc.

2010-02-16 Thread Yan Zhou (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1240?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yan Zhou resolved PIG-1240. Resolution: Fixed Patch committed to the load-store-redesign branch. [Zebra] suggestion to have zebra