[jira] Commented: (PIG-1234) Unable to create input slice for har:// files

2010-02-12 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1234?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12833233#action_12833233 ] Richard Ding commented on PIG-1234: --- +1 for commit. > Unable to create input slice for har

[jira] Commented: (PIG-1131) Pig simple join does not work when it contains empty lines

2010-02-12 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1131?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12833250#action_12833250 ] Richard Ding commented on PIG-1131: --- +1 for commit. > Pig simple join does not work when i

[jira] Commented: (PIG-1188) Padding nulls to the input tuple according to input schema

2010-02-12 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1188?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12833274#action_12833274 ] Richard Ding commented on PIG-1188: --- I suggest we don't change the current behavior of Pig

[jira] Commented: (PIG-1188) Padding nulls to the input tuple according to input schema

2010-02-16 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1188?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12834362#action_12834362 ] Richard Ding commented on PIG-1188: --- Actually, Pig is already padding nulls to the input tu

[jira] Updated: (PIG-1169) Top-N queries produce incorrect results when a store statement is added between order by and limit statement

2010-02-16 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1169?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Richard Ding updated PIG-1169: -- Attachment: PIG-1169.patch > Top-N queries produce incorrect results when a store statement is added > b

[jira] Updated: (PIG-1169) Top-N queries produce incorrect results when a store statement is added between order by and limit statement

2010-02-16 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1169?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Richard Ding updated PIG-1169: -- Status: Patch Available (was: Open) > Top-N queries produce incorrect results when a store statement is

[jira] Updated: (PIG-1218) Use distributed cache to store samples

2010-02-16 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1218?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Richard Ding updated PIG-1218: -- Attachment: PIG-1218_2.patch The second patch is for LSR branch and ready for review. > Use distributed

[jira] Updated: (PIG-1194) ERROR 2055: Received Error while processing the map plan

2010-02-17 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1194?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Richard Ding updated PIG-1194: -- Resolution: Fixed Status: Resolved (was: Patch Available) The second patch is committed. > ERRO

[jira] Updated: (PIG-1169) Top-N queries produce incorrect results when a store statement is added between order by and limit statement

2010-02-17 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1169?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Richard Ding updated PIG-1169: -- Resolution: Fixed Status: Resolved (was: Patch Available) patch committed. > Top-N queries prod

[jira] Updated: (PIG-1218) Use distributed cache to store samples

2010-02-18 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1218?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Richard Ding updated PIG-1218: -- Attachment: PIG-1218_2.patch Updated the patch to address the comments of Pradeep and Ashutosh. > Use di

[jira] Updated: (PIG-1218) Use distributed cache to store samples

2010-02-18 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1218?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Richard Ding updated PIG-1218: -- Attachment: (was: PIG-1218_2.patch) > Use distributed cache to store samples > --

[jira] Updated: (PIG-1218) Use distributed cache to store samples

2010-02-18 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1218?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Richard Ding updated PIG-1218: -- Attachment: PIG-1218_3.patch The patch 3 includes all of patch 2 plus distributed cache for merge join's

[jira] Commented: (PIG-1188) Padding nulls to the input tuple according to input schema

2010-02-19 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1188?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12835944#action_12835944 ] Richard Ding commented on PIG-1188: --- To summarize where we are: Right now Pig project oper

[jira] Commented: (PIG-1250) Make StoreFunc an abstract class and create a mirror interface called StoreFuncInterface

2010-02-22 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1250?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12836968#action_12836968 ] Richard Ding commented on PIG-1250: --- +1 > Make StoreFunc an abstract class and create a mi

[jira] Updated: (PIG-1079) Modify merge join to use distributed cache to maintain the index

2010-02-22 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1079?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Richard Ding updated PIG-1079: -- Attachment: PIG-1079.patch Attached patch uses Hadoop DistributedCache for distribution of merge join in

[jira] Updated: (PIG-1079) Modify merge join to use distributed cache to maintain the index

2010-02-22 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1079?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Richard Ding updated PIG-1079: -- Status: Patch Available (was: Open) > Modify merge join to use distributed cache to maintain the index >

[jira] Updated: (PIG-1079) Modify merge join to use distributed cache to maintain the index

2010-02-23 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1079?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Richard Ding updated PIG-1079: -- Status: Open (was: Patch Available) > Modify merge join to use distributed cache to maintain the index >

[jira] Updated: (PIG-1079) Modify merge join to use distributed cache to maintain the index

2010-02-23 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1079?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Richard Ding updated PIG-1079: -- Attachment: PIG-1079.patch New patch to remove the javadoc warning. > Modify merge join to use distribut

[jira] Updated: (PIG-1079) Modify merge join to use distributed cache to maintain the index

2010-02-23 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1079?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Richard Ding updated PIG-1079: -- Status: Patch Available (was: Open) > Modify merge join to use distributed cache to maintain the index >

[jira] Commented: (PIG-613) Casting elements inside a tuple does not take effect

2010-02-23 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-613?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12837398#action_12837398 ] Richard Ding commented on PIG-613: -- Initial comment: the patch also needs to remove reference

[jira] Commented: (PIG-613) Casting elements inside a tuple does not take effect

2010-02-23 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-613?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12837479#action_12837479 ] Richard Ding commented on PIG-613: -- The patch looks good. A few comments: * File TestText

[jira] Assigned: (PIG-1252) Diamond splitter does not generate correct results when using Multi-query optimization

2010-02-23 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1252?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Richard Ding reassigned PIG-1252: - Assignee: Richard Ding > Diamond splitter does not generate correct results when using Multi-query

[jira] Updated: (PIG-1079) Modify merge join to use distributed cache to maintain the index

2010-02-24 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1079?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Richard Ding updated PIG-1079: -- Resolution: Fixed Status: Resolved (was: Patch Available) Patch committed (No new unit test is a

[jira] Resolved: (PIG-1243) Passing Complex map types to and from streaming causes a problem

2010-02-24 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1243?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Richard Ding resolved PIG-1243. --- Resolution: Fixed This is fixed by way of PIG-613. > Passing Complex map types to and from streaming c

[jira] Commented: (PIG-1265) Change LoadMetadata and StoreMetadata to use Job instead of Configuraiton and add a cleanupOnFailure method to StoreFuncInterface

2010-02-26 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1265?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12839104#action_12839104 ] Richard Ding commented on PIG-1265: --- +1 > Change LoadMetadata and StoreMetadata to use Job

[jira] Created: (PIG-1267) Problems with partition filter optimizer

2010-02-26 Thread Richard Ding (JIRA)
Problems with partition filter optimizer Key: PIG-1267 URL: https://issues.apache.org/jira/browse/PIG-1267 Project: Pig Issue Type: Bug Affects Versions: 0.7.0 Reporter: Richard Ding

[jira] Updated: (PIG-1267) Problems with partition filter optimizer

2010-03-01 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1267?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Richard Ding updated PIG-1267: -- Attachment: PIG-1267.patch > Problems with partition filter optimizer > -

[jira] Updated: (PIG-1267) Problems with partition filter optimizer

2010-03-01 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1267?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Richard Ding updated PIG-1267: -- Status: Patch Available (was: Open) > Problems with partition filter optimizer > ---

[jira] Updated: (PIG-1252) Diamond splitter does not generate correct results when using Multi-query optimization

2010-03-03 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1252?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Richard Ding updated PIG-1252: -- Status: Patch Available (was: Open) > Diamond splitter does not generate correct results when using Mult

[jira] Updated: (PIG-1252) Diamond splitter does not generate correct results when using Multi-query optimization

2010-03-03 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1252?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Richard Ding updated PIG-1252: -- Attachment: PIG-1252.patch This is the result of diamond query optimizer merging a job that has secondary

[jira] Commented: (PIG-1252) Diamond splitter does not generate correct results when using Multi-query optimization

2010-03-03 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1252?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12840889#action_12840889 ] Richard Ding commented on PIG-1252: --- The secondary key optimization is documented in PIG-10

[jira] Updated: (PIG-1273) Skewed join throws error

2010-03-03 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1273?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Richard Ding updated PIG-1273: -- Attachment: PIG-1273.patch Modify the code to handle the empty input file being sampled for skewed join.

[jira] Updated: (PIG-1273) Skewed join throws error

2010-03-03 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1273?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Richard Ding updated PIG-1273: -- Status: Patch Available (was: Open) > Skewed join throws error > - > >

[jira] Updated: (PIG-1267) Problems with partition filter optimizer

2010-03-04 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1267?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Richard Ding updated PIG-1267: -- Resolution: Fixed Status: Resolved (was: Patch Available) > Problems with partition filter optim

[jira] Updated: (PIG-1273) Skewed join throws error

2010-03-04 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1273?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Richard Ding updated PIG-1273: -- Resolution: Fixed Status: Resolved (was: Patch Available) > Skewed join throws error >

[jira] Updated: (PIG-1260) Param Subsitution results in parser error if there is no EOL after last line in script

2010-03-05 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1260?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Richard Ding updated PIG-1260: -- Attachment: PIG-1260.patch The cause is the EOL is missing at the end of script. The problem is that thi

[jira] Updated: (PIG-1260) Param Subsitution results in parser error if there is no EOL after last line in script

2010-03-05 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1260?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Richard Ding updated PIG-1260: -- Status: Patch Available (was: Open) > Param Subsitution results in parser error if there is no EOL after

[jira] Updated: (PIG-1264) Skewed join sampler misses out the key with the highest frequency

2010-03-05 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1264?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Richard Ding updated PIG-1264: -- Affects Version/s: 0.7.0 > Skewed join sampler misses out the key with the highest frequency > --

[jira] Commented: (PIG-1264) Skewed join sampler misses out the key with the highest frequency

2010-03-05 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1264?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12841984#action_12841984 ] Richard Ding commented on PIG-1264: --- Somehow during the many merges between the trunk and L

[jira] Updated: (PIG-1252) Diamond splitter does not generate correct results when using Multi-query optimization

2010-03-05 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1252?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Richard Ding updated PIG-1252: -- Status: Open (was: Patch Available) > Diamond splitter does not generate correct results when using Mult

[jira] Updated: (PIG-1252) Diamond splitter does not generate correct results when using Multi-query optimization

2010-03-05 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1252?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Richard Ding updated PIG-1252: -- Status: Patch Available (was: Open) > Diamond splitter does not generate correct results when using Mult

[jira] Updated: (PIG-1252) Diamond splitter does not generate correct results when using Multi-query optimization

2010-03-05 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1252?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Richard Ding updated PIG-1252: -- Attachment: PIG-1252.patch > Diamond splitter does not generate correct results when using Multi-query >

[jira] Resolved: (PIG-1264) Skewed join sampler misses out the key with the highest frequency

2010-03-05 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1264?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Richard Ding resolved PIG-1264. --- Resolution: Fixed > Skewed join sampler misses out the key with the highest frequency > ---

[jira] Created: (PIG-1279) Make sample loaders interchangeable

2010-03-05 Thread Richard Ding (JIRA)
Make sample loaders interchangeable Key: PIG-1279 URL: https://issues.apache.org/jira/browse/PIG-1279 Project: Pig Issue Type: Bug Reporter: Richard Ding In Pig 0.6 one can use random sampl

[jira] Updated: (PIG-1279) Make sample loaders interchangeable

2010-03-05 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1279?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Richard Ding updated PIG-1279: -- Affects Version/s: 0.7.0 > Make sample loaders interchangeable > >

[jira] Assigned: (PIG-1238) Dump does not respect the schema

2010-03-08 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1238?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Richard Ding reassigned PIG-1238: - Assignee: Richard Ding (was: Daniel Dai) > Dump does not respect the schema >

[jira] Updated: (PIG-1252) Diamond splitter does not generate correct results when using Multi-query optimization

2010-03-08 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1252?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Richard Ding updated PIG-1252: -- Attachment: (was: PIG-1252.patch) > Diamond splitter does not generate correct results when using Mul

[jira] Updated: (PIG-1260) Param Subsitution results in parser error if there is no EOL after last line in script

2010-03-08 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1260?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Richard Ding updated PIG-1260: -- Status: Patch Available (was: Open) > Param Subsitution results in parser error if there is no EOL after

[jira] Updated: (PIG-1260) Param Subsitution results in parser error if there is no EOL after last line in script

2010-03-08 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1260?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Richard Ding updated PIG-1260: -- Status: Open (was: Patch Available) > Param Subsitution results in parser error if there is no EOL after

[jira] Resolved: (PIG-1278) Type mismatch in key from map: expected org.apache.pig.impl.io.NullableFloatWritable, recieved org.apache.pig.impl.io.NullableText

2010-03-08 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1278?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Richard Ding resolved PIG-1278. --- Resolution: Duplicate Release Note: PIG-1238 > Type mismatch in key from map: expected > org.apa

[jira] Updated: (PIG-1238) Dump does not respect the schema

2010-03-08 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1238?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Richard Ding updated PIG-1238: -- Attachment: PIG-1238.patch Pig inserts a new Limit (or Top-K) job with one reducer after a Limit (or Top

[jira] Updated: (PIG-1238) Dump does not respect the schema

2010-03-08 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1238?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Richard Ding updated PIG-1238: -- Status: Patch Available (was: Open) > Dump does not respect the schema > ---

[jira] Updated: (PIG-1238) Dump does not respect the schema

2010-03-10 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1238?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Richard Ding updated PIG-1238: -- Resolution: Fixed Hadoop Flags: [Reviewed] Status: Resolved (was: Patch Available) > Dum

[jira] Updated: (PIG-1260) Param Subsitution results in parser error if there is no EOL after last line in script

2010-03-10 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1260?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Richard Ding updated PIG-1260: -- Resolution: Fixed Hadoop Flags: [Reviewed] Status: Resolved (was: Patch Available) > Par

[jira] Commented: (PIG-1252) Diamond splitter does not generate correct results when using Multi-query optimization

2010-03-10 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1252?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12843764#action_12843764 ] Richard Ding commented on PIG-1252: --- +1 for Daniel's patch > Diamond splitter does not gen

[jira] Updated: (PIG-1252) Diamond splitter does not generate correct results when using Multi-query optimization

2010-03-10 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1252?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Richard Ding updated PIG-1252: -- Resolution: Fixed Hadoop Flags: [Reviewed] Status: Resolved (was: Patch Available) > Dia

[jira] Commented: (PIG-1275) empty bag in PigStorage read as null

2010-03-10 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1275?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12843866#action_12843866 ] Richard Ding commented on PIG-1275: --- +1 > empty bag in PigStorage read as null > -

[jira] Updated: (PIG-1284) pig UDF is lacking XMLLoader. Plan to add the XMLLoader

2010-03-11 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1284?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Richard Ding updated PIG-1284: -- Status: Open (was: Patch Available) > pig UDF is lacking XMLLoader. Plan to add the XMLLoader >

[jira] Updated: (PIG-1266) Show spill count on the pig console at the end of the job

2010-03-12 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1266?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Richard Ding updated PIG-1266: -- Fix Version/s: 0.7.0 Now the spill count is disabled due to Pig's changing to new Hadoop API in 0.7. We

[jira] Created: (PIG-1298) Restore file traveral behavior to Pig loaders

2010-03-15 Thread Richard Ding (JIRA)
Restore file traveral behavior to Pig loaders - Key: PIG-1298 URL: https://issues.apache.org/jira/browse/PIG-1298 Project: Pig Issue Type: Bug Affects Versions: 0.7.0 Reporter: Rich

[jira] Updated: (PIG-1298) Restore file traversal behavior to Pig loaders

2010-03-15 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1298?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Richard Ding updated PIG-1298: -- Summary: Restore file traversal behavior to Pig loaders (was: Restore file traveral behavior to Pig load

[jira] Created: (PIG-1299) Implement Pig counter to track number of output rows for each output files

2010-03-15 Thread Richard Ding (JIRA)
Implement Pig counter to track number of output rows for each output files Key: PIG-1299 URL: https://issues.apache.org/jira/browse/PIG-1299 Project: Pig Issue Ty

[jira] Updated: (PIG-1266) Show spill count on the pig console at the end of the job

2010-03-15 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1266?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Richard Ding updated PIG-1266: -- Assignee: Richard Ding (was: Sriranjan Manjunath) Assign to me to add code enabling the Pig counters. >

[jira] Assigned: (PIG-1279) Make sample loaders interchangeable

2010-03-15 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1279?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Richard Ding reassigned PIG-1279: - Assignee: Richard Ding > Make sample loaders interchangeable > ---

[jira] Commented: (PIG-1287) Use hadoop-0.20.2 with pig 0.7.0 release

2010-03-17 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12846649#action_12846649 ] Richard Ding commented on PIG-1287: --- +1 > Use hadoop-0.20.2 with pig 0.7.0 release > -

[jira] Updated: (PIG-1299) Implement Pig counter to track number of output rows for each output files

2010-03-18 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1299?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Richard Ding updated PIG-1299: -- Fix Version/s: (was: 0.7.0) 0.8.0 > Implement Pig counter to track number of outp

[jira] Updated: (PIG-1298) Restore file traversal behavior to Pig loaders

2010-03-18 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1298?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Richard Ding updated PIG-1298: -- Status: Patch Available (was: Open) > Restore file traversal behavior to Pig loaders > -

[jira] Updated: (PIG-1298) Restore file traversal behavior to Pig loaders

2010-03-18 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1298?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Richard Ding updated PIG-1298: -- Attachment: PIG-1298.patch This patch added PigFileInputFormat, PigTextInputFormat and PigSequenceFileIn

[jira] Commented: (PIG-1266) Show spill count on the pig console at the end of the job

2010-03-18 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1266?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12847144#action_12847144 ] Richard Ding commented on PIG-1266: --- +1 > Show spill count on the pig console at the end o

[jira] Assigned: (PIG-1266) Show spill count on the pig console at the end of the job

2010-03-18 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1266?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Richard Ding reassigned PIG-1266: - Assignee: Sriranjan Manjunath (was: Richard Ding) Pig counter is enabled by PIG-1287. > Show spi

[jira] Updated: (PIG-1266) Show spill count on the pig console at the end of the job

2010-03-18 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1266?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Richard Ding updated PIG-1266: -- Resolution: Fixed Hadoop Flags: [Reviewed] Status: Resolved (was: Patch Available) > Sho

[jira] Updated: (PIG-1298) Restore file traversal behavior to Pig loaders

2010-03-19 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1298?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Richard Ding updated PIG-1298: -- Status: Patch Available (was: Open) > Restore file traversal behavior to Pig loaders > -

[jira] Updated: (PIG-1298) Restore file traversal behavior to Pig loaders

2010-03-19 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1298?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Richard Ding updated PIG-1298: -- Status: Open (was: Patch Available) > Restore file traversal behavior to Pig loaders > -

[jira] Updated: (PIG-1298) Restore file traversal behavior to Pig loaders

2010-03-19 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1298?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Richard Ding updated PIG-1298: -- Attachment: PIG-1298_1.patch Fix release audit issue. > Restore file traversal behavior to Pig loaders >

[jira] Commented: (PIG-1238) Dump does not respect the schema

2010-03-19 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1238?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12847474#action_12847474 ] Richard Ding commented on PIG-1238: --- Hi Ankur, I run following script {code} A = LOAD '1.

[jira] Updated: (PIG-1298) Restore file traversal behavior to Pig loaders

2010-03-19 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1298?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Richard Ding updated PIG-1298: -- Resolution: Fixed Hadoop Flags: [Reviewed] Status: Resolved (was: Patch Available) > Res

[jira] Commented: (PIG-1325) Provide a way to exclude a testcase when running "ant test"

2010-03-22 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1325?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12848449#action_12848449 ] Richard Ding commented on PIG-1325: --- +1 > Provide a way to exclude a testcase when running

[jira] Updated: (PIG-1299) Implement Pig counter to track number of output rows for each output files

2010-03-25 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1299?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Richard Ding updated PIG-1299: -- Attachment: PIG-1299.patch This patch adds a Hadoop counter group: MultiStoreCounters. In the case of a

[jira] Updated: (PIG-1299) Implement Pig counter to track number of output rows for each output files

2010-03-25 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1299?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Richard Ding updated PIG-1299: -- Status: Patch Available (was: Open) > Implement Pig counter to track number of output rows for each out

[jira] Commented: (PIG-1316) TextLoader should use Bzip2TextInputFormat for bzip files so that bzip files can be efficiently processed by splitting the files

2010-03-26 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1316?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12850246#action_12850246 ] Richard Ding commented on PIG-1316: --- +1 > TextLoader should use Bzip2TextInputFormat for b

[jira] Commented: (PIG-1336) Optimize POStore serialized into JobConf

2010-03-30 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1336?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12851518#action_12851518 ] Richard Ding commented on PIG-1336: --- In the multi-store case, the parent plan can be set in

[jira] Commented: (PIG-1335) UDFFinder should find LoadFunc used by POCast

2010-03-30 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1335?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12851519#action_12851519 ] Richard Ding commented on PIG-1335: --- +1 > UDFFinder should find LoadFunc used by POCast >

[jira] Commented: (PIG-1336) Optimize POStore serialized into JobConf

2010-03-31 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1336?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12851970#action_12851970 ] Richard Ding commented on PIG-1336: --- +1 > Optimize POStore serialized into JobConf > -

[jira] Updated: (PIG-1341) BinStorage cannot convert DataByteArray to Chararray and results in FIELD_DISCARDED_TYPE_CONVERSION_FAILED

2010-03-31 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1341?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Richard Ding updated PIG-1341: -- Summary: BinStorage cannot convert DataByteArray to Chararray and results in FIELD_DISCARDED_TYPE_CONVERS

[jira] Updated: (PIG-1341) BinStorage cannot convert DataByteArray to Chararray and results in FIELD_DISCARDED_TYPE_CONVERSION_FAILED

2010-03-31 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1341?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Richard Ding updated PIG-1341: -- Attachment: PIG-1341.patch > BinStorage cannot convert DataByteArray to Chararray and results in > FIELD

[jira] Updated: (PIG-1341) BinStorage cannot convert DataByteArray to Chararray and results in FIELD_DISCARDED_TYPE_CONVERSION_FAILED

2010-03-31 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1341?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Richard Ding updated PIG-1341: -- Status: Patch Available (was: Open) > BinStorage cannot convert DataByteArray to Chararray and results i

[jira] Assigned: (PIG-1333) API interface to Pig

2010-04-01 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1333?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Richard Ding reassigned PIG-1333: - Assignee: Richard Ding > API interface to Pig > > > Key: PIG-1

[jira] Assigned: (PIG-864) Record graph of execution of Map-Reduce jobs executed by a Pig script

2010-04-02 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-864?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Richard Ding reassigned PIG-864: Assignee: Richard Ding > Record graph of execution of Map-Reduce jobs executed by a Pig script > -

[jira] Assigned: (PIG-1280) Add a pig-script-id to the JobConf of all jobs run in a pig-script

2010-04-02 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1280?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Richard Ding reassigned PIG-1280: - Assignee: Richard Ding > Add a pig-script-id to the JobConf of all jobs run in a pig-script > -

[jira] Assigned: (PIG-809) number of input lines it processed, number of output lines it produced for PIG job

2010-04-02 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-809?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Richard Ding reassigned PIG-809: Assignee: Richard Ding > number of input lines it processed, number of output lines it produced for >

[jira] Assigned: (PIG-857) Pig should implement Tool interface from Hadoop

2010-04-02 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-857?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Richard Ding reassigned PIG-857: Assignee: Richard Ding > Pig should implement Tool interface from Hadoop > ---

[jira] Assigned: (PIG-62) Need to add pig script and input dirs (in clear text format) to jobconf

2010-04-02 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-62?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Richard Ding reassigned PIG-62: --- Assignee: Richard Ding > Need to add pig script and input dirs (in clear text format) to jobconf > --

[jira] Commented: (PIG-1348) InternalCachedBag running out of memory

2010-04-02 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1348?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12853016#action_12853016 ] Richard Ding commented on PIG-1348: --- The problem seems not with the InternalCachedBag. The

[jira] Commented: (PIG-864) Record graph of execution of Map-Reduce jobs executed by a Pig script

2010-04-02 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-864?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12853031#action_12853031 ] Richard Ding commented on PIG-864: -- I'm thinking about logging this information as part of im

[jira] Updated: (PIG-1348) PigStorage making unnecessary byte array copy when storing data

2010-04-05 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1348?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Richard Ding updated PIG-1348: -- Summary: PigStorage making unnecessary byte array copy when storing data (was: InternalCachedBag running

[jira] Assigned: (PIG-908) Need a way to correlate MR jobs with Pig statements

2010-04-05 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-908?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Richard Ding reassigned PIG-908: Assignee: Richard Ding > Need a way to correlate MR jobs with Pig statements > ---

[jira] Commented: (PIG-864) Record graph of execution of Map-Reduce jobs executed by a Pig script

2010-04-05 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-864?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12853546#action_12853546 ] Richard Ding commented on PIG-864: -- OK, I'll take PIG-908 and see if it can be knocked out :-

[jira] Updated: (PIG-1348) PigStorage making unnecessary byte array copy when storing data

2010-04-05 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1348?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Richard Ding updated PIG-1348: -- Attachment: PIG-1348.patch This patch removes the extra copying of byte arrays in PigStorage. > PigStor

[jira] Updated: (PIG-1348) PigStorage making unnecessary byte array copy when storing data

2010-04-05 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1348?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Richard Ding updated PIG-1348: -- Status: Patch Available (was: Open) > PigStorage making unnecessary byte array copy when storing data >

[jira] Updated: (PIG-1348) PigStorage making unnecessary byte array copy when storing data

2010-04-06 Thread Richard Ding (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1348?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Richard Ding updated PIG-1348: -- Status: Open (was: Patch Available) > PigStorage making unnecessary byte array copy when storing data >

<    1   2   3   4   5   6   7   >