[jira] Assigned: (PIG-623) Fix spelling errors in output messages

2009-11-11 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates reassigned PIG-623: -- Assignee: Tom White > Fix spelling errors in output messa

[jira] Assigned: (PIG-692) when running script file, automatically set up job name based on the file name

2009-11-11 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-692?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates reassigned PIG-692: -- Assignee: Vadim Zaliva > when running script file, automatically set up job name based on the file n

[jira] Assigned: (PIG-703) Pig trunk/src/docs folders and files for forrest xml doc builds

2009-11-11 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-703?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates reassigned PIG-703: -- Assignee: Corinne Chandel > Pig trunk/src/docs folders and files for forrest xml doc bui

[jira] Assigned: (PIG-704) Interactive mode doesn't list defined aliases

2009-11-11 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-704?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates reassigned PIG-704: -- Assignee: Eric Gaudet > Interactive mode doesn't list defined

[jira] Assigned: (PIG-713) Autocompletion doesn't complete aliases

2009-11-11 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-713?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates reassigned PIG-713: -- Assignee: Eric Gaudet > Autocompletion doesn't complete

[jira] Assigned: (PIG-712) Need utilities to create schemas for bags and tuples

2009-11-11 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-712?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates reassigned PIG-712: -- Assignee: Jeff Zhang > Need utilities to create schemas for bags and tup

[jira] Assigned: (PIG-732) Utility UDFs

2009-11-11 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-732?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates reassigned PIG-732: -- Assignee: Ankur > Utility UDFs > - > > Key: PIG-732 >

[jira] Assigned: (PIG-715) Remove 2 doc files: hello.pdf and overview.html

2009-11-11 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-715?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates reassigned PIG-715: -- Assignee: Corinne Chandel > Remove 2 doc files: hello.pdf and overview.h

[jira] Assigned: (PIG-833) Storage access layer

2009-11-11 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-833?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates reassigned PIG-833: -- Assignee: Raghu Angadi > Storage access layer > > >

[jira] Assigned: (PIG-782) javadoc throws warnings - this would break hudson patch test process.

2009-11-11 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-782?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates reassigned PIG-782: -- Assignee: Santhosh Srinivasan > javadoc throws warnings - this would break hudson patch test proc

[jira] Assigned: (PIG-781) Error reporting for failed MR jobs

2009-11-11 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-781?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates reassigned PIG-781: -- Assignee: Gunther Hagleitner > Error reporting for failed MR j

[jira] Assigned: (PIG-745) Please add DataTypes.toString() conversion function

2009-11-11 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-745?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates reassigned PIG-745: -- Assignee: David Ciemiewicz > Please add DataTypes.toString() conversion funct

[jira] Assigned: (PIG-753) Provide support for UDFs without parameters

2009-11-11 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-753?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates reassigned PIG-753: -- Assignee: Jeff Zhang > Provide support for UDFs without paramet

[jira] Assigned: (PIG-792) PERFORMANCE: Support skewed join in pig

2009-11-11 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-792?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates reassigned PIG-792: -- Assignee: Sriranjan Manjunath > PERFORMANCE: Support skewed join in

[jira] Assigned: (PIG-795) Command that selects a random sample of the rows, similar to LIMIT

2009-11-11 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-795?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates reassigned PIG-795: -- Assignee: Eric Gaudet > Command that selects a random sample of the rows, similar to LI

[jira] Assigned: (PIG-796) support conversion from numeric types to chararray

2009-11-11 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-796?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates reassigned PIG-796: -- Assignee: Ashutosh Chauhan > support conversion from numeric types to charar

[jira] Assigned: (PIG-817) Pig Docs for 0.3.0 Release

2009-11-11 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-817?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates reassigned PIG-817: -- Assignee: Corinne Chandel > Pig Docs for 0.3.0 Rele

[jira] Assigned: (PIG-825) PIG_HADOOP_VERSION should be 18

2009-11-11 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-825?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates reassigned PIG-825: -- Assignee: Dmitriy V. Ryaboy > PIG_HADOOP_VERSION should be

[jira] Assigned: (PIG-802) PERFORMANCE: not creating bags for ORDER BY

2009-11-11 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-802?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates reassigned PIG-802: -- Assignee: Rakesh Setty > PERFORMANCE: not creating bags for ORDER

[jira] Assigned: (PIG-830) Port Apache Log parsing piggybank contrib to Pig 0.2

2009-11-11 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-830?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates reassigned PIG-830: -- Assignee: Dmitriy V. Ryaboy > Port Apache Log parsing piggybank contrib to Pig

[jira] Assigned: (PIG-837) docs ant target is broken

2009-11-11 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-837?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates reassigned PIG-837: -- Assignee: Olga Natkovich > docs ant target is bro

[jira] Assigned: (PIG-868) indexof / lastindexof / lower / replace / substring udf's

2009-11-11 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-868?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates reassigned PIG-868: -- Assignee: Bennie Schut > indexof / lastindexof / lower / replace / substring ud

[jira] Assigned: (PIG-849) Local engine loses records in splits

2009-11-11 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-849?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates reassigned PIG-849: -- Assignee: Gunther Hagleitner > Local engine loses records in spl

[jira] Assigned: (PIG-862) Pig Site - 0.3.0 updates

2009-11-11 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-862?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates reassigned PIG-862: -- Assignee: Corinne Chandel > Pig Site - 0.3.0 updates > > >

[jira] Assigned: (PIG-890) Create a sampler interface and improve the skewed join sampler

2009-11-11 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-890?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates reassigned PIG-890: -- Assignee: Sriranjan Manjunath > Create a sampler interface and improve the skewed join samp

[jira] Assigned: (PIG-895) Default parallel for Pig

2009-11-11 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-895?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates reassigned PIG-895: -- Assignee: Daniel Dai > Default parallel for Pig > > >

[jira] Assigned: (PIG-907) Provide multiple version of HashFNV (Piggybank)

2009-11-11 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-907?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates reassigned PIG-907: -- Assignee: Daniel Dai > Provide multiple version of HashFNV (Piggyb

[jira] Assigned: (PIG-905) TOKENIZE throws exception on null data

2009-11-11 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-905?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates reassigned PIG-905: -- Assignee: Daniel Dai > TOKENIZE throws exception on null d

[jira] Assigned: (PIG-913) Error in Pig script when grouping on chararray column

2009-11-11 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-913?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates reassigned PIG-913: -- Assignee: Daniel Dai > Error in Pig script when grouping on chararray col

[jira] Assigned: (PIG-911) [Piggybank] SequenceFileLoader

2009-11-11 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-911?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates reassigned PIG-911: -- Assignee: Dmitriy V. Ryaboy > [Piggybank] SequenceFileLoa

[jira] Assigned: (PIG-919) Type mismatch in key from map: expected org.apache.pig.impl.io.NullableBytesWritable, recieved org.apache.pig.impl.io.NullableText when doing simple group

2009-11-11 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-919?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates reassigned PIG-919: -- Assignee: Viraj Bhat > Type mismatch in key from map: expec

[jira] Assigned: (PIG-929) Default value of memusage for skewed join is not correct

2009-11-11 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-929?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates reassigned PIG-929: -- Assignee: Ying He > Default value of memusage for skewed join is not corr

[jira] Assigned: (PIG-924) Make Pig work with multiple versions of Hadoop

2009-11-11 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-924?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates reassigned PIG-924: -- Assignee: Dmitriy V. Ryaboy > Make Pig work with multiple versions of Had

[jira] Assigned: (PIG-923) Allow setting logfile location in pig.properties

2009-11-11 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-923?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates reassigned PIG-923: -- Assignee: Dmitriy V. Ryaboy > Allow setting logfile location in pig.propert

[jira] Assigned: (PIG-935) Skewed join throws an exception when used with map keys

2009-11-11 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-935?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates reassigned PIG-935: -- Assignee: Sriranjan Manjunath > Skewed join throws an exception when used with map k

[jira] Assigned: (PIG-958) Splitting output data on key field

2009-11-11 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-958?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates reassigned PIG-958: -- Assignee: Ankur > Splitting output data on key fi

[jira] Assigned: (PIG-960) Using Hadoop's optimized LineRecordReader for reading Tuples in PigStorage

2009-11-11 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-960?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates reassigned PIG-960: -- Assignee: Ankit Modi > Using Hadoop's optimized LineRecordReader for reading Tuples in Pi

[jira] Assigned: (PIG-938) Pig Docs for 0.4.0

2009-11-11 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-938?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates reassigned PIG-938: -- Assignee: Corinne Chandel > Pig Docs for 0.4.0 > -- > >

[jira] Assigned: (PIG-968) findContainingJar fails when there's a + in the path

2009-11-11 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-968?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates reassigned PIG-968: -- Assignee: Todd Lipcon > findContainingJar fails when there's a + in

[jira] Assigned: (PIG-989) Allow type merge between numerical type and non-numerical type

2009-11-11 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-989?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates reassigned PIG-989: -- Assignee: Daniel Dai > Allow type merge between numerical type and non-numerical t

[jira] Assigned: (PIG-1008) FINDBUGS: NP_TOSTRING_COULD_RETURN_NULL

2009-11-11 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1008?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates reassigned PIG-1008: --- Assignee: Olga Natkovich > FINDBUGS: NP_TOSTRING_COULD_RETURN_N

[jira] Assigned: (PIG-1006) FINDBUGS: EQ_COMPARETO_USE_OBJECT_EQUALS in bags and tuples

2009-11-11 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1006?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates reassigned PIG-1006: --- Assignee: Olga Natkovich > FINDBUGS: EQ_COMPARETO_USE_OBJECT_EQUALS in bags and tup

[jira] Assigned: (PIG-1007) FINDBUGS: HE_EQUALS_USE_HASHCODE

2009-11-11 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1007?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates reassigned PIG-1007: --- Assignee: Olga Natkovich > FINDBUGS: HE_EQUALS_USE_HASHC

[jira] Assigned: (PIG-1009) FINDBUGS: OS_OPEN_STREAM: Method may fail to close stream

2009-11-11 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1009?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates reassigned PIG-1009: --- Assignee: Olga Natkovich > FINDBUGS: OS_OPEN_STREAM: Method may fail to close str

[jira] Assigned: (PIG-1010) FINDBUGS: RV_RETURN_VALUE_IGNORED_BAD_PRACTICE

2009-11-11 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1010?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates reassigned PIG-1010: --- Assignee: Olga Natkovich > FINDBUGS: RV_RETURN_VALUE_IGNORED_BAD_PRACT

[jira] Assigned: (PIG-1012) FINDBUGS: SE_BAD_FIELD: Non-transient non-serializable instance field in serializable class

2009-11-11 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1012?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates reassigned PIG-1012: --- Assignee: Olga Natkovich > FINDBUGS: SE_BAD_FIELD: Non-transient non-serializable instance field

[jira] Assigned: (PIG-1015) [piggybank] DateExtractor should take into account timezones

2009-11-11 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1015?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates reassigned PIG-1015: --- Assignee: Dmitriy V. Ryaboy > [piggybank] DateExtractor should take into account timezo

[jira] Assigned: (PIG-1013) FINDBUGS: DMI_INVOKING_TOSTRING_ON_ARRAY: Invocation of toString on an array

2009-11-11 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1013?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates reassigned PIG-1013: --- Assignee: Olga Natkovich > FINDBUGS: DMI_INVOKING_TOSTRING_ON_ARRAY: Invocation of toString on

[jira] Assigned: (PIG-1011) FINDBUGS: SE_NO_SERIALVERSIONID: Class is Serializable, but doesn't define serialVersionUID

2009-11-11 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1011?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates reassigned PIG-1011: --- Assignee: Olga Natkovich > FINDBUGS: SE_NO_SERIALVERSIONID: Class is Serializable, but doesn'

[jira] Assigned: (PIG-1018) FINDBUGS: NM_FIELD_NAMING_CONVENTION: Field names should start with a lower case letter

2009-11-11 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1018?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates reassigned PIG-1018: --- Assignee: Olga Natkovich > FINDBUGS: NM_FIELD_NAMING_CONVENTION: Field names should start with a lo

[jira] Assigned: (PIG-1032) FINDBUGS: DM_STRING_CTOR: Method invokes inefficient new String(String) constructor

2009-11-11 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1032?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates reassigned PIG-1032: --- Assignee: Olga Natkovich > FINDBUGS: DM_STRING_CTOR: Method invokes inefficient new String(Str

[jira] Updated: (PIG-1085) Pass JobConf and UDF specific configuration information to UDFs

2009-11-11 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1085?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates updated PIG-1085: Attachment: udfconf-2.patch > Pass JobConf and UDF specific configuration information to U

[jira] Updated: (PIG-1085) Pass JobConf and UDF specific configuration information to UDFs

2009-11-11 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1085?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates updated PIG-1085: Status: Patch Available (was: Open) Uploading new patch that addresses javac warnings and release audit

[jira] Updated: (PIG-1085) Pass JobConf and UDF specific configuration information to UDFs

2009-11-11 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1085?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates updated PIG-1085: Status: Open (was: Patch Available) > Pass JobConf and UDF specific configuration information to U

[jira] Assigned: (PIG-1033) javac warnings: deprecated hadoop APIs

2009-11-11 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1033?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates reassigned PIG-1033: --- Assignee: Daniel Dai > javac warnings: deprecated hadoop A

[jira] Assigned: (PIG-1039) Pig 0.5 Doc Updates

2009-11-11 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1039?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates reassigned PIG-1039: --- Assignee: Corinne Chandel > Pig 0.5 Doc Updates > --- > >

[jira] Commented: (PIG-1064) Behvaiour of COGROUP with and without schema when using "*" operator

2009-11-11 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1064?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12776520#action_12776520 ] Alan Gates commented on PIG-1064: - Why is cogrouping on * without a schema causing tro

[jira] Commented: (PIG-966) Proposed rework for LoadFunc, StoreFunc, and Slice/r interfaces

2009-11-11 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-966?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12776509#action_12776509 ] Alan Gates commented on PIG-966: Size on disk. It's not quite useless, as it can b

Re: package org.apache.hadoop.zebra.parse missing

2009-11-11 Thread Alan Gates
The parser package is generated as part of the build. Doing invoking ant in the contrib/zebra directory should result in the parser package being created at ./src-gen/org/apache/hadoop/zebra/parser Alan. On Nov 11, 2009, at 12:54 AM, Min Zhou wrote: Hi guys, I checked out pig from trunk,

[jira] Updated: (PIG-1085) Pass JobConf and UDF specific configuration information to UDFs

2009-11-10 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1085?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates updated PIG-1085: Status: Patch Available (was: Open) > Pass JobConf and UDF specific configuration information to U

[jira] Updated: (PIG-1085) Pass JobConf and UDF specific configuration information to UDFs

2009-11-10 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1085?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates updated PIG-1085: Attachment: udfconf.patch The attached patch creates a new singleton class UDFContext. This class contains

[jira] Created: (PIG-1085) Pass JobConf and UDF specific configuration information to UDFs

2009-11-10 Thread Alan Gates (JIRA)
Components: impl Reporter: Alan Gates Assignee: Alan Gates Users have long asked for a way to get the JobConf structure in their UDFs. It would also be nice to have a way to pass properties between the front end and back end so that UDFs can store state during parse

[jira] Updated: (PIG-1080) PigStorage may miss records when loading a file

2009-11-10 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1080?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates updated PIG-1080: Affects Version/s: 0.6.0 To be clear, this bug affects only trunk code, not any released version of Pig

[jira] Commented: (PIG-760) Serialize schemas for PigStorage() and other storage types.

2009-11-10 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-760?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12776086#action_12776086 ] Alan Gates commented on PIG-760: The issue is we want to break interfaces once, so we d

[jira] Commented: (PIG-1065) In-determinate behaviour of Union when there are 2 non-matching schema's

2009-11-10 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1065?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12775961#action_12775961 ] Alan Gates commented on PIG-1065: - As originally defined UNION does allow two inputs t

[jira] Updated: (PIG-1069) [zebra] Order Preserving Sorted Table Union

2009-11-09 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1069?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates updated PIG-1069: Resolution: Fixed Fix Version/s: 0.6.0 Status: Resolved (was: Patch Available) Patch

[jira] Commented: (PIG-979) Acummulator Interface for UDFs

2009-11-09 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12775158#action_12775158 ] Alan Gates commented on PIG-979: A test should be added that checks that when accumul

Re: [VOTE] Branch for Pig 0.6.0 release

2009-11-09 Thread Alan Gates
+1. In addition to the new features we've added, our change to use Hadoop's LineRecordReader brought Pig to parity with Hadoop in the PigMix tests, about a 30% average performance improvement. This should be huge for our users. Alan. On Nov 9, 2009, at 12:26 PM, Olga Natkovich wrote: H

[jira] Updated: (PIG-997) [zebra] Sorted Table Support by Zebra

2009-11-05 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-997?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates updated PIG-997: --- Resolution: Fixed Status: Resolved (was: Patch Available) All the nightly tests now pass. Patch

Re: [jira] Commented: (PIG-970) Support of HBase 0.20.0

2009-11-05 Thread Alan Gates
Switching to pig-dev since the JIRA need not record discussions on release planning. I don't know if there will be a 0.5.1 or not. We don't currently have a proposed release date for 0.6.0. PIG-1048 is a fairly serious bug in the skew join stuff. We may want to consider a 0.5.1 release

[jira] Commented: (PIG-997) [zebra] Sorted Table Support by Zebra

2009-11-03 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-997?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12773192#action_12773192 ] Alan Gates commented on PIG-997: After applying this patch TestColumnSecurity fails.

[jira] Commented: (PIG-970) Support of HBase 0.20.0

2009-11-03 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-970?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12773314#action_12773314 ] Alan Gates commented on PIG-970: Yes, it's there. > Support of

[jira] Commented: (PIG-970) Support of HBase 0.20.0

2009-11-03 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-970?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12773348#action_12773348 ] Alan Gates commented on PIG-970: afterside:~/src/pig/PIG-970-3/trunk> jar

Re: LoadFunc.skipNext() function for faster sampling ?

2009-11-03 Thread Alan Gates
We definitely want to avoid parsing every tuple when sampling. But do we need to implement a special function for it? Pig will have access to the InputFormat instance, correct? Can it not call InputFormat.getNext the desired number of times (which will not parse the tuple) and then call

[jira] Updated: (PIG-970) Support of HBase 0.20.0

2009-11-03 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-970?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates updated PIG-970: --- Attachment: test-output.tgz TEST-org.apache.pig.test.TestHBaseStorage.txt Test run results plus

[jira] Commented: (PIG-1048) inner join using 'skewed' produces multiple rows for keys with single row in both input relations

2009-11-03 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1048?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=1277#action_1277 ] Alan Gates commented on PIG-1048: - When attempting to apply this patch to the 0.5 branc

[jira] Commented: (PIG-970) Support of HBase 0.20.0

2009-11-03 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-970?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12773103#action_12773103 ] Alan Gates commented on PIG-970: When I run TestHBaseStorage now I get: Test

[jira] Commented: (PIG-970) Support of HBase 0.20.0

2009-11-02 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-970?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12772781#action_12772781 ] Alan Gates commented on PIG-970: Patch doesn't include binary files. I'll pul

[jira] Commented: (PIG-1038) Optimize nested distinct/sort to use secondary key

2009-11-02 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1038?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12772774#action_12772774 ] Alan Gates commented on PIG-1038: - I agree that we need a framework for optimizations in

[jira] Commented: (PIG-1037) better memory layout and spill for sorted and distinct bags

2009-11-02 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1037?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12772772#action_12772772 ] Alan Gates commented on PIG-1037: - The difference is much more than switching from dum

[jira] Resolved: (PIG-477) passing properties from command line to the backend

2009-10-30 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-477?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates resolved PIG-477. Resolution: Duplicate Marking as duplicate of PIG-602 > passing properties from command line to the back

[jira] Updated: (PIG-1048) inner join using 'skewed' produces multiple rows for keys with single row in both input relations

2009-10-30 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1048?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates updated PIG-1048: Resolution: Fixed Fix Version/s: 0.6.0 Status: Resolved (was: Patch Available) Patch

[jira] Commented: (PIG-1053) Consider moving to Hadoop for local mode

2009-10-30 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1053?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12772144#action_12772144 ] Alan Gates commented on PIG-1053: - For testing purposes we could simply change Main to

[jira] Assigned: (PIG-1053) Consider moving to Hadoop for local mode

2009-10-30 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1053?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates reassigned PIG-1053: --- Assignee: Ankit Modi > Consider moving to Hadoop for local m

[jira] Updated: (PIG-1057) [Zebra] Zebra does not support concurrent deletions of column groups now.

2009-10-30 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1057?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates updated PIG-1057: Resolution: Fixed Status: Resolved (was: Patch Available) Patch checked in. > [Zebra] Zebra d

[jira] Commented: (PIG-1048) inner join using 'skewed' produces multiple rows for keys with single row in both input relations

2009-10-28 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1048?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12771203#action_12771203 ] Alan Gates commented on PIG-1048: - Could you describe briefly the cause of the problem

[jira] Commented: (PIG-1016) Reading in map data seems broken

2009-10-28 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1016?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12771200#action_12771200 ] Alan Gates commented on PIG-1016: - I am keeping an eye on this ticket. But at this p

[jira] Commented: (PIG-1001) Generate more meaningful error message when one input file does not exist

2009-10-28 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1001?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12771072#action_12771072 ] Alan Gates commented on PIG-1001: - I have a question on this cod

[jira] Updated: (PIG-1037) better memory layout and spill for sorted and distinct bags

2009-10-28 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1037?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates updated PIG-1037: Resolution: Fixed Fix Version/s: 0.6.0 Status: Resolved (was: Patch Available) Patch

[jira] Commented: (PIG-970) Support of HBase 0.20.0

2009-10-28 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-970?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12770974#action_12770974 ] Alan Gates commented on PIG-970: I haven't been able to get the unit test to p

[jira] Commented: (PIG-966) Proposed rework for LoadFunc, StoreFunc, and Slice/r interfaces

2009-10-27 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-966?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12770580#action_12770580 ] Alan Gates commented on PIG-966: A new branch in svn, load-store-redesign, has been cre

[jira] Commented: (PIG-760) Serialize schemas for PigStorage() and other storage types.

2009-10-27 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-760?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12770573#action_12770573 ] Alan Gates commented on PIG-760: I know I'm wandering dangerously close to being

Re: [VOTE] Release Pig 0.5.0 (candidate 0)

2009-10-26 Thread Alan Gates
+1 On my laptop (mac) ran tutorial in both local and hadoop modes, ran a join/group/sort/limit script in both local and hadoop modes, did build of pig and contrib. On linux box did build of both pig and contrib, ran a join/group/sort/ limit script in both local and hadoop modes. Alan. On

[jira] Commented: (PIG-1053) Consider moving to Hadoop for local mode

2009-10-26 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1053?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12770237#action_12770237 ] Alan Gates commented on PIG-1053: - Currently Pig has its own backend implementa

[jira] Created: (PIG-1053) Consider moving to Hadoop for local mode

2009-10-26 Thread Alan Gates (JIRA)
Consider moving to Hadoop for local mode Key: PIG-1053 URL: https://issues.apache.org/jira/browse/PIG-1053 Project: Pig Issue Type: Improvement Reporter: Alan Gates We need to consider

[jira] Commented: (PIG-1037) better memory layout and spill for sorted and distinct bags

2009-10-26 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1037?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12770206#action_12770206 ] Alan Gates commented on PIG-1037: - Comments: In InternalSortedBag.add, you are calcula

[jira] Updated: (PIG-996) [zebra] Zebra build script does not have findbugs and clover targets.

2009-10-23 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-996?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates updated PIG-996: --- Resolution: Fixed Status: Resolved (was: Patch Available) Checked in the patch. > [zebra] Zebra bu

[jira] Updated: (PIG-1027) Number of bytes written are always zero in local mode

2009-10-23 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1027?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates updated PIG-1027: Resolution: Fixed Fix Version/s: 0.6.0 Status: Resolved (was: Patch Available) Fix

[jira] Updated: (PIG-984) PERFORMANCE: Implement a map-side group operator to speed up processing of ordered data

2009-10-22 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-984?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates updated PIG-984: --- Resolution: Fixed Status: Resolved (was: Patch Available) Patch committed. Thanks Richard

[jira] Commented: (PIG-927) null should be handled consistently in Join

2009-10-22 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-927?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12768825#action_12768825 ] Alan Gates commented on PIG-927: Sorry, I missed the \t at the end of the line. Test l

<    3   4   5   6   7   8   9   10   11   12   >