[jira] Updated: (PIG-1303) unable to set outgoing format for org.apache.pig.piggybank.evaluation.util.apachelogparser.DateExtractor

2010-04-15 Thread JIRA

 [ 
https://issues.apache.org/jira/browse/PIG-1303?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Johannes Rußek updated PIG-1303:


Attachment: TypeCheckingVisitor.java.diff

Hello everybody,
this fixes it for us.
Apparently due to the Typemagic the FuncSpec that is returned by our 
EvalFunc.getArgToFuncMapping() which doesn't contain the constructor arguments 
is used instead of the perfectly fine one in func, which contains the 
constructor arguments.
We've just copied the arguments from func to matchingSpec to make it work.
Johannes

 unable to set outgoing format for 
 org.apache.pig.piggybank.evaluation.util.apachelogparser.DateExtractor
 

 Key: PIG-1303
 URL: https://issues.apache.org/jira/browse/PIG-1303
 Project: Pig
  Issue Type: Bug
Affects Versions: 0.6.0
 Environment: pig 0.6.0 on a fedora linux machine, jdk 1.6 u11
Reporter: Johannes Rußek
Assignee: Dmitriy V. Ryaboy
 Attachments: TypeCheckingVisitor.java.diff


 I'm unable to set the format of the outgoing date string in the constructor 
 as it's supposed to work. 
 The only way i could change the format was to change the default in the java 
 class and rebuild piggybank.
 Apparently this has something to do with the way pig instantiates 
 DateExtractor, quoting a replier on the mailing list:
 David Vrensk said:
 I ran into the same problem a couple of weeks ago, and
 played around with the code inserting some print/log statements.  It turns
 out that the arguments are only used in the initial constructor calls, when
 the pig process is starting, but once pig reaches the point where it would
 use the udf, it creates new DateExtractors without passing the arguments.

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators: 
https://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] Updated: (PIG-1375) [Zebra] To support writing multiple Zebra tables through Pig

2010-04-15 Thread Chao Wang (JIRA)

 [ 
https://issues.apache.org/jira/browse/PIG-1375?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Chao Wang updated PIG-1375:
---

Attachment: PIG-1375.patch

 [Zebra] To support writing multiple Zebra tables through Pig
 

 Key: PIG-1375
 URL: https://issues.apache.org/jira/browse/PIG-1375
 Project: Pig
  Issue Type: New Feature
Affects Versions: 0.8.0
Reporter: Chao Wang
Assignee: Chao Wang
 Fix For: 0.8.0

 Attachments: PIG-1375.patch


 In Zebra, we already have multiple outputs support for map/reduce.  But we do 
 not support this feature if users use Zebra through Pig.
 This jira is to address this issue. We plan to support writing to multiple 
 output tables through Pig as well.
 We propose to support the following Pig store statements with multiple 
 outputs:
 store relation into 'loc1,loc2,loc3' using 
 org.apache.hadoop.zebra.pig.TableStorer('storagehint_string',
 'complete name of your custom partition class', 'some arguments to partition 
 class'); /* if certain partition class arguments is needed */
 store relation into 'loc1,loc2,loc3' using 
 org.apache.hadoop.zebra.pig.TableStorer('storagehint_string',
 'complete name of your custom partition class'); /* if no partition class 
 arguments is needed */
 Note that users need to specify up to three arguments - storage hint string, 
 complete name of partition class and partition class arguments string.

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators: 
https://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] Updated: (PIG-1351) [Zebra] No type check when we write to the basic table

2010-04-15 Thread Chao Wang (JIRA)

 [ 
https://issues.apache.org/jira/browse/PIG-1351?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Chao Wang updated PIG-1351:
---

Attachment: PIG-1351.patch

 [Zebra] No type check when we write to the basic table
 --

 Key: PIG-1351
 URL: https://issues.apache.org/jira/browse/PIG-1351
 Project: Pig
  Issue Type: Improvement
Affects Versions: 0.6.0, 0.7.0, 0.8.0
Reporter: Chao Wang
Assignee: Chao Wang
 Fix For: 0.8.0

 Attachments: PIG-1351.patch, PIG-1351.patch


 In Zebra, we do not have any type check when writing to a basic table. 
 Say, we have a schema: f1:int, f2:string,
 however we can write a tuple (abc, 123) without any problem, which is 
 definitely not desirable.
 To overcome this problem, we decide to perform certain amount of type 
 checking in Zebra - We check the first row only for each writer.
 This only serves as a sanity check purpose in cases where users screw up 
 specifying the output schema. We do NOT perform a rigorous type checking for 
 all rows for apparently performance concerns.

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators: 
https://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] Updated: (PIG-1351) [Zebra] No type check when we write to the basic table

2010-04-15 Thread Chao Wang (JIRA)

 [ 
https://issues.apache.org/jira/browse/PIG-1351?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Chao Wang updated PIG-1351:
---


One small change: we only check the first non-null map value for map and the 
first record for collection.

 [Zebra] No type check when we write to the basic table
 --

 Key: PIG-1351
 URL: https://issues.apache.org/jira/browse/PIG-1351
 Project: Pig
  Issue Type: Improvement
Affects Versions: 0.6.0, 0.7.0, 0.8.0
Reporter: Chao Wang
Assignee: Chao Wang
 Fix For: 0.8.0

 Attachments: PIG-1351.patch, PIG-1351.patch


 In Zebra, we do not have any type check when writing to a basic table. 
 Say, we have a schema: f1:int, f2:string,
 however we can write a tuple (abc, 123) without any problem, which is 
 definitely not desirable.
 To overcome this problem, we decide to perform certain amount of type 
 checking in Zebra - We check the first row only for each writer.
 This only serves as a sanity check purpose in cases where users screw up 
 specifying the output schema. We do NOT perform a rigorous type checking for 
 all rows for apparently performance concerns.

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators: 
https://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] Updated: (PIG-1351) [Zebra] No type check when we write to the basic table

2010-04-15 Thread Chao Wang (JIRA)

 [ 
https://issues.apache.org/jira/browse/PIG-1351?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Chao Wang updated PIG-1351:
---

   Status: Patch Available  (was: Open)
Affects Version/s: (was: 0.6.0)
   (was: 0.8.0)

 [Zebra] No type check when we write to the basic table
 --

 Key: PIG-1351
 URL: https://issues.apache.org/jira/browse/PIG-1351
 Project: Pig
  Issue Type: Improvement
Affects Versions: 0.7.0
Reporter: Chao Wang
Assignee: Chao Wang
 Fix For: 0.8.0

 Attachments: PIG-1351.patch, PIG-1351.patch


 In Zebra, we do not have any type check when writing to a basic table. 
 Say, we have a schema: f1:int, f2:string,
 however we can write a tuple (abc, 123) without any problem, which is 
 definitely not desirable.
 To overcome this problem, we decide to perform certain amount of type 
 checking in Zebra - We check the first row only for each writer.
 This only serves as a sanity check purpose in cases where users screw up 
 specifying the output schema. We do NOT perform a rigorous type checking for 
 all rows for apparently performance concerns.

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators: 
https://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] Updated: (PIG-1351) [Zebra] No type check when we write to the basic table

2010-04-15 Thread Chao Wang (JIRA)

 [ 
https://issues.apache.org/jira/browse/PIG-1351?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Chao Wang updated PIG-1351:
---

Status: Open  (was: Patch Available)

 [Zebra] No type check when we write to the basic table
 --

 Key: PIG-1351
 URL: https://issues.apache.org/jira/browse/PIG-1351
 Project: Pig
  Issue Type: Improvement
Affects Versions: 0.7.0
Reporter: Chao Wang
Assignee: Chao Wang
 Fix For: 0.8.0

 Attachments: PIG-1351.patch, PIG-1351.patch


 In Zebra, we do not have any type check when writing to a basic table. 
 Say, we have a schema: f1:int, f2:string,
 however we can write a tuple (abc, 123) without any problem, which is 
 definitely not desirable.
 To overcome this problem, we decide to perform certain amount of type 
 checking in Zebra - We check the first row only for each writer.
 This only serves as a sanity check purpose in cases where users screw up 
 specifying the output schema. We do NOT perform a rigorous type checking for 
 all rows for apparently performance concerns.

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators: 
https://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] Updated: (PIG-1372) Restore PigInputFormat.sJob for backward compatibility

2010-04-15 Thread Pradeep Kamath (JIRA)

 [ 
https://issues.apache.org/jira/browse/PIG-1372?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Pradeep Kamath updated PIG-1372:


  Status: Resolved  (was: Patch Available)
Hadoop Flags: [Reviewed]
  Resolution: Fixed

All tests pass on my local machine - patch committed to 0.7 and trunk

 Restore PigInputFormat.sJob for backward compatibility
 --

 Key: PIG-1372
 URL: https://issues.apache.org/jira/browse/PIG-1372
 Project: Pig
  Issue Type: Bug
Affects Versions: 0.7.0
Reporter: Pradeep Kamath
Assignee: Pradeep Kamath
 Fix For: 0.7.0

 Attachments: PIG-1372-2.patch, PIG-1372.patch


 The preferred method to get the job's Configuration object would be to use 
 UDFContext.getJobConf(). This jira is to restore PigInputFormat.sJob  (but we 
 will be marking it deprecated and indicating to use UDFContext.getJobConf() 
 instead) to be backward compatible - we can remove it from pig in a future 
 release.

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators: 
https://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] Updated: (PIG-1363) Unnecessary loadFunc instantiations

2010-04-15 Thread Ashutosh Chauhan (JIRA)

 [ 
https://issues.apache.org/jira/browse/PIG-1363?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ashutosh Chauhan updated PIG-1363:
--

Attachment: pig-1363_1.patch

Can't get away without pasing loader signature to backend for Merge Join. So, 
set it.

 Unnecessary loadFunc instantiations
 ---

 Key: PIG-1363
 URL: https://issues.apache.org/jira/browse/PIG-1363
 Project: Pig
  Issue Type: Bug
Affects Versions: 0.7.0
Reporter: Ashutosh Chauhan
Assignee: Ashutosh Chauhan
 Fix For: 0.8.0

 Attachments: pig-1363.patch, pig-1363_1.patch


 In MRCompiler loadfuncs are instantiated at multiple locations in different 
 visit methods. This is inconsistent and confusing. LoadFunc should be 
 instantiated at only one place, ideally in LogToPhyTanslation#visit(LOLoad). 
 A getter should be added to POLoad to retrieve this instantiated loadFunc 
 wherever it is needed in later stages of compilation. 

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators: 
https://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] Commented: (PIG-1351) [Zebra] No type check when we write to the basic table

2010-04-15 Thread Hadoop QA (JIRA)

[ 
https://issues.apache.org/jira/browse/PIG-1351?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12857564#action_12857564
 ] 

Hadoop QA commented on PIG-1351:


-1 overall.  Here are the results of testing the latest attachment 
  http://issues.apache.org/jira/secure/attachment/12441851/PIG-1351.patch
  against trunk revision 934488.

+1 @author.  The patch does not contain any @author tags.

+1 tests included.  The patch appears to include 174 new or modified tests.

+1 javadoc.  The javadoc tool did not generate any warning messages.

+1 javac.  The applied patch does not increase the total number of javac 
compiler warnings.

+1 findbugs.  The patch does not introduce any new Findbugs warnings.

+1 release audit.  The applied patch does not increase the total number of 
release audit warnings.

-1 core tests.  The patch failed core unit tests.

+1 contrib tests.  The patch passed contrib unit tests.

Test results: 
http://hudson.zones.apache.org/hudson/job/Pig-Patch-h7.grid.sp2.yahoo.net/292/testReport/
Findbugs warnings: 
http://hudson.zones.apache.org/hudson/job/Pig-Patch-h7.grid.sp2.yahoo.net/292/artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
Console output: 
http://hudson.zones.apache.org/hudson/job/Pig-Patch-h7.grid.sp2.yahoo.net/292/console

This message is automatically generated.

 [Zebra] No type check when we write to the basic table
 --

 Key: PIG-1351
 URL: https://issues.apache.org/jira/browse/PIG-1351
 Project: Pig
  Issue Type: Improvement
Affects Versions: 0.7.0
Reporter: Chao Wang
Assignee: Chao Wang
 Fix For: 0.8.0

 Attachments: PIG-1351.patch, PIG-1351.patch


 In Zebra, we do not have any type check when writing to a basic table. 
 Say, we have a schema: f1:int, f2:string,
 however we can write a tuple (abc, 123) without any problem, which is 
 definitely not desirable.
 To overcome this problem, we decide to perform certain amount of type 
 checking in Zebra - We check the first row only for each writer.
 This only serves as a sanity check purpose in cases where users screw up 
 specifying the output schema. We do NOT perform a rigorous type checking for 
 all rows for apparently performance concerns.

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators: 
https://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] Updated: (PIG-1351) [Zebra] No type check when we write to the basic table

2010-04-15 Thread Chao Wang (JIRA)

 [ 
https://issues.apache.org/jira/browse/PIG-1351?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Chao Wang updated PIG-1351:
---

Attachment: (was: PIG-1351.patch)

 [Zebra] No type check when we write to the basic table
 --

 Key: PIG-1351
 URL: https://issues.apache.org/jira/browse/PIG-1351
 Project: Pig
  Issue Type: Improvement
Affects Versions: 0.7.0
Reporter: Chao Wang
Assignee: Chao Wang
 Fix For: 0.8.0

 Attachments: PIG-1351.patch


 In Zebra, we do not have any type check when writing to a basic table. 
 Say, we have a schema: f1:int, f2:string,
 however we can write a tuple (abc, 123) without any problem, which is 
 definitely not desirable.
 To overcome this problem, we decide to perform certain amount of type 
 checking in Zebra - We check the first row only for each writer.
 This only serves as a sanity check purpose in cases where users screw up 
 specifying the output schema. We do NOT perform a rigorous type checking for 
 all rows for apparently performance concerns.

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators: 
https://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] Updated: (PIG-1351) [Zebra] No type check when we write to the basic table

2010-04-15 Thread Chao Wang (JIRA)

 [ 
https://issues.apache.org/jira/browse/PIG-1351?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Chao Wang updated PIG-1351:
---

Attachment: (was: PIG-1351.patch)

 [Zebra] No type check when we write to the basic table
 --

 Key: PIG-1351
 URL: https://issues.apache.org/jira/browse/PIG-1351
 Project: Pig
  Issue Type: Improvement
Affects Versions: 0.7.0
Reporter: Chao Wang
Assignee: Chao Wang
 Fix For: 0.8.0

 Attachments: PIG-1351.patch


 In Zebra, we do not have any type check when writing to a basic table. 
 Say, we have a schema: f1:int, f2:string,
 however we can write a tuple (abc, 123) without any problem, which is 
 definitely not desirable.
 To overcome this problem, we decide to perform certain amount of type 
 checking in Zebra - We check the first row only for each writer.
 This only serves as a sanity check purpose in cases where users screw up 
 specifying the output schema. We do NOT perform a rigorous type checking for 
 all rows for apparently performance concerns.

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators: 
https://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] Updated: (PIG-1351) [Zebra] No type check when we write to the basic table

2010-04-15 Thread Chao Wang (JIRA)

 [ 
https://issues.apache.org/jira/browse/PIG-1351?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Chao Wang updated PIG-1351:
---

Attachment: PIG-1351.patch

 [Zebra] No type check when we write to the basic table
 --

 Key: PIG-1351
 URL: https://issues.apache.org/jira/browse/PIG-1351
 Project: Pig
  Issue Type: Improvement
Affects Versions: 0.7.0
Reporter: Chao Wang
Assignee: Chao Wang
 Fix For: 0.8.0

 Attachments: PIG-1351.patch


 In Zebra, we do not have any type check when writing to a basic table. 
 Say, we have a schema: f1:int, f2:string,
 however we can write a tuple (abc, 123) without any problem, which is 
 definitely not desirable.
 To overcome this problem, we decide to perform certain amount of type 
 checking in Zebra - We check the first row only for each writer.
 This only serves as a sanity check purpose in cases where users screw up 
 specifying the output schema. We do NOT perform a rigorous type checking for 
 all rows for apparently performance concerns.

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators: 
https://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see: http://www.atlassian.com/software/jira




Re: TypeCheckingVisitor and casting to less precise numeric types

2010-04-15 Thread Alan Gates
You are correct that all of these casts can be done.  We omitted them  
explicitly because of what you said that we did not want to loose  
precision.  We should be able to downcast when users ask explicitly  
for it, but we don't want to do this implicitly.


Alan.

On Mar 24, 2010, at 2:47 PM, Anil Chawla wrote:




Hi,
I know that Pig has logic for casting inputs to the expected data  
types

when invoking a UDF and I understand that this logic resides in the
TypeCheckingVisitor class. I am curious to know why certain casts  
have been
omitted from the castLookup map. Specifically, I do not see any  
entries
for casting a more precise numeric type (e.g. Double) to a less  
precise
numeric type (e.g. Integer). Any reason why all down conversions of  
numeric

types have been omitted? Is it because we do not want to perform any
automatic casts that lead to a loss of precision (loss of data)?

In my situation, we are trying to abstract all numeric data types  
into a
single number type. If a UDF takes a numeric parameter, we want  
Pig to

invoke that UDF with any numeric argument, regardless of whether the
argument must be upconverted or downconverted. We are OK with the  
loss of
precision in that circumstance. As a result, we added the following  
to the

castLookup map:

castLookup.put(DataType.LONG, DataType.INTEGER);
castLookup.put(DataType.FLOAT, DataType.LONG);
castLookup.put(DataType.FLOAT, DataType.INTEGER);
castLookup.put(DataType.DOUBLE, DataType.FLOAT);
castLookup.put(DataType.DOUBLE, DataType.LONG);
castLookup.put(DataType.DOUBLE, DataType.INTEGER);

All of these casts seem to work fine our tests. Other than loss of
precision, is there any reason why adding these casts might be a bad  
idea?


Thanks,
-Anil




[jira] Commented: (PIG-1363) Unnecessary loadFunc instantiations

2010-04-15 Thread Hadoop QA (JIRA)

[ 
https://issues.apache.org/jira/browse/PIG-1363?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12857590#action_12857590
 ] 

Hadoop QA commented on PIG-1363:


-1 overall.  Here are the results of testing the latest attachment 
  http://issues.apache.org/jira/secure/attachment/12441862/pig-1363_1.patch
  against trunk revision 934488.

+1 @author.  The patch does not contain any @author tags.

-1 tests included.  The patch doesn't appear to include any new or modified 
tests.
Please justify why no tests are needed for this patch.

+1 javadoc.  The javadoc tool did not generate any warning messages.

+1 javac.  The applied patch does not increase the total number of javac 
compiler warnings.

+1 findbugs.  The patch does not introduce any new Findbugs warnings.

+1 release audit.  The applied patch does not increase the total number of 
release audit warnings.

-1 core tests.  The patch failed core unit tests.

+1 contrib tests.  The patch passed contrib unit tests.

Test results: 
http://hudson.zones.apache.org/hudson/job/Pig-Patch-h8.grid.sp2.yahoo.net/298/testReport/
Findbugs warnings: 
http://hudson.zones.apache.org/hudson/job/Pig-Patch-h8.grid.sp2.yahoo.net/298/artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
Console output: 
http://hudson.zones.apache.org/hudson/job/Pig-Patch-h8.grid.sp2.yahoo.net/298/console

This message is automatically generated.

 Unnecessary loadFunc instantiations
 ---

 Key: PIG-1363
 URL: https://issues.apache.org/jira/browse/PIG-1363
 Project: Pig
  Issue Type: Bug
Affects Versions: 0.8.0
Reporter: Ashutosh Chauhan
Assignee: Ashutosh Chauhan
 Fix For: 0.8.0

 Attachments: pig-1363.patch, pig-1363_1.patch


 In MRCompiler loadfuncs are instantiated at multiple locations in different 
 visit methods. This is inconsistent and confusing. LoadFunc should be 
 instantiated at only one place, ideally in LogToPhyTanslation#visit(LOLoad). 
 A getter should be added to POLoad to retrieve this instantiated loadFunc 
 wherever it is needed in later stages of compilation. 

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators: 
https://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] Assigned: (PIG-1331) Owl Hadoop Table Management Service

2010-04-15 Thread Alan Gates (JIRA)

 [ 
https://issues.apache.org/jira/browse/PIG-1331?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Alan Gates reassigned PIG-1331:
---

Assignee: Ajay Kidave

 Owl Hadoop Table Management Service
 ---

 Key: PIG-1331
 URL: https://issues.apache.org/jira/browse/PIG-1331
 Project: Pig
  Issue Type: New Feature
Affects Versions: 0.8.0
Reporter: Jay Tang
Assignee: Ajay Kidave
 Attachments: anttestoutput.tgz, build.log, ivy_version.patch, 
 owl.contrib.3.tgz, owl.contrib.4.tar.gz


 This JIRA is a proposal to create a Hadoop table management service: Owl. 
 Today, MapReduce and Pig applications interacts directly with HDFS 
 directories and files and must deal with low level data management issues 
 such as storage format, serialization/compression schemes, data layout, and 
 efficient data accesses, etc, often with different solutions. Owl aims to 
 provide a standard way to addresses this issue and abstracts away the 
 complexities of reading/writing huge amount of data from/to HDFS.
 Owl has a data access API that is modeled after the traditional Hadoop 
 !InputFormt and a management API to manipulate Owl objects.  This JIRA is 
 related to Pig-823 (Hadoop Metadata Service) as Owl has an internal metadata 
 store.  Owl integrates with different storage module like Zebra with a 
 pluggable architecture.
  Initially, the proposal is to submit Owl as a Pig contrib project.  Over 
 time, it makes sense to move it to a Hadoop subproject.

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators: 
https://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] Updated: (PIG-1331) Owl Hadoop Table Management Service

2010-04-15 Thread Alan Gates (JIRA)

 [ 
https://issues.apache.org/jira/browse/PIG-1331?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Alan Gates updated PIG-1331:


   Status: Resolved  (was: Patch Available)
Fix Version/s: 0.8.0
   Resolution: Fixed

Patch 4 committed.  

 Owl Hadoop Table Management Service
 ---

 Key: PIG-1331
 URL: https://issues.apache.org/jira/browse/PIG-1331
 Project: Pig
  Issue Type: New Feature
Affects Versions: 0.8.0
Reporter: Jay Tang
Assignee: Ajay Kidave
 Fix For: 0.8.0

 Attachments: anttestoutput.tgz, build.log, ivy_version.patch, 
 owl.contrib.3.tgz, owl.contrib.4.tar.gz


 This JIRA is a proposal to create a Hadoop table management service: Owl. 
 Today, MapReduce and Pig applications interacts directly with HDFS 
 directories and files and must deal with low level data management issues 
 such as storage format, serialization/compression schemes, data layout, and 
 efficient data accesses, etc, often with different solutions. Owl aims to 
 provide a standard way to addresses this issue and abstracts away the 
 complexities of reading/writing huge amount of data from/to HDFS.
 Owl has a data access API that is modeled after the traditional Hadoop 
 !InputFormt and a management API to manipulate Owl objects.  This JIRA is 
 related to Pig-823 (Hadoop Metadata Service) as Owl has an internal metadata 
 store.  Owl integrates with different storage module like Zebra with a 
 pluggable architecture.
  Initially, the proposal is to submit Owl as a Pig contrib project.  Over 
 time, it makes sense to move it to a Hadoop subproject.

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators: 
https://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] Created: (PIG-1379) Jars registered from command line should override the ones present in the script

2010-04-15 Thread Ankur (JIRA)
Jars registered from command line should override the ones present in the 
script 
-

 Key: PIG-1379
 URL: https://issues.apache.org/jira/browse/PIG-1379
 Project: Pig
  Issue Type: Improvement
Reporter: Ankur
 Fix For: 0.7.0


Jars that are registered from the command line when executing the pig script 
should override the ones that are specified via 'register' in the pig script 
itself.

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators: 
https://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see: http://www.atlassian.com/software/jira