[ 
https://issues.apache.org/jira/browse/PIG-1331?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12850347#action_12850347
 ] 

Jay Tang commented on PIG-1331:
-------------------------------

Owl has an internal metastore whose relational table and partition model is 
similar to that of Hive's metastore.  Owl goes beyond this and provides a 
uniform data access mechanism on top of multiple storage formats.  This 
interface can be leveraged by Pig and MapReduce applications.  There is room 
for collaboration between Owl and Hive so that we could eventually converge on 
a common metastore for Hadoop.

> Owl Hadoop Table Management Service
> -----------------------------------
>
>                 Key: PIG-1331
>                 URL: https://issues.apache.org/jira/browse/PIG-1331
>             Project: Pig
>          Issue Type: New Feature
>            Reporter: Jay Tang
>
> This JIRA is a proposal to create a Hadoop table management service: Owl. 
> Today, MapReduce and Pig applications interact directly with HDFS 
> directories and files and must deal with low-level data management issues 
> such as storage format, serialization/compression schemes, data layout, and 
> efficient data access, often with different solutions. Owl aims to provide 
> a standard way to address these issues and to abstract away the 
> complexities of reading/writing huge amounts of data from/to HDFS.
> Owl has a data access API that is modeled after the traditional Hadoop 
> InputFormat and a management API to manipulate Owl objects.  This JIRA is 
> related to PIG-823 (Hadoop Metadata Service) as Owl has an internal metadata 
> store.  Owl integrates with different storage modules, such as Zebra, through 
> a pluggable architecture.
> Initially, the proposal is to submit Owl as a Pig contrib project.  Over 
> time, it makes sense to move it to a Hadoop subproject.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
