[
https://issues.apache.org/jira/browse/PIG-1331?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12850347#action_12850347
]
Jay Tang commented on PIG-1331:
-------------------------------
Owl has an internal metastore with a relational table and partition model
similar to that of Hive's metastore. Owl goes beyond this and provides a
uniform data access mechanism on top of multiple storage formats. This
interface can be leveraged by Pig and MapReduce applications. There is room
for collaboration between Owl and Hive so that we could eventually converge on
a common metastore for Hadoop.
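
To make "a uniform data access mechanism on top of multiple storage formats"
concrete, here is a rough, purely illustrative sketch. It is not Owl's API:
every name in it (FAKE_FS, DRIVERS, CATALOG, Partition, read_table) is
hypothetical, the data is invented, an in-memory dict stands in for HDFS, and
Python is used only for brevity. The idea shown is that a catalog records each
partition's keys, storage format, and location, so a caller can read a table
without knowing how any one partition is stored.

```python
import json
from dataclasses import dataclass
from typing import Callable, Dict, Iterator, List

Record = Dict[str, object]
Driver = Callable[[str], Iterator[Record]]

# Stand-in for HDFS (assumption): location -> raw file contents.
FAKE_FS: Dict[str, str] = {
    "/data/clicks/dt=2010-03-25": "alice\t3\nbob\t5\n",
    "/data/clicks/dt=2010-03-26": '{"user": "carol", "count": 7}\n',
}


def read_tsv(raw: str) -> Iterator[Record]:
    """Hypothetical storage driver for tab-separated text."""
    for line in raw.splitlines():
        user, count = line.split("\t")
        yield {"user": user, "count": int(count)}


def read_jsonl(raw: str) -> Iterator[Record]:
    """Hypothetical storage driver for one JSON object per line."""
    for line in raw.splitlines():
        yield json.loads(line)


# Pluggable part: supporting a new format means registering one more driver.
DRIVERS: Dict[str, Driver] = {"tsv": read_tsv, "jsonl": read_jsonl}


@dataclass
class Partition:
    keys: Dict[str, str]      # partition key values, e.g. {"dt": "2010-03-25"}
    storage_format: str       # name of the driver that can read this partition
    location: str             # where the data lives


# Stand-in for the metastore (assumption): table name -> its partitions.
CATALOG: Dict[str, List[Partition]] = {
    "clicks": [
        Partition({"dt": "2010-03-25"}, "tsv", "/data/clicks/dt=2010-03-25"),
        Partition({"dt": "2010-03-26"}, "jsonl", "/data/clicks/dt=2010-03-26"),
    ]
}


def read_table(table: str, **partition_filter: str) -> Iterator[Record]:
    """Yield records from every partition whose keys match the filter."""
    for part in CATALOG[table]:
        if all(part.keys.get(k) == v for k, v in partition_filter.items()):
            yield from DRIVERS[part.storage_format](FAKE_FS[part.location])
```

A caller would then write list(read_table("clicks", dt="2010-03-26")) and get
plain records back, whether that partition happens to be stored as
tab-separated text or as JSON lines.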
> Owl Hadoop Table Management Service
> -----------------------------------
>
> Key: PIG-1331
> URL: https://issues.apache.org/jira/browse/PIG-1331
> Project: Pig
> Issue Type: New Feature
> Reporter: Jay Tang
>
> This JIRA is a proposal to create a Hadoop table management service: Owl.
> Today, MapReduce and Pig applications interact directly with HDFS
> directories and files and must deal with low-level data management issues
> such as storage format, serialization/compression schemes, data layout, and
> efficient data access, often with different solutions. Owl aims to
> provide a standard way to address these issues and to abstract away the
> complexities of reading/writing huge amounts of data from/to HDFS.
> Owl has a data access API that is modeled after the traditional Hadoop
> InputFormat and a management API to manipulate Owl objects. This JIRA is
> related to PIG-823 (Hadoop Metadata Service) as Owl has an internal metadata
> store. Owl integrates with different storage modules, such as Zebra, through
> a pluggable architecture.
> Initially, the proposal is to submit Owl as a Pig contrib project. Over
> time, it makes sense to move it to a Hadoop subproject.