[jira] Commented: (PIG-823) Hadoop Metadata Service

Jeff Hammerbacher (JIRA) Fri, 29 May 2009 11:36:24 -0700

    [ 
https://issues.apache.org/jira/browse/PIG-823?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12714539#action_12714539
 ]


Jeff Hammerbacher commented on PIG-823:
---------------------------------------

Hey,

Hadoop already has a metadata service (well defined at 
http://svn.apache.org/viewvc/hadoop/hive/trunk/metastore/if/hive_metastore.thrift)
 and a SQL implementation of it in production use at scale at several 
organizations. Can any of that work be reused for this purpose? Duplicating 
effort across subprojects seems like a bad idea.

Later,
Jeff

> Hadoop Metadata Service
> -----------------------
>
>                 Key: PIG-823
>                 URL: https://issues.apache.org/jira/browse/PIG-823
>             Project: Pig
>          Issue Type: New Feature
>            Reporter: Olga Natkovich
>
> This JIRA is created to track development of a metadata system for Hadoop. 
> The goal of the system is to allow users and applications to register data 
> stored on HDFS, search for the data available on HDFS, and associate metadata 
> such as schema, statistics, etc. with a particular data unit or a data set 
> stored on HDFS. The initial goal is to provide a fairly generic, low-level 
> abstraction that any user or application on HDFS can use to store and retrieve 
> metadata. Over time, higher-level abstractions closely tied to particular 
> applications or tools can be developed.
> Over time, it would make sense for the metadata service to become a 
> subproject within Hadoop. For now, the proposal is to make it a contrib to 
> Pig since Pig SQL is likely to be the first user of the system.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
