[
https://issues.apache.org/jira/browse/PIG-823?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12714539#action_12714539
]
Jeff Hammerbacher commented on PIG-823:
---------------------------------------
Hey,
Hadoop already had a metadata service (well defined at
http://svn.apache.org/viewvc/hadoop/hive/trunk/metastore/if/hive_metastore.thrift)
and a SQL implementation in production use at scale at several organizations.
Can any of that work be reused for this purpose? It seems like duplicating
effort across subprojects is a bad idea.
Later,
Jeff
> Hadoop Metadata Service
> -----------------------
>
> Key: PIG-823
> URL: https://issues.apache.org/jira/browse/PIG-823
> Project: Pig
> Issue Type: New Feature
> Reporter: Olga Natkovich
>
> This JIRA is created to track development of a metadata system for Hadoop.
> The goal of the system is to allow users and applications to register data
> stored on HDFS, search for the data available on HDFS, and associate metadata
> such as schema, statistics, etc. with a particular data unit or a data set
> stored on HDFS. The initial goal is to provide a fairly generic, low level
> abstraction that any user or application on HDFS can use to store an retrieve
> metadata. Over time a higher level abstractions closely tied to particular
> applications or tools can be developed.
> Over time, it would make sense for the metadata service to become a
> subproject within Hadoop. For now, the proposal is to make it a contrib to
> Pig since Pig SQL is likely to be the first user of the system.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.