[jira] Commented: (PIG-823) Hadoop Metadata Service

Matei Zaharia (JIRA) Tue, 09 Jun 2009 17:17:34 -0700

    [ 
https://issues.apache.org/jira/browse/PIG-823?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12717876#action_12717876
 ]


Matei Zaharia commented on PIG-823:
-----------------------------------

I agree with Jeff that that it might be better to make this service a feature 
of HDFS rather than a component of Pig. A metadata service might be useful to 
people who don't use Pig at all, e.g. who just load data and process it with 
MapReduce (which is a use case you cover on the Wiki page). Having a single, 
standard metadata service would allow unrelated tools for loading data, 
processing it, browsing it, etc to interoperate.

> Hadoop Metadata Service
> -----------------------
>
>                 Key: PIG-823
>                 URL: https://issues.apache.org/jira/browse/PIG-823
>             Project: Pig
>          Issue Type: New Feature
>            Reporter: Olga Natkovich
>
> This JIRA is created to track development of a metadata system for  Hadoop. 
> The goal of the system is to allow users and applications to register data 
> stored on HDFS, search for the data available on HDFS, and associate metadata 
> such as schema, statistics, etc. with a particular data unit or a data set 
> stored on HDFS. The initial goal is to provide a fairly generic, low level 
> abstraction that any user or application on HDFS can use to store an retrieve 
> metadata. Over time a higher level abstractions closely tied to particular 
> applications or tools can be developed.
> Over time, it would make sense for the metadata service to become a 
> subproject within Hadoop. For now, the proposal is to make it a contrib to 
> Pig since Pig SQL is likely to be the first user of the system.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Commented: (PIG-823) Hadoop Metadata Service

Reply via email to