Matei Zaharia commented on PIG-823:
I agree with Jeff that that it might be better to make this service a feature
of HDFS rather than a component of Pig. A metadata service might be useful to
people who don't use Pig at all, e.g. who just load data and process it with
MapReduce (which is a use case you cover on the Wiki page). Having a single,
standard metadata service would allow unrelated tools for loading data,
processing it, browsing it, etc to interoperate.
> Hadoop Metadata Service
> Key: PIG-823
> URL: https://issues.apache.org/jira/browse/PIG-823
> Project: Pig
> Issue Type: New Feature
> Reporter: Olga Natkovich
> This JIRA is created to track development of a metadata system for Hadoop.
> The goal of the system is to allow users and applications to register data
> stored on HDFS, search for the data available on HDFS, and associate metadata
> such as schema, statistics, etc. with a particular data unit or a data set
> stored on HDFS. The initial goal is to provide a fairly generic, low level
> abstraction that any user or application on HDFS can use to store an retrieve
> metadata. Over time a higher level abstractions closely tied to particular
> applications or tools can be developed.
> Over time, it would make sense for the metadata service to become a
> subproject within Hadoop. For now, the proposal is to make it a contrib to
> Pig since Pig SQL is likely to be the first user of the system.
This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.