Re: Flexible, use case specific metadata cataloging for CAS

Mattmann, Chris A (388J) Sun, 24 Mar 2013 14:20:36 -0700

Lewis,

Since I still have not officially received this email from dev@oodt,
(grr) and since I saw it on the mail archives, I'm going to copy your
email below, with the same subject, and hope it gets threaded right :)


Comments below:


On 3/20/13 5:55 PM, Lewis John Mcgibbney (lewi...@gmail.com) wrote:

>
>All,
>
>I picked up OODT today and immediately thought about an implementation of
>Apache Gora [0] for abstracting persistence within the CAS metadata
>catalogue.

+1, I've wanted this for a while. A GoraCatalog implementation of the FM
Catalog
Interface.

>Right now, for me, the persistence of my metadata catalogue to Lucene or
>MySQL is sufficient and I have no immediate justification for using some
>alternative storage mechanism however I noticed that there are a few areas
>where OODT could generally benefit from the Gora implementation.
>It is natural that product discovery via daemon driven CAS crawler (for
>example) will fire product streams of varying nature towards the catalogue
>storage mechanism. Lucene or MySQL my not be best best option to store
>such
>streams of data and/or the best way to later retrieve that data. Gora
>would
>enable a much more comprehensive variety of data stores to be available
>for
>persistence of catalogue metadata and would also provide a much more
>flexible model specifically geared towards better solutions for metadata
>cataloguing. Currently we support Amazon DynamoDB, Accumulo, Cassandra,
>HBase, HDFS, HSQLDB and MySQL. We have patches for Solr, MongoDB and
>various file based stores. There is also interest to implement an Oracle
>NoSQL DB.... don't ask.

Haha!

>I notice that the SolrIndexer tool implemented by Paul provides an
>expressive number of options for indexing to your Solr HTTP server. The
>gora-solr module would provide all these plus more.
>I suppose this entirely depends on the requirements for expanding metadata
>catalogues within the File Manager.
>Is it envisaged that such an implementation is required for some use cases
>or would be required?

Yes, please, help! :)

>As Gora builds on Hadoop principles, I suppose it would also enable folks
>use their metadata catalogues in different, possibly useful, use-case
>adaptable ways.
>Just an initial thought.

A great one at that, I would be super +1 for a GoraCatalog to help in these
situations and would be keen to work on it with you.

Cheers,
Chris

>Thanks
>Lewis
>
>
>[0] http://gora.apache.org <http://gora.apache.org/>
>--
>*Lewis*
>
>

Re: Flexible, use case specific metadata cataloging for CAS

Reply via email to