Re: GData

Erik Hatcher Tue, 25 Apr 2006 13:25:51 -0700

Anyone here an old timer Apple Newton user?

I've been really getting jazzed on the ideas I'm getting thanks to Solr and contemplating Ruby integration. I've been re-reading my dusty "Programming for the Newton" (using Windows!) book. The discussion of the Newton "soup" data storage mechanism is very much on track with what I'd like to implement from the Ruby side of things using Solr as the "soups" storage. I think more needs to be done with Solr than just faster replication to enable a flexible schema scenario. Back to the Newton analogy, each application registers its own schema but everything fits into a common storage system allowing a unified querying mechanism. Merging queries/data across soups is not done except at the application level, but I can see in the Solr case that custom handlers can facilitate this sort of thing to free the client from having to deal with the massive amount of data.

I've been mulling over the idea of having a single Solr instance morph into a system that can handle multiple client-defined schemas (why not? Lucene itself can handle it) rather than a static XML file, and allow the schemas themselves to be retrievable (yes, I know it already is). I'm still talking about a single Lucene index, but with each Document given a "soup" name field and filters automatically available to single out a specific soup.
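
To make the soup idea concrete, here is a rough sketch in Python (picked only for brevity). It is an in-memory stand-in, not Solr or Lucene API, and every name in it (SoupStore, register_soup, the "soup" field) is hypothetical. It only shows the shape of the idea: per-application schemas, one shared store, and an automatic filter on the soup name.

```python
# Hypothetical in-memory stand-in for a shared index; not Solr or Lucene API.
class SoupStore:
    def __init__(self):
        self.schemas = {}    # soup name -> set of allowed field names
        self.documents = []  # one shared "index" holding every soup

    def register_soup(self, name, fields):
        """Each application registers its own schema under a soup name."""
        self.schemas[name] = set(fields)

    def schema(self, name):
        """Schemas are retrievable by clients."""
        return sorted(self.schemas[name])

    def add(self, soup, doc):
        unknown = set(doc) - self.schemas[soup]
        if unknown:
            raise ValueError(f"fields not in {soup!r} schema: {sorted(unknown)}")
        # Every document carries its soup name as an ordinary field.
        self.documents.append({**doc, "soup": soup})

    def query(self, predicate, soup=None):
        """Query the shared store; pass soup= to single out one soup."""
        return [d for d in self.documents
                if (soup is None or d["soup"] == soup) and predicate(d)]


def mentions_newton(doc):
    return any("Newton" in str(value) for value in doc.values())


store = SoupStore()
store.register_soup("notes", ["title", "body"])
store.register_soup("contacts", ["name", "email"])
store.add("notes", {"title": "Newton", "body": "soup storage"})
store.add("contacts", {"name": "Newton fan", "email": "fan@example.com"})

all_hits = store.query(mentions_newton)                 # unified query across soups
note_hits = store.query(mentions_newton, soup="notes")  # filtered to one soup
```

In a real setup the soup filter would presumably be applied on the server side rather than in client code like this.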

Make sense? I think the GData thing fits with the loosely defined schema scenario as well.


Thoughts?

I was going to wait until my thoughts were more gelled on this topic, but the GData thread brought me out of my cave earlier.


        Erik



On Apr 25, 2006, at 3:16 PM, jason rutherglen wrote:

http://jeremy.zawodny.com/blog/archives/006687.html
Here is a good blog entry with a talk on GData from someone who worked on it. The only thing I think Solr needs is faster replication, which perhaps can be done faster using a direct replication model, preferably over HTTP of the segments files instead of rsync? Reserving rsync for the optimized index sync. The only other thing GData does is versioning of the documents.
