Anyone here an old timer Apple Newton user?

I've been really getting jazzed on the ideas I'm getting thanks to Solr and contemplating Ruby integration. I've been re-reading my dusty "Programming for the Newton" (using Windows!) book. The discussion of the Newton "soup" data storage mechanism is very much on track with what I'd like to implement from the Ruby side of things using Solr as the "soups" storage. I think more needs to be done with Solr than just faster replication to enable a flexible schema scenario. Back to the Newton analogy, each application registers its own schema but everything fits into a common storage system allowing a unified querying mechanism. Merging queries/data across soups is not done except at the application level, but I can see in the Solr case that custom handlers can facilitate this sort of thing to free the client from having to deal with the massive amount of data.

I've been mulling over the idea of having a single Solr instance morph into system that can handle multiple client-defined schemas (why not? Lucene itself can handle it) rather than a static XML file and allow the schemas themselves to be retrievable (yes, I know it already is). I'm still talking about a single Lucene index, but with each Document given a "soup" name field and filters automatically available to single out a specific soup.

Make sense? I think the GData thing fits with the loosely defined schema scenario as well.

Thoughts?

I was going to wait until my thoughts were more gelled on this topic, but the GData thread brought me out of my cave earlier.

        Erik



On Apr 25, 2006, at 3:16 PM, jason rutherglen wrote:

http://jeremy.zawodny.com/blog/archives/006687.html

Here is a good blog entry with a talk on GData from someone who worked on it. The only thing I think Solr needs is faster replication, which perhaps can be done faster using a direct replication model, preferably over HTTP of the segments files instead of rsync? Reserving rsync for the optimized index sync. The only other thing GData does is versioning of the documents.


Reply via email to