Anyone here an old timer Apple Newton user?
I've been really getting jazzed on the ideas I'm getting thanks to
Solr and contemplating Ruby integration. I've been re-reading my
dusty "Programming for the Newton" (using Windows!) book. The
discussion of the Newton "soup" data storage mechanism is very much
on track with what I'd like to implement from the Ruby side of things
using Solr as the "soups" storage. I think more needs to be done
with Solr than just faster replication to enable a flexible schema
scenario. Back to the Newton analogy, each application registers its
own schema but everything fits into a common storage system allowing
a unified querying mechanism. Merging queries/data across soups is
not done except at the application level, but I can see in the Solr
case that custom handlers can facilitate this sort of thing to free
the client from having to deal with the massive amount of data.
I've been mulling over the idea of having a single Solr instance
morph into system that can handle multiple client-defined schemas
(why not? Lucene itself can handle it) rather than a static XML file
and allow the schemas themselves to be retrievable (yes, I know it
already is). I'm still talking about a single Lucene index, but with
each Document given a "soup" name field and filters automatically
available to single out a specific soup.
Make sense? I think the GData thing fits with the loosely defined
schema scenario as well.
Thoughts?
I was going to wait until my thoughts were more gelled on this topic,
but the GData thread brought me out of my cave earlier.
Erik
On Apr 25, 2006, at 3:16 PM, jason rutherglen wrote:
http://jeremy.zawodny.com/blog/archives/006687.html
Here is a good blog entry with a talk on GData from someone who
worked on it. The only thing I think Solr needs is faster
replication, which perhaps can be done faster using a direct
replication model, preferably over HTTP of the segments files
instead of rsync? Reserving rsync for the optimized index sync.
The only other thing GData does is versioning of the documents.