On Sun, Jan 17, 2010 at 9:06 AM, Andrzej Bialecki <a...@getopt.org> wrote: > On 2010-01-16 21:11, Yonik Seeley wrote: >> If we were building from scratch perhaps - but it seems like if we can >> just model what people do today with Solr (but just make it a lot >> easier), that's a good start. The opaque model is what we have today, >> and it's conceptually simple... the complete collection consists of >> all the unique shard ids (or slices) you know about. > > I would argue that the current model has been adopted out of necessity, and > not because of the users' preference.
I think it's both - I've seen quite a few people that really wanted to partition by time for example (and they made some compelling cases for doing so). Seems like a good goal would be to support the customer having various levels of control. > Unless you want an expert-level total > control over what node runs what part of the index, isn't it much more > convenient to delegate all the partitioning and deployment to your "search > cluster" instead of managing the partitioning and deployment yourself? Certainly - we do want to get to the "just handle everything for me" phase. It just feels like there is a lot more development work to do before we can make that happen. Reliably supporting near realtime updates in a replicated environment is hard and will take some time. -Yonik http://www.lucidimagination.com