And there is http://wiki.apache.org/solr/DistributedSearch , but this talks *only* about search.
Dennis, are you the man to take what's on DistributedLucene and DistributedSearch and come up with a marriage proposal? :)

Otis
--
Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch

----- Original Message ----
From: Andrzej Bialecki <[EMAIL PROTECTED]>
To: [email protected]
Sent: Monday, April 14, 2008 1:01:37 PM
Subject: Re: Next Generation Nutch

Dennis Kubes wrote:
>
>
> Otis Gospodnetic wrote:
>> I suppose the first thing to do would be describe the requirements for
>> this shard management. I imagine you have very specific functionality
>> in mind from your Wikia Search experience. Mind putting your ideas on
>> the Wiki? I think it would be very good to share this with
>> [EMAIL PROTECTED] early on, so we can come up with something general
>> that fits both Nutch and Solr. It might turn out that this calls for
>> a separate Lucene project, but we'll see that once the real discussion
>> starts.
>>
>
> I completely agree. This would be better as a shared project. I will
> put my current thoughts down on the Nutch wiki, unless there is already
> a discussion going somewhere?

There is a description of a related concept here:
http://wiki.apache.org/hadoop/DistributedLucene . However, this
addresses only the index part of the shard - in our case shards also
contain plain text (for summaries) and the original binary content (for
cached preview), and possibly other parts (NUTCH-466) neither of which
is managed by this code.

--
Best regards,
Andrzej Bialecki     <><
 ___. ___ ___ ___ _ _   __________________________________
[__ || __|__/|__||\/|  Information Retrieval, Semantic Web
___|||__||  \|  ||  |  Embedded Unix, System Integration
http://www.sigram.com  Contact: info at sigram dot com
