Distibuted search setup is quite simple.
http://wiki.apache.org/nutch/NutchHadoopTutorial
Go to section distributed search. This works for us. We search over several 
indexes.
Nadine.

-----Ursprüngliche Nachricht-----
Von: ianwong [mailto:[EMAIL PROTECTED] 
Gesendet: Sonntag, 7. Dezember 2008 01:23
An: [email protected]
Betreff: how to query mutiple index?


I think many people need crawler handle new sites and generate incremental
index.  In that case, we have to re-crawl unfetched urls and also update
fetched urls if update time is reached.  In addtion, I think current nutch
version can not only handle unfecthed urls, right? If not, can it be a
option in furture function. 

So,  I want to know, can nutch support mutiple indexSearch?  If  yes, I can
create different crawler for differnt group websites, and query them
together. I do not know how to implement it by nutch, just thought something
in lucene side.

        IndexSearcher searcher1 = new IndexSearcher(INDEX_STORE_PATH1);
        IndexSearcher searcher2 = new IndexSearcher(INDEX_STORE_PATH2);
        IndexSearcher[] searchers = { searcher1, searcher2 };
        MultiSearcher searcher = new MultiSearcher(searchers);

Br,
Ian 
-- 
View this message in context: 
http://www.nabble.com/how-to-query-mutiple-index--tp20876355p20876355.html
Sent from the Nutch - User mailing list archive at Nabble.com.

Reply via email to