Distibuted search setup is quite simple. http://wiki.apache.org/nutch/NutchHadoopTutorial Go to section distributed search. This works for us. We search over several indexes. Nadine.
-----Ursprüngliche Nachricht----- Von: ianwong [mailto:[EMAIL PROTECTED] Gesendet: Sonntag, 7. Dezember 2008 01:23 An: [email protected] Betreff: how to query mutiple index? I think many people need crawler handle new sites and generate incremental index. In that case, we have to re-crawl unfetched urls and also update fetched urls if update time is reached. In addtion, I think current nutch version can not only handle unfecthed urls, right? If not, can it be a option in furture function. So, I want to know, can nutch support mutiple indexSearch? If yes, I can create different crawler for differnt group websites, and query them together. I do not know how to implement it by nutch, just thought something in lucene side. IndexSearcher searcher1 = new IndexSearcher(INDEX_STORE_PATH1); IndexSearcher searcher2 = new IndexSearcher(INDEX_STORE_PATH2); IndexSearcher[] searchers = { searcher1, searcher2 }; MultiSearcher searcher = new MultiSearcher(searchers); Br, Ian -- View this message in context: http://www.nabble.com/how-to-query-mutiple-index--tp20876355p20876355.html Sent from the Nutch - User mailing list archive at Nabble.com.
