I think many people need crawler handle new sites and generate incremental
index. In that case, we have to re-crawl unfetched urls and also update
fetched urls if update time is reached. In addtion, I think current nutch
version can not only handle unfecthed urls, right? If not, can it be a
option in furture function.
So, I want to know, can nutch support mutiple indexSearch? If yes, I can
create different crawler for differnt group websites, and query them
together. I do not know how to implement it by nutch, just thought something
in lucene side.
IndexSearcher searcher1 = new IndexSearcher(INDEX_STORE_PATH1);
IndexSearcher searcher2 = new IndexSearcher(INDEX_STORE_PATH2);
IndexSearcher[] searchers = { searcher1, searcher2 };
MultiSearcher searcher = new MultiSearcher(searchers);
Br,
Ian
--
View this message in context:
http://www.nabble.com/how-to-query-mutiple-index--tp20876355p20876355.html
Sent from the Nutch - User mailing list archive at Nabble.com.