Gal Nitzan schrieb:
I have copied only the segments directory but the searcher returns 0
hits.
You have to put the index and segments dir into a directory named "crawl" and start tomcat from the directory that contains crawl. The nutch.war file contains a nutch-default.xml with

<property>
 <name>searcher.dir</name>
 <value>crawl</value>
 <description>
 Path to root of crawl.  This directory is searched (in
 order) for either the file search-servers.txt, containing a list of
 distributed search servers, or the directory "index" containing
 merged indexes, or the directory "segments" containing segment
 indexes.
 </description>
</property>

Do I need to copy the linkdb and the index folders as well?
No, the linkdb contains an inverted link list (for each url all urls that point to it) and is only used to calculate the page score while indexing.

best regards,
Dominik

Reply via email to