On 9/14/06, Tomi NA <[EMAIL PROTECTED]> wrote:
> On 9/5/06, Zaheed Haque <[EMAIL PROTECTED]> wrote:
> > Hi:
>
> I have a problem or two with the described procedure...
>
> > Assuming you have
> >
> > index 1 at /data/crawl1
> > index 2 at /data/crawl2
>
> Used ./bin/nutch crawl urls -dir /home/myhome/crawls/mycrawldir to
> generate an index: luke says the index is valid and I can query it
> using luke's interface.
>
> Does the "searcher.dir" value in nutch-(default|site).xml have any
> impact on the way indexes are created?

No, it doesn't have any impact on index creation. The searcher.dir value
is used for searching only. nutch-site.xml is where you should change it,
for example:

<property>
  <name>searcher.dir</name>
  <value>/home/myhome/crawls</value>
  <description>
  Path to root of index directories. This directory is searched (in
  order) for either the file search-servers.txt, containing a list of
  distributed search servers, or the directory "index" containing
  merged indexes, or the directory "segments" containing segment
  indexes.
  </description>
</property>

and the text file should in this case be

/home/myhome/crawls/search-servers.txt

> > In nutch-site.xml
> > searcher.dir = /data
>
> This is the nutch-site.xml of the web UI?

Both. I mean tomcat/webapps/ROOT/WEB-INF/classes/nutch-site.xml as well
as NUTCH_HOME/conf/nutch-site.xml. The web application needs to know
where the search-servers.txt file is if you plan to use Tomcat to search.

> > Under /data you have a text file called search-server.txt (I think do
> > check nutch-site search.dir description please)
>
> /home/myhome/crawls/search-servers.txt
>
> > In the text file you will have the following
> >
> > hostname1 portnumber
> > hostname2 portnumber
> >
> > example
> > localhost 1234
> > localhost 5678
>
> I placed
> localhost 12567
> (just one instance, to test)
>
> > Then you need to start
> >
> > bin/nutch server 1234 /data/crawl1 &
> >
> > and
> >
> > bin/nutch server 5678 /data/crawl2 &
>
> did that, using port 12567
> ./bin/nutch server 12567 /home/mydir/crawls/mycrawldir &
>
> > bin/nutch org.apache.nutch.search.NutchBean www
> >
> > you should see results :-)
>
> I get:
> ------------
> Exception in thread "main" java.lang.NoClassDefFoundError:
> org/apache/nutch/search/NutchBean
> ------------
>
> What's more, I get no results to any query I care to pass by the Web
> UI, which suggests the UI isn't connected to the underlying
> DistributedSearch server. :\
>
> Any hints, anyone?
>
> TIA,
> t.n.a.
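
To put the pieces above in one place, here is a rough sketch of the setup as a small shell helper. It only writes search-servers.txt into the directory that searcher.dir points to and prints the matching "bin/nutch server" command lines; it does not start anything itself. The function names write_search_servers and server_command are made up for this illustration and are not part of Nutch; the paths and port in the usage comments are just the ones from this thread.

```shell
#!/bin/sh
# Hypothetical helpers (not part of Nutch) illustrating the layout
# discussed in this thread.

# write_search_servers DIR "host port" ["host port" ...]
# Writes DIR/search-servers.txt with one "hostname portnumber" pair per
# line. DIR is assumed to be the same directory searcher.dir points to.
write_search_servers() {
    dir=$1
    shift
    mkdir -p "$dir" || return 1
    printf '%s\n' "$@" > "$dir/search-servers.txt"
}

# server_command PORT CRAWLDIR
# Prints the command line used in the thread to serve one crawl
# directory on one port. It only prints; run the output yourself.
server_command() {
    printf 'bin/nutch server %s %s\n' "$1" "$2"
}

# Usage, with the values from this thread (uncomment to try):
#   write_search_servers /home/myhome/crawls "localhost 12567"
#   server_command 12567 /home/myhome/crawls/mycrawldir
```

Note that the host and port pairs in the file have to match the ports the servers are actually started on, and the directory holding the file has to match searcher.dir in both copies of nutch-site.xml mentioned above.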

_______________________________________________
Nutch-general mailing list
https://lists.sourceforge.net/lists/listinfo/nutch-general
