Hi all,
Have a distributed search issue I need some advice on. The scenario is that I
have tomcat running off one server and two nutch search servers running off two
other machines (so 3 machines in total). I've setup the nutch war to correctly
call the search servers and they respond. Problem is I get duplicate results.
Now I have the same data/information from the crawl copied on both machines so
the crawl data is replicated on both machines.
Questions:
1) how do I prevent the duplicate response? If I start a third search server I
only get two duplicate responses so it doesn't seem to increase with the number
of search servers
2) does tomcat wait for ALL search servers to respond before displaying the
query result or does it display the result as soon as one server responds?
3) in terms of load sharing, what is the best approach for distributed search
servers?
Any help would be greatly appreciated!
Thanks,
Hilkiah G. Lavinier MEng (Hons), ACGI
6 Winston Lane,
Goodwill,
Roseau, Dominica
Mbl: (767) 275 3382
Hm : (767) 440 3924
Fax: (767) 440 4991
VoIP USA: (646) 432 4487
Email: [EMAIL PROTECTED]
Email: [EMAIL PROTECTED]
IM: Yahoo hilkiah / MSN [EMAIL PROTECTED]
IM: ICQ #8978201 / AOL hilkiah21
____________________________________________________________________________________
Looking for last minute shopping deals?
Find them fast with Yahoo! Search.
http://tools.search.yahoo.com/newsearch/category.php?category=shopping