Regarding the following, found on the nutch.org Developers page:

"Nutch has not yet been tuned for quality. There are ten or twenty knobs that we can twiddle to adjust the ranking formula. We have started developing software to do this tuning automatically, but the current code just contains guesses. With a little tuning we should be able to get results that are competitive with those of major search engines."

You might consider using Particle Swarm Optimization (PSO) for optimizing the tuning. The PSO algorithm is surprisingly fast at finding optimum (or optimal) solutions to multi-dimensioned (multi-knobbed) problems. You can find some information about PSO on my site as well as an open source java version of the basic PSO algorithm -- see www.adaptiveview.com/resources/index.html#source -- and I'd be more than happy to answer any questions you might have.


Paul Pomeroy



-------------------------------------------------------
This SF.Net email is sponsored by: IBM Linux Tutorials
Free Linux tutorial presented by Daniel Robbins, President and CEO of
GenToo technologies. Learn everything from fundamentals to system
administration.http://ads.osdn.com/?ad_id=1470&alloc_id=3638&op=click
_______________________________________________
Nutch-developers mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/nutch-developers

Reply via email to