Hi Folks!
I'm a newbie here, so please forgive me if my very first question might be ever so silly:
Is there an "easy" way (without messing up some source code) to adjust the result ranking of nutch in that way, that short URLs will be boosted
to the top?
To put it as a real life example: I've built a "Hannover Search Engine" :-) (which you might test at http://suma-lab.de:8081). It has crawled some hundreds of servers of the Hannover area. Now if someone searches for simple stuff (like the word >hannover<), simple (=short) URLs should be at the top (like www.hannover.de). This is not what happens, instead lengthly URLs out of the depth of Hannover servers are at the top positions.
How to change that ranking behavior?
Please see the thread titled "Adding title and site to scoring". I'm currently overloaded with work, so I haven't started yet working on this patch - feel free to contribute ;-)
-- Best regards, Andrzej Bialecki ___. ___ ___ ___ _ _ __________________________________ [__ || __|__/|__||\/| Information Retrieval, Semantic Web ___|||__|| \| || | Embedded Unix, System Integration http://www.sigram.com Contact: info at sigram dot com
------------------------------------------------------- SF email is sponsored by - The IT Product Guide Read honest & candid reviews on hundreds of IT Products from real users. Discover which products truly live up to the hype. Start reading now. http://ads.osdn.com/?ad_id=6595&alloc_id=14396&op=click _______________________________________________ Nutch-developers mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/nutch-developers
