Hi Ting,

Have a look at the "scoring-opic" plugin and the package "org.apache.nutch.scoring.*".
"OPIC" is the algorithm Nutch uses to determine a page's static importance. Basically speaking, it does the same job as Google's PageRank algorithm. In the nutch-0.9 source tree, the plugin code lives under src/plugin/scoring-opic/src/java/org/apache/nutch/scoring/opic/.

There are some issues (probably fixed by now?) with the implementation. They are described here, along with a link to the original OPIC paper:
http://wiki.apache.org/nutch/FixingOpicScoring

Regarding those issues: somebody has just got (or is trying to get) his hands on the OPIC plugin. You probably want to keep an eye on that thread:
http://www.mail-archive.com/nutch-user%40lucene.apache.org/msg09166.html

Cheers,
martin

On 8/14/07, ting <[EMAIL PROTECTED]> wrote:
> Excuse me,
>
> I'm Ting. I am studying Nutch now and have some questions.
>
> I use nutch-0.9 and am reading its source code, but it is so different
> from nutch-0.7. I want to know how to use nutch-0.9 to crawl the whole
> web; I can't find code like that in nutch-0.7 for crawling the whole web.
> Also, how does nutch-0.9 give scores to pages? Does it use PageRank?
> Where is the code that handles PageRank? I can't find it!
>
> Please help me! Thank you!
