Dear Wiki user, You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change notification.
The "CrawlDatumStates" page has been changed by MarkusJelsma: http://wiki.apache.org/nutch/CrawlDatumStates?action=diff&rev1=4&rev2=5 Comment: added scoreupdater *Injector - to populate CrawlDb with new URLs *Generator - to generate new fetchlists, and optionally mark those URLs in CrawlDb as "being in the process of fetching" *CrawlDb update - to update the CrawlDb with new knowledge about the already known URLs (already in CrawlDb) as well as add new URLs discovered from page outlinks. + *[[http://wiki.apache.org/nutch/NewScoring#ScoreUpdater|ScoreUpdater]] updates the CrawlDB with LinkRank calculated URL scores. Below is a state diagram of CrawlDatum, which is a class that holds this state in CrawlDb.

