1) Installation and configuration of Nutch
2) Crawling the web, the CrawlDb, and URL filters
3) Parsing and parse filters
4) Nutch plugins and plugin architecture
5) Analysis, link analysis, and scoring
6) Indexing and custom fields
7) Deployment and shard architecture
8) Writing custom tools for Nutch
9) Hadoop architecture
Are there other things people would want to go over?

Dennis
