I'd like to know all the known techniques for speeding up MapReduce on a single user machine.
So far, I know of this patch: http://issues.apache.org/jira/browse/NUTCH-395

I have also read that changing hadoop-site.xml can help, but I don't know what changes to make. Please add anything you've found that helps. I am considering going back to 0.7 if I can't get Nutch to go faster. In my case, I am also crawling just a single site.

Ben
