Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change notification.

The following page has been changed by NycoNyco:
http://wiki.apache.org/nutch/HardwareRequirements

The comment on the change is:
title

------------------------------------------------------------------------------
- = Hardware Requirements =
  In general, fetching and database updates require lots of disk, and searching is faster with more RAM. But the particulars depend on how big of an index you're trying to build and how much query traffic you expect.
+ 
+ == Requirements for indexing ==
  As a general rule, each page fetched requires around 10k of disk overall (for the page cache, its text, the index, db entries, etc.). So a terabyte of storage is required for every 100M pages.
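
The rule of thumb in the page text (about 10k of disk per fetched page) can be checked with a little arithmetic. The sketch below is illustrative only and is not part of Nutch: the function names are hypothetical, and it assumes "10k" means 10,000 bytes and that a terabyte is 10^12 bytes.

```python
# Rough disk estimate from the "10k per page" rule of thumb.
# Assumption: "10k" is read as 10,000 bytes (decimal units throughout).
BYTES_PER_PAGE = 10 * 1000
BYTES_PER_TERABYTE = 1000 ** 4


def estimated_disk_bytes(pages, bytes_per_page=BYTES_PER_PAGE):
    """Return the approximate total disk needed for `pages` fetched pages."""
    return pages * bytes_per_page


def to_terabytes(n_bytes):
    """Convert a byte count to decimal terabytes."""
    return n_bytes / BYTES_PER_TERABYTE


if __name__ == "__main__":
    # 100M pages at ~10k each comes to about one terabyte.
    print(to_terabytes(estimated_disk_bytes(100_000_000)))  # prints 1.0
```

Under these assumptions the figure scales linearly, so a crawl of 1 billion pages would call for roughly 10 terabytes.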