On Fri, Dec 26, 2008 at 2:35 AM, buddha1021 <[email protected]> wrote:
>
> hi all:
> I am very interested in the nutch! I want to ask some questions about
> nutch:
> (1)Can nutch search 1 billion(=1000 millions) pages that the size of the
> page's data will achive 10T(=10000G) bytes? one page's size ==10k .

Nutch uses Hadoop, which relies on HDFS to store its data, and HDFS can
certainly handle 10 TB.

> (2)If nutch can do this ,what about the speed of the search ,compared with
> google ? Can the speed of the search meet the people's requirement ?

I don't know. You should take a look at Lucene performance:
http://www.google.fr/search?q=lucene+performance

> (3)If nutch can do this ,how many nodes would be required?

I'd like to know too :)

--
F4FQM
Kerunix Flan
Laurent Laborde
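
P.S. For what it's worth, here is a rough back-of-envelope sketch of the storage side of questions (1) and (3). It only checks the arithmetic in the question (1 billion pages at about 10 KB each) and multiplies by HDFS's default replication factor of 3. The per-node disk figure and the helper names are purely hypothetical, and the estimate ignores index size, the crawl database, and search throughput, so it is not an answer to how many nodes a real deployment needs.

```python
# Back-of-envelope storage sizing. Only PAGES and PAGE_SIZE_BYTES come from
# the question; everything else is an assumption for illustration.
PAGES = 1_000_000_000                 # 1 billion pages (from the question)
PAGE_SIZE_BYTES = 10_000              # ~10 KB per page (from the question)
REPLICATION = 3                       # HDFS default replication (dfs.replication)
USABLE_BYTES_PER_NODE = 2 * 10**12    # hypothetical: 2 TB usable disk per node


def raw_bytes(pages, page_size):
    """Total size of the page data before replication."""
    return pages * page_size


def replicated_bytes(pages, page_size, replication):
    """Disk space consumed once every block is stored `replication` times."""
    return raw_bytes(pages, page_size) * replication


def min_storage_nodes(total_bytes, usable_per_node):
    """Lower bound on node count for storage alone (ceiling division)."""
    return -(-total_bytes // usable_per_node)


if __name__ == "__main__":
    raw = raw_bytes(PAGES, PAGE_SIZE_BYTES)
    total = replicated_bytes(PAGES, PAGE_SIZE_BYTES, REPLICATION)
    print("raw page data (TB):", raw / 10**12)
    print("with replication (TB):", total / 10**12)
    print("storage-only node lower bound:",
          min_storage_nodes(total, USABLE_BYTES_PER_NODE))
```

Changing USABLE_BYTES_PER_NODE or REPLICATION to match real hardware and configuration changes the result directly; the point is only that raw page storage is the easy part of the sizing question.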
