Dear Wiki user, You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change notification.
The "MapReduce" page has been changed by AndreRicardo. http://wiki.apache.org/nutch/MapReduce?action=diff&rev1=6&rev2=7 -------------------------------------------------- * In essence, it allows massive data sets to be processed in a distributed fashion by breaking the processing into many small computations of two types: 1. A Map operation that transforms the input into an intermediate representation. - 2. A Reduce function that recombines the intermediate representation into the final output. + 1. A Reduce function that recombines the intermediate representation into the final output. - * This processing model is ideal for the operations a search engine indexer like Nutch or Google needs to perform - like computing inlinks for URLs, or building inverted indexes - and it will [[http://wiki.apache.org/nutch-data/attachments/Presentations/attachments/mapred.pdf|"transform Nutch"]] into a scalable, distributed search engine. + * This processing model is ideal for the operations a search engine indexer like Nutch or Google needs to perform - like computing inlinks for URLs, or building inverted indexes - and it will [[attachment:Presentations/mapred.pdf|"transform Nutch"]] into a scalable, distributed search engine.