According to the Hadoop tutorial on the Nutch wiki (http://wiki.apache.org/nutch/NutchHadoopTutorial),

"you don't want to search using DFS, you want to search using local filesystems. Once the index has been created on the DFS you can use the hadoop copyToLocal command to move it to the local file system as such" ... "Understand that at this point we are not using the DFS or MapReduce to do the searching, all of it is on a local machine".

So my understanding is that Hadoop is only good for batch index building, and is not suitable for incremental index building or for serving searches. Is this true?

The reason I am asking is that when I read the ACM article by Mike Cafarella and Doug Cutting, it sounded to me like the concern was making the index structures fit in primary memory, not the entire crawled database. Did I misunderstand the ACM article?
