I've been reading the Nutch MapReduce stuff[1], and the original Google paper [2].

I know there's a mapreduce branch in the nutch project, but is there any plan/talk of perhaps integrating something like that directly into the Lucene API? For projects that need a lower-level API like Lucene, rather than the crawl-like nature of Nutch, the potential to index lots of information in an efficient manner is very appealing indeed.

I'm not suggesting this is _easy_, just curious of what folks on the Lucene-side of things think. Perhaps a chance to refactor out from nutch a shared library?

I would love to hear anyones thoughts on the matter.

cheers,

Paul Smith

[1] http://wiki.apache.org/nutch-data/attachments/Presentations/ attachments/oscon05.pdf
[2] http://labs.google.com/papers/mapreduce-osdi04.pdf

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to