Build Solr index using Hadoop MapReduce

Ning Li Mon, 02 Mar 2009 09:54:54 -0800

Hi,

I wonder if there is interest in a contrib module that builds Solr
index using Hadoop MapReduce?


It is different from the Solr support in Nutch. The Solr support in
Nutch sends a document to a Solr server in a reduce task. Here, I aim
at building/updating Solr index within map/reduce tasks. Also, it
achieves better parallelism when the number of map tasks is greater
than the number of reduce tasks, which is usually the case.

I worked out a very simple initial version. But I want to check if
there is any interest before proceeding. If so, I'll open a Jira
issue.

Cheers,
Ning

Build Solr index using Hadoop MapReduce

Reply via email to