Hi, I wonder if there is interest in a contrib module that builds Solr index using Hadoop MapReduce?
It is different from the Solr support in Nutch. The Solr support in Nutch sends a document to a Solr server in a reduce task. Here, I aim at building/updating Solr index within map/reduce tasks. Also, it achieves better parallelism when the number of map tasks is greater than the number of reduce tasks, which is usually the case. I worked out a very simple initial version. But I want to check if there is any interest before proceeding. If so, I'll open a Jira issue. Cheers, Ning
