Has anyone tried applying the patches from NUTCH-442 to the Nutch trunk? Between that code and the example from Sami's FooFactory weblog I've been able to at least get things running, but I've still hit a snag. When I try to run SolrIndexer.java I get an error from the Hadoop MapTask (via Indexer.java:157) complaining about a type mismatch in the map: "expected org.apache.hadoop.io.ObjectWritable, recieved org.apache.nutch.crawl.NutchWritable".

Looking at Indexer.java I can see where the OutputFormatter.map() seems to be trying to "send" a new NutchWritable, but SolrIndexer.index() sets up its JobConf map to use an ObjectWritable. I suspect that's where the problem is, but I'm not familiar enough with the code (so far) to be able to tell how to fix this.
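
To illustrate what I suspect is happening, here is a minimal stand-alone sketch. It is not Nutch or Hadoop code: every class name in it is a hypothetical stand-in, and it assumes the framework compares the emitted value's class against the declared one exactly. If that assumption holds, I'd guess the real job needs its declared map output value class (JobConf.setMapOutputValueClass, if I'm reading the Hadoop API right) to name NutchWritable, but I haven't verified that against the patch.

```java
import java.io.IOException;

public class TypeMismatchSketch {
    // Hypothetical stand-ins for the two value types named in the error.
    static class ObjectWritableStandIn {}
    static class NutchWritableStandIn {}

    // Hypothetical collector that rejects any value whose class differs
    // from the class declared when the job was configured.
    static class CheckingCollector {
        private final Class<?> declaredValueClass;

        CheckingCollector(Class<?> declaredValueClass) {
            this.declaredValueClass = declaredValueClass;
        }

        void collect(Object value) throws IOException {
            // Exact class comparison (an assumption about the real check).
            if (value.getClass() != declaredValueClass) {
                throw new IOException("Type mismatch: expected "
                        + declaredValueClass.getName()
                        + ", received " + value.getClass().getName());
            }
        }
    }

    // Returns true if a collector declared with `declared` accepts `value`.
    static boolean accepts(Class<?> declared, Object value) {
        try {
            new CheckingCollector(declared).collect(value);
            return true;
        } catch (IOException e) {
            return false;
        }
    }

    public static void main(String[] args) {
        // Job declares one type, map emits another: rejected.
        System.out.println(accepts(ObjectWritableStandIn.class,
                new NutchWritableStandIn())); // prints "false"
        // Declaration matches what the map emits: accepted.
        System.out.println(accepts(NutchWritableStandIn.class,
                new NutchWritableStandIn())); // prints "true"
    }
}
```

The first call mirrors my current setup (declared type and emitted type disagree); the second shows the same value passing once the declaration matches.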

I can post the errors/logs from it. I just wasn't sure which parts were relevant or what the best way to share them was (I didn't want to dump loads of unformatted stack-trace lines here).
