Re: Nutch 2, Solr 5 - solrdedup causes ClassCastException:

Tom Chiverton Fri, 14 Oct 2016 08:19:08 -0700

Not sure what was going on, so I deleted the core, and the underlyingfolders under solr/server/nutch, bounced the solr service, and theschema browser in the Solr interface shows now schema as expected.

If it put a single document it (i.e. a single URL in the seed list, thenrun inject, generate,fetch,parse,update and solrindex) then all is well.The schema browser in the Solr interface is showing the digest field asstring.

If I then run "bin/crawl ..." it adds some more documents (as expected)but ultimiatly dies with the ClassCastException. Like I have a baddocument in the index ?

But still, Solr schema browser is showing the digest field as string (asbefore) and my documents are listed (via the solr query web interface)as having string digests too !


Tom


On 14/10/16 15:57, Tom Chiverton wrote:

I don't understand what you mean here. I am not a Solr expert, thoughI've used it a bit in the past, though not with Nutch.
Is there a schema I should be feeding it ?

Tom


On 14/10/16 15:50, Markus Jelsma wrote:
Solr supports schemaless mode, which may be your case. Perhaps itmade your digest field multi valued. I'd suggest to use Solr'sclassic schema factory, and a fixed schema.
______________________________________________________________________
This email has been scanned by the Symantec Email Security.cloud service.
For more information please visit http://www.symanteccloud.com
______________________________________________________________________

Re: Nutch 2, Solr 5 - solrdedup causes ClassCastException:

Reply via email to