Hi Lewis,

I managed to crawl with Nutch 1.6 and index into Solr 4.2 after adding the following to schema-solr4.xml:

    <field name="_version_" type="long" stored="true" indexed="true"/>
    <uniqueKey>url</uniqueKey>

I saw that "_version_" was added in the patch, but I didn't see the uniqueKey change from id to url (maybe it's not mandatory? I didn't try without it).

I'm also using metatag fields (metatag.keywords, metatag.description), and for those I think the only difference is changing the type from "text" to "string".

Once I have everything running in full I'll publish my configuration, if that would help.

Thanks.

On Mon, Apr 8, 2013 at 10:31 PM, Lewis John Mcgibbney <[email protected]> wrote:

> Hi Amit,
> I recently updated NUTCH-1486 [0] with a patch to work against Solr 4.2.1.
> You will be able to pull stuff from this patch and push it into your Solr 4
> schema file, etc.
> I will begin work on upgrading trunk to work with Solr 4 shortly... maybe
> this afternoon.
> If you are able to help with the effort it would be great.
> Thank you
>
> [0] https://issues.apache.org/jira/browse/NUTCH-1486
>
>
> On Sat, Apr 6, 2013 at 8:25 AM, Amit Sela <[email protected]> wrote:
>
> > Hi all,
> >
> > I have nutch 1.6 setup and running with Solr 3.6.2 and I'm trying to
> > upgrade to Solr 4.2 but I'm missing something...
> >
> > I re-built nutch with schema-solr4.xml as schema.xml and copied the
> > schema-solr4.xml to Solr example/collection1/conf/schema.xml
> >
> > The index phase keeps failing throwing errors about unknown field "host"
> > and "metatag.description" (metatags worked just fine with 3.6.2).
> >
> > What else am I missing ?
> >
> > Thanks.
> >
>
>
>
> --
> *Lewis*
>

