subcollection does work with 2.x and the problem was the configuration on my 
side (the subcollections.xml file in the conf folder).

So the list of fields in the "The Index Structure" page I can't confirm working 
with Nutch 2.x yet are: 

segment
primaryType
subtype
urlmeta


-----Original Message-----
From: Sullivan, James (Financial&Risk) 
Sent: Thursday, October 04, 2012 10:59 AM
To: [email protected]
Subject: RE: Nutch 2.1 fields

Lewis and Chris,

Agree that "The Index Structure" page is very useful documentation. I went 
through the fields/plugins listed in your link using Nutch 2.1 rc and most 
work. I was able to get positive results for everything except the following

segment -- I am guessing this is not relevant to Nutch 2.x?
primaryType -- does this work with Nutch 2.x or do I need to look into my 
configuration?
subtype -- does this work Nutch 2.x or is my set up misconfigured?
urlmeta -- does this work Nutch 2.x or is my set up misconfigured?
subcollection -- does this work with Nutch 2.x or is my set up misconfigured?

I notice the default installation of Nutch 2.x does not come with the urlmeta 
plugin nor the index-static plugin. 

It might be useful to add a column to that page about which fields work with 
1.x and which with 2.x. I think new users are finding it challenging to figure 
out which documentation applies to which version of Nutch.

James



-----Original Message-----
From: Christopher Gross [mailto:[email protected]]
Sent: Wednesday, October 03, 2012 3:59 AM
To: [email protected]
Subject: Re: Nutch 2.1 fields

Awesome, this is exactly what I needed.  Thanks Lewis!

-- Chris


On Tue, Oct 2, 2012 at 2:50 PM, Lewis John Mcgibbney 
<[email protected]> wrote:
> Hi Chris,
>
> Please see here [0] for the most up-to-date account of the fields for 
> building your Solr index.
>
> I tried to bring this bang up to date a while back and more recently 
> when writing some trivial plugin tests however please shout about 
> anything which is not correct and we can edit accordingly.
>
> hth
>
> Lewis
>
> [0] http://wiki.apache.org/nutch/IndexStructure
>
> On Tue, Oct 2, 2012 at 7:32 PM, Christopher Gross <[email protected]> wrote:
>> What fields are available to go in the solrindex-mapping.xml file for 
>> Nutch 2.1?  Is there a list somewhere?
>>
>> In my 1.3 setup, I had "url" -- I don't think I added anything in 
>> like a plugin to get that.
>>
>> -- Chris
>
>
>
> --
> Lewis

Reply via email to