Those are added by IndexerMapReduce (or 2.x equivalent) and index-basic. They 
contain the crawl datum's signature, the time stamp (see index-basic) and crawl 
datum score. If you think you don't need them, you can safely omit them. 
 
-----Original message-----
> From:[email protected] <[email protected]>
> Sent: Sat 16-Feb-2013 19:21
> To: [email protected]
> Subject: Re: fields in solrindex-mapping.xml
> 
> Hi Lewis,
> 
> Why do we need to include digest, tstamp, boost and batchid fields in 
> solrindex?
> 
> Thanks.
> Alex.
> 
>  
> 
>  
> 
>  
> 
> -----Original Message-----
> From: Lewis John Mcgibbney <[email protected]>
> To: user <[email protected]>
> Sent: Fri, Feb 15, 2013 4:21 pm
> Subject: Re: fields in solrindex-mapping.xml
> 
> 
> Hi Alex,
> OK so we can certainly remove segment from 2.x solr-index-mapping.xml. It
> would however be nice to replace this with the appropriate batchId.
> Can someone advise where the 'segment' field currently comes from in trunk?
> That way we can at least map the field to the batchId equivalent in 2.x
> 
> Thank you
> Lewis
> 
> On Fri, Feb 15, 2013 at 2:23 PM, <[email protected]> wrote:
> 
> > Hi Lewis,
> >
> > If I exclude one of the fileds tstamp, digest, and boost from
> > solindex-mapping and schema.xml, solrindex gives error
> >
> > SEVERE: org.apache.solr.common.SolrException: ERROR: [doc=com.yahoo:http/]
> > unknown field 'tstamp'
> >
> > for each of above fields, except segment.
> >
> > Alex.
> >
> >
> >
> >
> >
> >
> >
> > -----Original Message-----
> > From: Lewis John Mcgibbney <[email protected]>
> > To: user <[email protected]>
> > Sent: Thu, Feb 14, 2013 8:34 pm
> > Subject: Re: fields in solrindex-mapping.xml
> >
> >
> > Hi Alex,
> > Tstamp represents fetch tiem, used for deduplication.
> > Boost is for scoring-opic and link. This is required in 2.x as well.
> > I don't have the code right now, but you can try removing digest and
> > segment. To me they both look legacy.
> > There is a wiki page on index structure which you can consult and/or add to
> > should you wish.
> > Thank you
> > Lewis
> >
> > On Thursday, February 14, 2013,  <[email protected]> wrote:
> > > Hello,
> > >
> > > I see that there are
> > >
> > >                 <field dest="segment" source="segment"/>
> > >                 <field dest="boost" source="boost"/>
> > >                 <field dest="digest" source="digest"/>
> > >                 <field dest="tstamp" source="tstamp"/>
> > >
> > > fields in addition to title, host and content ones in nutch-2.x'
> > solr-mapping.xml. I thought tstamp may be needed for sorting documents.
> > What about the other fields,
> > > segment, boost and digest? Can someone explain, why these fields are
> > included in solr-mapping.xml?
> > >
> > >
> > > Thanks.
> > > Alex.
> > >
> > >
> > >
> >
> > --
> > *Lewis*
> >
> >
> >
> 
> 
> -- 
> *Lewis*
> 
>  
> 

Reply via email to