Do you mean they help when sharding? Thanks. Alex.
-----Original Message----- From: Lewis John Mcgibbney <[email protected]> To: user <[email protected]> Sent: Sat, Feb 16, 2013 10:58 am Subject: Re: fields in solrindex-mapping.xml In short, it helps with searching when you can slice your data using these fields On Saturday, February 16, 2013, Markus Jelsma <[email protected]> wrote: > Those are added by IndexerMapReduce (or 2.x equivalent) and index-basic. They contain the crawl datum's signature, the time stamp (see index-basic) and crawl datum score. If you think you don't need them, you can safely omit them. > > -----Original message----- >> From:[email protected] <[email protected]> >> Sent: Sat 16-Feb-2013 19:21 >> To: [email protected] >> Subject: Re: fields in solrindex-mapping.xml >> >> Hi Lewis, >> >> Why do we need to include digest, tstamp, boost and batchid fields in solrindex? >> >> Thanks. >> Alex. >> >> >> >> >> >> >> >> -----Original Message----- >> From: Lewis John Mcgibbney <[email protected]> >> To: user <[email protected]> >> Sent: Fri, Feb 15, 2013 4:21 pm >> Subject: Re: fields in solrindex-mapping.xml >> >> >> Hi Alex, >> OK so we can certainly remove segment from 2.x solr-index-mapping.xml. It >> would however be nice to replace this with the appropriate batchId. >> Can someone advise where the 'segment' field currently comes from in trunk? >> That way we can at least map the field to the batchId equivalent in 2.x >> >> Thank you >> Lewis >> >> On Fri, Feb 15, 2013 at 2:23 PM, <[email protected]> wrote: >> >> > Hi Lewis, >> > >> > If I exclude one of the fileds tstamp, digest, and boost from >> > solindex-mapping and schema.xml, solrindex gives error >> > >> > SEVERE: org.apache.solr.common.SolrException: ERROR: [doc=com.yahoo:http/] >> > unknown field 'tstamp' >> > >> > for each of above fields, except segment. >> > >> > Alex. >> > >> > >> > >> > >> > >> > >> > >> > -----Original Message----- >> > From: Lewis John Mcgibbney <[email protected]> >> > To: user <[email protected]> >> > Sent: Thu, Feb 14, 2013 8:34 pm >> > Subject: Re: fields in solrindex-mapping.xml >> > >> > >> > Hi Alex, >> > Tstamp represents fetch tiem, used for deduplication. >> > Boost is for scoring-opic and link. This is required in 2.x as well. >> > I don't have the code right now, but you can try removing digest and >> > segment. To me they both look legacy. >> > There is a wiki page on index structure which you can consult and/or add to >> > should you wish. >> > Thank you >> > Lewis >> > >> > On Thursday, February 14, 2013, <[email protected]> wrote: >> > > Hello, >> > > >> > > I see that there are >> > > >> > > <field dest="segment" source="segment"/> >> > > <field dest="boost" source="boost"/> >> > > <field dest="digest" source="digest"/> >> > > <field dest="tstamp" source="tstamp"/> >> > > >> > > fields in addition to title, host and content ones in nutch-2.x' >> > solr-mapping.xml. I thought tstamp may be needed for sorting documents. >> > What about the other fields, >> > > segment, boost and digest? Can someone explain, why these fields are >> > included in solr-mapping.xml? >> > > >> > > >> > > Thanks. >> > > Alex. >> > > >> > > >> > > >> > >> > -- >> > *Lewis* >> > >> > >> > >> >> >> -- >> *Lewis* >> >> >> > -- *Lewis*

