This depends on where the term you search for is located on the page you have fetched. For example, if your http.content.limit has truncated a large page such as something which we often get from wikipedia, then you search for something ***outwith*** that truncated part then you will not get the snippet.
On Wed, Oct 5, 2011 at 11:14 AM, lewis john mcgibbney < [email protected]> wrote: > This depends on where the term you search for is located on the page you > have fetched. For example, if your http.content.limit has truncated a large > page such as something which we often get from wikipedia, then you search > for something within that truncated part then you will not get the snippet. > > On the other hand, if you merely want Solr to display a snippet with the > first n-number of lines of text from your page (which in my opinion is not > much use to users unless they are searching for page titles rather than > content) then yes this is possible within Nutch Solr set up. > > Where do you "see content filed but its..."? What do you mean by this? > > In the past I've just used parse tika and petty positive that given the > correct Solr query you can quite easily obtain what you are after. > > On Wed, Oct 5, 2011 at 5:05 AM, abhayd <[email protected]> wrote: > >> hi >> we crawl websites using nutch 1.3 and index is sent to solr. >> Default schema provided with nutch does not have summary or snippet of >> actual content. I see content filed but its all menu, header etc included. >> >> Can nutch create snippet which can be fed to solr? Or what needs to be >> done >> for creating snippet in search results page >> >> Any help? >> >> -- >> View this message in context: >> http://lucene.472066.n3.nabble.com/where-is-the-snippet-tp3395557p3395557.html >> Sent from the Nutch - User mailing list archive at Nabble.com. >> > > > > -- > *Lewis* > > -- *Lewis*

