This depends on where the term you search for is located on the page you
have fetched. For example, if your http.content.limit has truncated a large
page, such as those we often get from Wikipedia, and you then search for
something ***outwith*** (outside) the part that was kept, you will not get
the snippet.
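
To make the effect concrete, here is a rough Python sketch (not Nutch or
Solr code). The helper names truncate() and snippet() and the sample page
are made up for illustration; the only thing taken from Nutch is the idea
that http.content.limit caps the fetched bytes and that a negative value
means no truncation.

```python
from typing import Optional


def truncate(content: bytes, limit: int) -> bytes:
    """Keep only the first `limit` bytes; a negative limit means no truncation.

    Hypothetical stand-in for what http.content.limit does to fetched content.
    """
    return content if limit < 0 else content[:limit]


def snippet(text: str, term: str, context: int = 20) -> Optional[str]:
    """Return the text around the first match of `term`, or None if absent."""
    pos = text.lower().find(term.lower())
    if pos < 0:
        return None
    start = max(0, pos - context)
    end = min(len(text), pos + len(term) + context)
    return text[start:end]


# Made-up page: one term near the top, another far past the cut-off.
page = b"Apache Nutch is a web crawler. " + b"filler " * 50 + b"Solr builds the snippet."

indexed = truncate(page, 100).decode("ascii")   # what gets indexed with a limit of 100
full = truncate(page, -1).decode("ascii")       # what gets indexed with no limit

early = snippet(indexed, "crawler")      # term lies inside the kept part
late = snippet(indexed, "snippet")       # term lies outwith the kept part
late_full = snippet(full, "snippet")     # same term, nothing truncated
```

Here `early` comes back with text, `late` is None because the term was cut
off before indexing, and `late_full` finds it again once the limit is lifted.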

On Wed, Oct 5, 2011 at 11:14 AM, lewis john mcgibbney <
[email protected]> wrote:

> This depends on where the term you search for is located on the page you
> have fetched. For example, if your http.content.limit has truncated a large
> page such as something which we often get from wikipedia, then you search
> for something within that truncated part then you will not get the snippet.
>
> On the other hand, if you merely want Solr to display a snippet with the
> first n-number of lines of text from your page (which in my opinion is not
> much use to users unless they are searching for page titles rather than
> content) then yes this is possible within Nutch Solr set up.
>
> Where do you "see content field but its..."? What do you mean by this?
>
> In the past I've just used parse-tika and am pretty positive that, given
> the correct Solr query, you can quite easily obtain what you are after.
>
> On Wed, Oct 5, 2011 at 5:05 AM, abhayd <[email protected]> wrote:
>
>> hi
>> We crawl websites using Nutch 1.3 and the index is sent to Solr.
>> The default schema provided with Nutch does not have a summary or snippet of
>> actual content. I see content field but it's all menu, header etc. included.
>>
>> Can nutch create snippet which can be fed to solr? Or what needs to be
>> done
>> for creating snippet in search results page
>>
>> Any help?
>>
>> --
>> View this message in context:
>> http://lucene.472066.n3.nabble.com/where-is-the-snippet-tp3395557p3395557.html
>> Sent from the Nutch - User mailing list archive at Nabble.com.
>>
>
>
>
> --
> *Lewis*
>
>


-- 
*Lewis*
