Re: request about snippets (with attachement)

2012-04-07 Thread Lewis John Mcgibbney
about snippets (with attachement) To: user@nutch.apache.org Would this work? http://192.168.1.5:8983/WoWSolrWebApp/search?query=giocosubmit=Search On 6 April 2012 22:29, Lewis John Mcgibbney lewis.mcgibb...@gmail.com wrote: It would be easier if you could provide a URL

Re: request about snippets (with attachement)

2012-04-07 Thread alessio crisantemi
Or this: http://pc-alessio:8983/*WoWSolrWebApp/search?query=giocosubmit=Search* -- Forwarded message -- From: alessio crisantemi alessio.crisant...@gmail.com Date: 6 April 2012 22:42 Subject: Re: request about snippets (with attachement) To: user@nutch.apache.org

Re: request about snippets (with attachement)

2012-04-07 Thread Lewis John Mcgibbney
From the limited HTML that I've seen, I can only assume that the offending XHTML is in the content field. If this is the case, then you will need to write a custom plugin implementation that removes it. There is plenty of information on our wiki to get you up to speed with plugins.[0] Once you

Re: request about snippets (with attachement)

2012-04-07 Thread alessio crisantemi
Thank you again Lewis, but do you think that my strange content field is the cause of my problem? I ask because I disabled indexing for almost all fields. This is my schema: <fields> <field name="id" type="string" stored="true" indexed="true"/> <!-- core fields --> <field name="segment" type="string"

Re: request about snippets (with attachement)

2012-04-06 Thread Lewis John Mcgibbney
It would be easier if you could provide a URL so people can see exactly what you are struggling with. 2012/4/6 alessio crisantemi alessio.crisant...@gmail.com Any suggestions for my problem? On 5 April 2012 23:20, alessio crisantemi alessio.crisant...@gmail.com wrote:

Re: request about snippets (with attachement)

2012-04-06 Thread alessio crisantemi
Would this work? http://192.168.1.5:8983/WoWSolrWebApp/search?query=giocosubmit=Search On 6 April 2012 22:29, Lewis John Mcgibbney lewis.mcgibb...@gmail.com wrote: It would be easier if you could provide a URL so people can see exactly what you are struggling with.

Fwd: request about snippets (with attachement)

2012-04-06 Thread alessio crisantemi
Or this: http://pc-alessio:8983/*WoWSolrWebApp/search?query=giocosubmit=Search* -- Forwarded message -- From: alessio crisantemi alessio.crisant...@gmail.com Date: 6 April 2012 22:42 Subject: Re: request about snippets (with attachement) To: user@nutch.apache.org Would this

Fwd: request about snippets (with attachement)

2012-04-05 Thread alessio crisantemi
-- Forwarded message -- From: alessio crisantemi alessio.crisant...@gmail.com Date: 5 April 2012 22:32 Subject: request about snippets To: user@nutch.apache.org Dear all, I configured Nutch (1.4) to work with Solr (1.4.1), and I successfully crawled and indexed my website. I

Re: request about snippets (with attachement)

2012-04-05 Thread Lewis John Mcgibbney
Hi Alessio, You need to determine in which field the unwanted content exists. Once you've done this you could write an indexing filter to remove this from your document prior to indexing. Lewis On Thu, Apr 5, 2012 at 9:41 PM, alessio crisantemi alessio.crisant...@gmail.com wrote:
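The two steps Lewis describes (find the field that holds the unwanted text, then remove it before indexing) can be sketched as below. This is only an illustration of the logic in Python, not a Nutch plugin: a real Nutch indexing filter is written in Java against the plugin API. The helper names `fields_containing` and `remove_unwanted`, the sample document, and its field values are all hypothetical.

```python
def fields_containing(doc, unwanted):
    """Return the names of fields whose text includes the unwanted string."""
    hits = []
    for name, value in doc.items():
        # Treat single-valued and multi-valued fields the same way.
        values = value if isinstance(value, list) else [value]
        if any(unwanted in str(v) for v in values):
            hits.append(name)
    return hits


def remove_unwanted(doc, field, unwanted):
    """Return a copy of doc with the unwanted string cut from one single-valued field."""
    cleaned = dict(doc)
    # Replace the unwanted text, then collapse the leftover whitespace.
    cleaned[field] = " ".join(str(doc[field]).replace(unwanted, " ").split())
    return cleaned


# Hypothetical document; field names and values are illustrative only.
doc = {
    "id": "http://www.gioconews.it/live-score.html",
    "title": "Live Score - GiocoNews",
    "content": "Live Score Home NEWSLOT/VLT SCOMMESSE ONLINE Live Score results",
}
breadcrumb = "Home NEWSLOT/VLT SCOMMESSE ONLINE"

print(fields_containing(doc, breadcrumb))
print(remove_unwanted(doc, "content", breadcrumb)["content"])
```

Checking every field first, rather than assuming the content field, matches the advice above: the removal step only makes sense once the offending field is known.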

Re: request about snippets (with attachement)

2012-04-05 Thread alessio crisantemi
Dear Lewis, thank you for your fast reply. But that is exactly my problem! I don't understand which field creates this row. I see a date (e.g. Mercoledì Apr 04) followed by the word parent and, after that, the names of the categories (Home NEWSLOT/VLT SCOMMESSE ONLINE LOTTERIE Politica Video

Re: request about snippets (with attachement)

2012-04-05 Thread Lewis John Mcgibbney
I can't see any of your attachments as they're not permitted on list. Can you provide a URL? On Thu, Apr 5, 2012 at 9:56 PM, alessio crisantemi alessio.crisant...@gmail.com wrote: Dear Lewis, thank you for your fast reply. But that is exactly my problem! I don't understand which field

Re: request about snippets (with attachement)

2012-04-05 Thread Markus Jelsma
Seems to me it's just the breadcrumb of the page popping up in Solr's highlighter snippet? On Thu, 5 Apr 2012 22:02:31 +0100, Lewis John Mcgibbney lewis.mcgibb...@gmail.com wrote: I can't see any of your attachments as they're not permitted on list. Can you provide a URL? On Thu, Apr 5,
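To illustrate how a breadcrumb ends up in the indexed text, and one way to keep it out, here is a minimal sketch using Python's standard `html.parser`. It assumes the breadcrumb is marked with a `breadcrumb` class, which is a guess and not something the thread confirms about the actual page; the class `TextWithoutBreadcrumb`, the function `extract_text`, and the sample markup are hypothetical.

```python
from html.parser import HTMLParser


class TextWithoutBreadcrumb(HTMLParser):
    """Collect page text, skipping any element whose class list includes 'breadcrumb'."""

    def __init__(self):
        super().__init__()
        self.parts = []
        self.skip_tag = None  # tag name of the breadcrumb element being skipped
        self.depth = 0        # nesting depth of that tag name inside the skipped element

    def handle_starttag(self, tag, attrs):
        if self.skip_tag is None:
            classes = (dict(attrs).get("class") or "").split()
            if "breadcrumb" in classes:
                self.skip_tag, self.depth = tag, 1
        elif tag == self.skip_tag:
            self.depth += 1

    def handle_endtag(self, tag):
        if tag == self.skip_tag:
            self.depth -= 1
            if self.depth == 0:
                self.skip_tag = None

    def handle_data(self, data):
        # Keep text only while we are outside the breadcrumb element.
        if self.skip_tag is None and data.strip():
            self.parts.append(data.strip())


def extract_text(html):
    parser = TextWithoutBreadcrumb()
    parser.feed(html)
    parser.close()
    return " ".join(parser.parts)


# Hypothetical page markup, for illustration only.
page = (
    '<body><ul class="breadcrumb"><li>Home</li><li>Live Score</li></ul>'
    "<h1>Live Score</h1><p>Results of the day</p></body>"
)
print(extract_text(page))
```

A plain text extraction of the same page would include "Home Live Score" before the heading, which is the kind of navigation text that then shows up in a highlighter snippet.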

Re: request about snippets (with attachement)

2012-04-05 Thread alessio crisantemi
What is a 'breadcrumb', Markus? On 5 April 2012 23:08, Markus Jelsma markus.jel...@openindex.io wrote: Seems to me it's just the breadcrumb of the page popping up in Solr's highlighter snippet? On Thu, 5 Apr 2012 22:02:31 +0100, Lewis John Mcgibbney lewis.mcgibb...@gmail.com

Re: request about snippets (with attachement)

2012-04-05 Thread alessio crisantemi
Here is part of the results: [2] Live Score - GiocoNews - Tutto su casinò, poker, giochi online http://www.gioconews.it/live-score.html Live Score - *Gioco*News - Tutto su casinò, poker, giochi online Mercoledì Apr 04 Home NEWSLOT/VLT SCOMMESSE ONLINE LOTTERIE Politica Video Live Score Home Live