about snippets (with attachement)
A: user@nutch.apache.org
that's can be good?
http://192.168.1.5:8983/WoWSolrWebApp/search?query=giocosubmit=Search
Il giorno 06 aprile 2012 22:29, Lewis John Mcgibbney
lewis.mcgibb...@gmail.com ha scritto:
It would be easier if you could provide an URL
or this:
http://pc-alessio:8983/*WoWSolrWebApp/search?query=giocosubmit=Search*
-- Messaggio inoltrato --
Da: alessio crisantemi alessio.crisant...@gmail.com
Date: 06 aprile 2012 22:42
Oggetto: Re: request about snippets (with attachement)
A: user@nutch.apache.org
From the limited HTML that I've seen I can only assume that the offending
xhtml is in the content field.
If this is the case then you will need to write a custom plugin
implementation that removes this. There is loads of info allowing you to
get up to speed with plugins on our wiki.[0]
Once you
thank you agin Lewis,
but do you think that my strange content field it's for my cause?
beacuse I disabled the indexing of about all field.
this is my schema:
fields
field name=id type=string stored=true indexed=true/
!-- core fields --
field name=segment type=string
It would be easier if you could provide an URL and people can see exactly
what you are struggling with please?
2012/4/6 alessio crisantemi alessio.crisant...@gmail.com
any suggestions for my cause?
Il giorno 05 aprile 2012 23:20, alessio crisantemi
alessio.crisant...@gmail.com ha scritto:
that's can be good?
http://192.168.1.5:8983/WoWSolrWebApp/search?query=giocosubmit=Search
Il giorno 06 aprile 2012 22:29, Lewis John Mcgibbney
lewis.mcgibb...@gmail.com ha scritto:
It would be easier if you could provide an URL and people can see exactly
what you are struggling with please?
or this:
http://pc-alessio:8983/*WoWSolrWebApp/search?query=giocosubmit=Search*
-- Messaggio inoltrato --
Da: alessio crisantemi alessio.crisant...@gmail.com
Date: 06 aprile 2012 22:42
Oggetto: Re: request about snippets (with attachement)
A: user@nutch.apache.org
that's can
-- Messaggio inoltrato --
Da: alessio crisantemi alessio.crisant...@gmail.com
Date: 05 aprile 2012 22:32
Oggetto: request about snippets
A: user@nutch.apache.org
Dear all,
I configured my Nutch (1.4) for works with Solr (1.4.1) and I crawl and
index with success my website.
I
Hi Alessio,
You need to determine in which field the unwanted content exists. Once
you've done this you could write an indexing filter to remove this from
your document prior to indexing.
Lewis
On Thu, Apr 5, 2012 at 9:41 PM, alessio crisantemi
alessio.crisant...@gmail.com wrote:
Dear Lewis, thank you for your fast reply.
But just thiat's my problem! I don't compred wich is the field that crates
this raw.
But I see a date (eg: Mercoledì Apr 04) followed by the word parent
anche after and the the ame of categories (Home NEWSLOT/VLT SCOMMESSE
ONLINE LOTTERIE Politica Video
I can't see any of your attachments as they're not permitted on list.
Can you provide an URL?
On Thu, Apr 5, 2012 at 9:56 PM, alessio crisantemi
alessio.crisant...@gmail.com wrote:
Dear Lewis, thank you for your fast reply.
But just thiat's my problem! I don't compred wich is the field that
Seems to me it's just the breadcrumb of the page popping up in Solr's
highlighter snippet?
In Thu, 5 Apr 2012 22:02:31 +0100, Lewis John Mcgibbney
lewis.mcgibb...@gmail.com wrote:
I can't see any of your attachments as they're not permitted on list.
Can you provide an URL?
On Thu, Apr 5,
what is it 'breadcrumb' Markus?
Il giorno 05 aprile 2012 23:08, Markus Jelsma
markus.jel...@openindex.ioha scritto:
Seems to me it's just the breadcrumb of the page popping up in Solr's
highlighter snippet?
In Thu, 5 Apr 2012 22:02:31 +0100, Lewis John Mcgibbney
lewis.mcgibb...@gmail.com
here a part of results:
[2] Live Score - GiocoNews - Tutto su casinò, poker, giochi
onlinehttp://www.gioconews.it/live-score.html Live
Score - *Gioco*News - Tutto su casinò, poker, giochi online Mercoledì Apr
04 Home NEWSLOT/VLT SCOMMESSE ONLINE LOTTERIE Politica Video Live Score
Home Live
14 matches
Mail list logo