I'm not familiar with that plugin. DiscoverEd adds an aggregate step
to the crawl process to handle this sort of situation. For our
deployment, the aggregate step polls the RSS feed and adds the
resources it finds to an RDF store, which can then be used to generate
the seed list. You may want to ask on the nutch-users email list for
Nutch-specific questions.
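
As a rough illustration of the idea (not DiscoverEd's actual code), the
sketch below polls a feed and writes the item links to a seed file, so
that the linked pages are seeded rather than the feed itself. It skips
the RDF store entirely, assumes a plain RSS 2.0 feed, and the function
names and seed path are made up for the example.

```python
import urllib.request
import xml.etree.ElementTree as ET


def extract_item_links(rss_xml):
    """Return the <link> of each <item> in an RSS 2.0 document.

    Order is preserved and duplicates are dropped. The channel-level
    <link> is ignored because only <item> elements are inspected.
    """
    root = ET.fromstring(rss_xml)
    links = []
    for item in root.iter("item"):
        link = (item.findtext("link") or "").strip()
        if link and link not in links:
            links.append(link)
    return links


def build_seed(feed_url, seed_path):
    """Fetch the feed and write one item URL per line to seed_path.

    Hypothetical helper: the feed URL itself is never written, only
    the pages the feed points to.
    """
    with urllib.request.urlopen(feed_url) as resp:
        rss_xml = resp.read()
    links = extract_item_links(rss_xml)
    with open(seed_path, "w", encoding="utf-8") as out:
        for link in links:
            out.write(link + "\n")
    return links
```

Calling build_seed("http://www.edutube.org/en/taxonomy/term/7/feed",
"urls/seed.txt") would then leave a file listing only the item pages,
which could be used as the crawl's seed list.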

Regards,

Nathan


On Sat, Oct 23, 2010 at 1:47 PM, Israel <[email protected]> wrote:
> Hi,
>
> I have this problem: my Nutch installation reads RSS feeds, for example:
>
>
> http://www.edutube.org/en/taxonomy/term/7/feed
>
> But I want Nutch to crawl and search the pages linked from the feed, and
> not to return the RSS feed itself in search results; that is, it should
> not return a link to the feed page:
>
> http://www.edutube.org/en/taxonomy/term/7/feed
>
>
> Does anybody have the "SyndFeedPoller" plugin, or know how I can do this?
>
> Thanks.
>
> _______________________________________________
> cc-devel mailing list
> [email protected]
> http://lists.ibiblio.org/mailman/listinfo/cc-devel
>
>