While Apache Nutch 1.3 crawling pages, i want to analyze the content of
the page and if the content contains some keywords then adding page for
next steps, say indexing. If the content do not contains at least one
key, then just getting links from that page and ignoring it. How can i
do that? Is there any filtering plugin available for this purpose? Thnx.
- Nutch Content Filtering mausmust
- RE: Nutch Content Filtering Markus Jelsma
- Re: Nutch Content Filtering mausmust
- RE: Nutch Content Filtering Markus Jelsma
- RE: Nutch Content Filtering maus must
- Re: Nutch Content Filtering Alexander Chepurnoy

