Currently it looks we like don't have full support for such
functionality. It is straight foward to grab the nofollow rel tag but
the post processing is not currently implemented therefore you would
need to do this yourself.

Lewis

On Thu, Aug 16, 2012 at 5:27 AM, weishenyun <[email protected]> wrote:
> I know Nutch crawl the website according to Robot protocol if you make that
> configuration. And it will not fetch and parse the link on the page which
> contains <meta name="robots" content="nofollow">. But can Nutch process
> rel-tag likes rel="nofollow" in the tags  ......  on the page?
>
>
>
> --
> View this message in context: 
> http://lucene.472066.n3.nabble.com/Can-Nutch-process-rel-tag-likes-rel-nofollow-tp4001541.html
> Sent from the Nutch - Dev mailing list archive at Nabble.com.



-- 
Lewis

Reply via email to