[ 
https://issues.apache.org/jira/browse/NUTCH-1730?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14609740#comment-14609740
 ] 

Markus Jelsma commented on NUTCH-1730:
--------------------------------------

Hello Sebastian!

* thanksI The unit tests are not affected as both have the same typo
* of course!
* yes, -1 disables it completely and 0 is a non-sensible depth as well

The use-case is that if you want to crawl many different hosts and not restrict 
them to the initial seed that was another host. You are right about linking to 
external deep page indeed. So this approach is flawed. Depth must always be 
controlled from the domain root!

> Scoring-depth optionally not to increment depth for external hosts
> ------------------------------------------------------------------
>
>                 Key: NUTCH-1730
>                 URL: https://issues.apache.org/jira/browse/NUTCH-1730
>             Project: Nutch
>          Issue Type: New Feature
>    Affects Versions: 1.7
>            Reporter: Markus Jelsma
>            Assignee: Markus Jelsma
>             Fix For: 1.11
>
>         Attachments: NUTCH-1730-trunk.patch, NUTCH-1730.patch
>
>
> Currently, the plugin always increments depth, even when coming or going to 
> external hosts.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to