[ 
https://issues.apache.org/jira/browse/NUTCH-2220?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Markus Jelsma updated NUTCH-2220:
---------------------------------
    Attachment: NUTCH-2220.patch

Patch for trunk

> Rename db.* options used only by the linkdb to linkdb.*
> -------------------------------------------------------
>
>                 Key: NUTCH-2220
>                 URL: https://issues.apache.org/jira/browse/NUTCH-2220
>             Project: Nutch
>          Issue Type: Task
>          Components: linkdb
>    Affects Versions: 1.11
>            Reporter: Markus Jelsma
>            Assignee: Markus Jelsma
>             Fix For: 1.12
>
>         Attachments: NUTCH-2220.patch
>
>
> We need an option db.ignore.internal.links that operates in FetcherThread,
> just like db.ignore.external.links. It already exists, but it is only used by
> the LinkDB and defaults to true, which is not a good default for FetcherThread.
> I propose to make a clear distinction between options that are used by the
> LinkDB and those that are not. Most options used by the LinkDB already have
> the right prefix, but db.ignore.*.links, db.max.inlinks and
> db.max.anchor.length do not yet.
> This patch renames those options to use the linkdb.* prefix, so that
> afterwards we can implement db.ignore.internal.links that operates in
> FetcherThread, just like db.ignore.external.links.
> This will introduce a change in default parameters. Please comment.
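
For anyone with existing overrides, the rename described above could be checked with something like the following. This is only a rough sketch, not part of the attached patch: the class and method names are hypothetical, it uses plain java.util.Properties rather than Nutch's own configuration classes, and the linkdb.* names are assumed by simply swapping the prefix, since the description only says the options move to linkdb.* prefixes.

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.Properties;

// Hypothetical helper, not part of Nutch or of NUTCH-2220.patch.
public class LinkDbOptionRename {

    // Old db.* option names mapped to their assumed linkdb.* replacements.
    // The new names are an assumption: the old name with the prefix swapped.
    static final Map<String, String> RENAMES = new LinkedHashMap<>();
    static {
        RENAMES.put("db.ignore.internal.links", "linkdb.ignore.internal.links");
        RENAMES.put("db.ignore.external.links", "linkdb.ignore.external.links");
        RENAMES.put("db.max.inlinks", "linkdb.max.inlinks");
        RENAMES.put("db.max.anchor.length", "linkdb.max.anchor.length");
    }

    // Returns the old option names that are set in conf, each mapped to the
    // name it would have after the rename. Nothing in conf is modified.
    static Map<String, String> findLegacyKeys(Properties conf) {
        Map<String, String> found = new LinkedHashMap<>();
        for (Map.Entry<String, String> e : RENAMES.entrySet()) {
            if (conf.getProperty(e.getKey()) != null) {
                found.put(e.getKey(), e.getValue());
            }
        }
        return found;
    }

    public static void main(String[] args) {
        Properties conf = new Properties();
        conf.setProperty("db.max.inlinks", "5000"); // example override only
        for (Map.Entry<String, String> e : findLegacyKeys(conf).entrySet()) {
            System.out.println(e.getKey() + " -> " + e.getValue());
        }
    }
}
```

The sketch only reports which overrides are affected and does not copy values across. That is deliberate: db.ignore.internal.links is meant to take on a new meaning in FetcherThread, so whether an existing value should carry over is a decision for the operator.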



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
