[ 
https://issues.apache.org/jira/browse/NUTCH-1727?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sertac TURKEL updated NUTCH-1727:
---------------------------------

    Attachment: NUTCH-1727.patch

I had a look domain-suffix.xml  and I saw the longest domain suffix can include 
8 characters(.internal). By default value, I picked 8 for this reason and I 
prepared a patch.  Could you review my patch?

> Length of the Tlds
> ------------------
>
>                 Key: NUTCH-1727
>                 URL: https://issues.apache.org/jira/browse/NUTCH-1727
>             Project: Nutch
>          Issue Type: Bug
>            Reporter: Sertac TURKEL
>            Priority: Minor
>             Fix For: 2.1
>
>         Attachments: NUTCH-1727.patch
>
>
> Length of the tld  should be selectable, there is some available tld's like 
> .travel and url-validator plugin filters this type of urls.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

Reply via email to