[ 
https://issues.apache.org/jira/browse/NUTCH-1681?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13855551#comment-13855551
 ] 

Sebastian Nagel commented on NUTCH-1681:
----------------------------------------

Hi [~markus17], the solution fails for "http://uni-tübingen.de/" (see 
NUTCH-1685). 
[IDN.toUnicode|http://download.java.net/jdk8/docs/api/java/net/IDN.html#toUnicode-java.lang.String-]
 seems to be applicable only to domain names, not to full URLs.
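
One possible way around this, sketched here under assumptions, is to split the URL first and pass only the host part to IDN.toUnicode. The class name IdnHostSketch and the method toUnicodeUrl are hypothetical; they are not taken from URLUtil or from any of the attached patches. Returning the input unchanged for a malformed URL is also only an assumption about the desired behaviour.

```java
import java.net.IDN;
import java.net.MalformedURLException;
import java.net.URL;

// Hypothetical helper, not the actual Nutch URLUtil code.
final class IdnHostSketch {

    // Converts only the host of a URL from Punycode (ACE) to Unicode and
    // leaves scheme, user info, port, path, query and fragment untouched.
    static String toUnicodeUrl(String url) {
        final URL u;
        try {
            // java.net.URL parses leniently and accepts non-ASCII hosts.
            u = new URL(url);
        } catch (MalformedURLException e) {
            // Assumption: hand back the input as-is if it cannot be parsed.
            return url;
        }
        String host = u.getHost();
        if (host == null || host.isEmpty()) {
            return url;
        }

        StringBuilder sb = new StringBuilder();
        sb.append(u.getProtocol()).append("://");
        if (u.getUserInfo() != null) {
            sb.append(u.getUserInfo()).append('@');
        }
        // IDN.toUnicode is applied to the domain name only.
        sb.append(IDN.toUnicode(host));
        if (u.getPort() != -1) {
            sb.append(':').append(u.getPort());
        }
        sb.append(u.getFile()); // path plus query string, if any
        if (u.getRef() != null) {
            sb.append('#').append(u.getRef());
        }
        return sb.toString();
    }
}
```

With this sketch, a URL whose host is the Punycode form of uni-tübingen.de (as produced by IDN.toASCII) should come back as "http://uni-tübingen.de/", and a URL that already has a Unicode host should pass through unchanged.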

> In URLUtil.java, toUNICODE method does not work correctly
> ---------------------------------------------------------
>
>                 Key: NUTCH-1681
>                 URL: https://issues.apache.org/jira/browse/NUTCH-1681
>             Project: Nutch
>          Issue Type: Bug
>    Affects Versions: 2.2.1
>            Reporter: İlhami KALKAN
>             Fix For: 1.9, 2.2.1
>
>         Attachments: NUTCH-1681-1.8.patch, NUTCH-1681-1.8.patch, 
> toUnicode.patch
>
>
> This method throws java.net.URISyntaxException when the parameter contains a 
> non-ASCII character, as in http://www.çevir.com.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)
