[ 
https://issues.apache.org/jira/browse/NUTCH-1681?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13847346#comment-13847346
 ] 

Markus Jelsma commented on NUTCH-1681:
--------------------------------------

Hi İlhami, this won't fix the toUNICODE() method. If you pass in a 
Punycode-encoded host, it is returned as Punycode rather than as readable Unicode.

{code}
assertEquals("http://www.çevir.com", 
URLUtil.toUNICODE("http://www.xn--evir-zoa.com"));
{code}
{code}
expected:<...çevir...> but was:<...xn--evir-zoa...>
junit.framework.ComparisonFailure: expected:<...çevir...> but 
was:<...xn--evir-zoa...>
{code}
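
For comparison, here is a minimal sketch of the expected behaviour using the JDK's java.net.IDN. It is not the URLUtil code and not the attached patch: the class name PunycodeDemo and the helper toUnicodeUrl are hypothetical, and the URL is reassembled by hand on the assumption that only the host needs decoding.

```java
import java.net.IDN;
import java.net.URI;

public class PunycodeDemo {

    // Hypothetical helper (not Nutch's URLUtil.toUNICODE): decodes a
    // Punycode host to Unicode and leaves the rest of the URL untouched.
    public static String toUnicodeUrl(String url) {
        // URI.create throws IllegalArgumentException on malformed input.
        URI uri = URI.create(url);
        String host = uri.getHost();
        if (host == null) {
            return url; // nothing to decode
        }

        StringBuilder sb = new StringBuilder();
        sb.append(uri.getScheme()).append("://");
        if (uri.getRawUserInfo() != null) {
            sb.append(uri.getRawUserInfo()).append('@');
        }
        // IDN.toUnicode decodes each "xn--" label; other labels pass through.
        sb.append(IDN.toUnicode(host));
        if (uri.getPort() != -1) {
            sb.append(':').append(uri.getPort());
        }
        if (uri.getRawPath() != null) {
            sb.append(uri.getRawPath());
        }
        if (uri.getRawQuery() != null) {
            sb.append('?').append(uri.getRawQuery());
        }
        if (uri.getRawFragment() != null) {
            sb.append('#').append(uri.getRawFragment());
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        System.out.println(toUnicodeUrl("http://www.xn--evir-zoa.com"));
    }
}
```

With something along these lines, the assertion above should pass, since the Punycode input has an ASCII-only host that java.net.URI can parse before the host is decoded.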

Also, could you upload patches to the ASF JIRA as SVN diffs rather than in git format?

> In URLUtil.java, toUNICODE method does not work correctly
> ---------------------------------------------------------
>
>                 Key: NUTCH-1681
>                 URL: https://issues.apache.org/jira/browse/NUTCH-1681
>             Project: Nutch
>          Issue Type: Bug
>    Affects Versions: 2.2.1
>            Reporter: İlhami KALKAN
>             Fix For: 2.2.1
>
>         Attachments: toUnicode.patch
>
>
> This method throws java.net.URISyntaxException when the parameter contains a 
> non-ASCII character, as in http://www.çevir.com.



--
This message was sent by Atlassian JIRA
(v6.1.4#6159)
