[ https://issues.apache.org/jira/browse/NUTCH-1681?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13847346#comment-13847346 ]
Markus Jelsma commented on NUTCH-1681:
--------------------------------------
Hi İlhami, this won't fix the toUNICODE() method. If you pass in a URL with a
Punycode-encoded host, the host is returned still in Punycode, not as readable Unicode.
{code}
assertEquals("http://www.çevir.com",
URLUtil.toUNICODE("http://www.xn--evir-zoa.com"));
{code}
{code}
expected:<...çevir...> but was:<...xn--evir-zoa...>
junit.framework.ComparisonFailure: expected:<...çevir...> but was:<...xn--evir-zoa...>
{code}
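For comparison, here is a minimal sketch of the behaviour the test above expects, using only the JDK's java.net.IDN and java.net.URL. The class PunycodeHostDemo and the helper toUnicodeHost are hypothetical names made up for illustration; this is not the URLUtil code and not the attached patch. It assumes the input is an absolute URL and returns the input unchanged if it cannot be parsed.

```java
import java.net.IDN;
import java.net.MalformedURLException;
import java.net.URL;

public class PunycodeHostDemo {

  // Hypothetical helper (not Nutch's URLUtil.toUNICODE): decodes only the
  // host part from Punycode to Unicode and leaves the rest of the URL alone.
  public static String toUnicodeHost(String url) {
    URL u;
    try {
      u = new URL(url);
    } catch (MalformedURLException e) {
      // Assumption: unparsable input is returned as-is.
      return url;
    }

    // IDN.toUnicode converts each "xn--" label; plain ASCII labels pass through.
    String unicodeHost = IDN.toUnicode(u.getHost());

    StringBuilder sb = new StringBuilder();
    sb.append(u.getProtocol()).append("://");
    if (u.getUserInfo() != null) {
      sb.append(u.getUserInfo()).append('@');
    }
    sb.append(unicodeHost);
    if (u.getPort() != -1) {
      sb.append(':').append(u.getPort());
    }
    sb.append(u.getFile()); // path plus query string, if any
    if (u.getRef() != null) {
      sb.append('#').append(u.getRef());
    }
    return sb.toString();
  }

  public static void main(String[] args) {
    System.out.println(toUnicodeHost("http://www.xn--evir-zoa.com"));
  }
}
```

With this sketch, toUnicodeHost("http://www.xn--evir-zoa.com") should give http://www.çevir.com, which is the result the assertion above is looking for.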
Also, could you upload patches at ASF as SVN diffs rather than in git format?
> In URLUtil.java, toUNICODE method does not work correctly
> ---------------------------------------------------------
>
> Key: NUTCH-1681
> URL: https://issues.apache.org/jira/browse/NUTCH-1681
> Project: Nutch
> Issue Type: Bug
> Affects Versions: 2.2.1
> Reporter: İlhami KALKAN
> Fix For: 2.2.1
>
> Attachments: toUnicode.patch
>
>
> This method throws java.net.URISyntaxException when the parameter contains a
> non-ASCII character, as in http://www.çevir.com.