[ https://issues.apache.org/jira/browse/VALIDATOR-235?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14264601#comment-14264601 ]
Sebb commented on VALIDATOR-235:
--------------------------------
Two possible approaches to implementing this:
1) Use java.net.IDN.toASCII() to convert the input to ASCII.
The advantage is that the existing REs and TLDs will then work with no further
change.
However, that requires Java 1.6 at run time, and also at compile time unless
reflection is used.
2) Update the regexes to allow Unicode alphabetic and numeric characters where
appropriate, and include the set of Unicode TLDs in the list.
This will work with Java 1.4 (compile and run).
The disadvantage is that it requires more data storage, and it's possible that
the REs won't exactly match the allowable range of Unicode characters.
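A rough sketch of how the two approaches might compare, in Java. This is
illustrative only and is not the Commons Validator code: the class IdnSketch,
both patterns and both method names are hypothetical, and the patterns are far
simpler than the validator's real expressions. The first method needs Java 1.6
or later because of java.net.IDN; the second uses only java.util.regex.

```java
import java.net.IDN;
import java.util.regex.Pattern;

// Hypothetical helper class for illustration; not part of Commons Validator.
final class IdnSketch {

    // Assumed stand-in for an existing ASCII-only host pattern.
    private static final Pattern ASCII_HOST = Pattern.compile(
            "^(?:[A-Za-z0-9](?:[A-Za-z0-9-]*[A-Za-z0-9])?\\.)+[A-Za-z]{2,}$");

    // Approach 2: the same shape, with Unicode letter (\p{L}) and
    // number (\p{N}) classes in place of the ASCII ranges.
    private static final Pattern UNICODE_HOST = Pattern.compile(
            "^(?:[\\p{L}\\p{N}](?:[\\p{L}\\p{N}-]*[\\p{L}\\p{N}])?\\.)+\\p{L}{2,}$");

    // Approach 1: convert to the ASCII (Punycode) form first, then reuse
    // the unchanged ASCII pattern.
    static boolean isValidViaIdn(String host) {
        try {
            return ASCII_HOST.matcher(IDN.toASCII(host)).matches();
        } catch (IllegalArgumentException e) {
            // IDN.toASCII rejects input that cannot be converted.
            return false;
        }
    }

    // Approach 2: match the Unicode form directly.
    static boolean isValidViaUnicodeRegex(String host) {
        return UNICODE_HOST.matcher(host).matches();
    }

    public static void main(String[] args) {
        // Host from the reported URL, written with a Unicode escape so the
        // source file encoding does not matter.
        String host = "www.d\u00fcsseldorf.de";
        System.out.println(isValidViaIdn(host));
        System.out.println(isValidViaUnicodeRegex(host));
    }
}
```

The Unicode classes in the second pattern are broader than what is actually
permitted in an internationalized domain name, which illustrates the concern
about the REs not matching the allowable range exactly.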
> UrlValidator rejects url with german umlaut
> -------------------------------------------
>
> Key: VALIDATOR-235
> URL: https://issues.apache.org/jira/browse/VALIDATOR-235
> Project: Commons Validator
> Issue Type: Bug
> Affects Versions: 1.3.1 Release
> Reporter: Brian Preuß
>
> e.g. http://www.düsseldorf.de/
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)