[ https://issues.apache.org/jira/browse/VALIDATOR-235?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14264062#comment-14264062 ]
Sebb commented on VALIDATOR-235:
--------------------------------
There are two parts to this.
1) Syntax validation.
At present the regex does not allow Unicode characters, because RFC 3986 does
not permit them.
2) Domain validation.
At present the Unicode versions of TLDs are not included in the DomainValidator
tables, and the code does not convert Unicode domains to Punycode in order to
check them against the Punycode entries.
There is no point doing either of these until the Regex issues are sorted.
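To illustrate the Punycode conversion mentioned under domain validation, here is a minimal sketch using the JDK's java.net.IDN class. The class name IdnSketch and the method toAsciiHost are hypothetical names for illustration only; this is not code from DomainValidator, just one way the conversion step could look before a table lookup.

```java
import java.net.IDN;

// Hypothetical helper, not part of Commons Validator.
final class IdnSketch {

    // Converts a host name that may contain Unicode labels into its
    // ASCII-compatible (Punycode) form. IDN.toASCII throws
    // IllegalArgumentException if the input cannot be converted.
    static String toAsciiHost(String host) {
        return IDN.toASCII(host);
    }

    public static void main(String[] args) {
        // \u00fc is the u-umlaut from the reported URL.
        System.out.println(toAsciiHost("www.d\u00fcsseldorf.de"));
        // prints www.xn--dsseldorf-q9a.de
    }
}
```

The resulting ASCII form could then be checked against the existing Punycode entries, assuming the syntax check has already let the Unicode host through.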
Presumably the intention is to extend the RFC 3986 syntax so that ALPHA
characters can now include Unicode letters, and similarly for alphanumerics.
There may be some exceptions, though, so we need to identify the relevant RFCs.
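As a rough sketch of what extending the alpha and alphanumeric classes might mean for a single domain label, the following compares an ASCII-only pattern with a relaxed one based on Unicode letter and number categories. Both patterns and the class name LabelSyntaxSketch are assumptions made for illustration; they are not the patterns UrlValidator uses, and the relaxed one ignores whatever exceptions the relevant RFCs may impose.

```java
import java.util.regex.Pattern;

// Hypothetical illustration, not the actual UrlValidator regex.
final class LabelSyntaxSketch {

    // ASCII letters and digits, with hyphens allowed only in the interior.
    static final Pattern ASCII_LABEL =
            Pattern.compile("[a-zA-Z0-9](?:[a-zA-Z0-9-]*[a-zA-Z0-9])?");

    // Same shape, but any Unicode letter (\p{L}) or number (\p{N}) is accepted.
    static final Pattern UNICODE_LABEL =
            Pattern.compile("[\\p{L}\\p{N}](?:[\\p{L}\\p{N}-]*[\\p{L}\\p{N}])?");

    static boolean isAsciiLabel(String label) {
        return ASCII_LABEL.matcher(label).matches();
    }

    static boolean isUnicodeLabel(String label) {
        return UNICODE_LABEL.matcher(label).matches();
    }

    public static void main(String[] args) {
        String label = "d\u00fcsseldorf"; // label from the reported URL
        System.out.println(isAsciiLabel(label));   // false
        System.out.println(isUnicodeLabel(label)); // true
    }
}
```

A blanket \p{L} class is almost certainly too permissive for real use, which is why the relevant RFCs are needed before changing the regex.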
> UrlValidator rejects url with german umlaut
> -------------------------------------------
>
> Key: VALIDATOR-235
> URL: https://issues.apache.org/jira/browse/VALIDATOR-235
> Project: Commons Validator
> Issue Type: Bug
> Affects Versions: 1.3.1 Release
> Reporter: Brian Preuß
>
> e.g. http://www.düsseldorf.de/