[ 
https://issues.apache.org/jira/browse/VALIDATOR-235?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14264062#comment-14264062
 ] 

Sebb commented on VALIDATOR-235:
--------------------------------

There are two parts to this.

1) Syntax validation.
At present the Regex does not allow Unicode characters, because they are not 
permitted by RFC 3986.

2) Domain validation.
At present the Unicode versions of TLDs are not included in the DomainValidator 
tables, and the code does not convert Unicode domains to Punycode in order to 
check them against the Punycode entries.
There is no point in doing either of these until the Regex issues are sorted out.
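
For reference, a minimal sketch of the Punycode step from part 2, assuming the 
JDK's java.net.IDN class is acceptable for the conversion. The class name, the 
helper names and the tiny KNOWN_TLDS set are hypothetical stand-ins for 
illustration only; they are not the real DomainValidator tables or API.

```java
import java.net.IDN;
import java.util.Arrays;
import java.util.HashSet;
import java.util.Locale;
import java.util.Set;

public class IdnDomainSketch {

    // Hypothetical stand-in for the TLD tables: ASCII/Punycode entries only.
    // "xn--p1ai" is the Punycode form of a Cyrillic TLD.
    private static final Set<String> KNOWN_TLDS =
            new HashSet<>(Arrays.asList("com", "de", "xn--p1ai"));

    // Convert a possibly-Unicode host name to its ASCII (Punycode) form.
    // IDN.toASCII throws IllegalArgumentException if the input cannot be converted.
    static String toAsciiHost(String host) {
        return IDN.toASCII(host).toLowerCase(Locale.ROOT);
    }

    // Check the last label of the converted host against the ASCII-only table.
    static boolean hasKnownTld(String host) {
        String ascii;
        try {
            ascii = toAsciiHost(host);
        } catch (IllegalArgumentException e) {
            return false;
        }
        int dot = ascii.lastIndexOf('.');
        String tld = dot < 0 ? ascii : ascii.substring(dot + 1);
        return KNOWN_TLDS.contains(tld);
    }

    public static void main(String[] args) {
        String host = "www.d\u00fcsseldorf.de"; // the host from the report
        System.out.println(toAsciiHost(host));
        System.out.println(hasKnownTld(host));
    }
}
```

Converting first and then looking up the ASCII form means the tables would only 
need the Punycode entries, not a second set of Unicode ones.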

Presumably the intention is to extend the RFC 3986 syntax so that Alpha 
characters can also include Unicode letters, and similarly for Alphanumerics. 
There may be some exceptions, though, so we need the relevant RFCs first.
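
To illustrate the difference, here is a rough sketch of an ASCII-only label 
pattern next to a Unicode-aware one. Both patterns and the class name are 
hypothetical and much simpler than the Regex actually used by UrlValidator; 
which Unicode categories should really be allowed is exactly what the relevant 
RFCs would have to settle.

```java
import java.util.regex.Pattern;

public class LabelRegexSketch {

    // ASCII-only domain label: letters, digits and inner hyphens.
    static final Pattern ASCII_LABEL =
            Pattern.compile("[A-Za-z0-9](?:[A-Za-z0-9-]*[A-Za-z0-9])?");

    // Assumed extension: any Unicode letter (\p{L}) or number (\p{N})
    // in place of the ASCII Alpha/Alphanumeric classes.
    static final Pattern UNICODE_LABEL =
            Pattern.compile("[\\p{L}\\p{N}](?:[\\p{L}\\p{N}-]*[\\p{L}\\p{N}])?");

    static boolean isAsciiLabel(String label) {
        return ASCII_LABEL.matcher(label).matches();
    }

    static boolean isUnicodeLabel(String label) {
        return UNICODE_LABEL.matcher(label).matches();
    }

    public static void main(String[] args) {
        String label = "d\u00fcsseldorf";
        System.out.println("ASCII-only:    " + isAsciiLabel(label));
        System.out.println("Unicode-aware: " + isUnicodeLabel(label));
    }
}
```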

> UrlValidator rejects url with german umlaut
> -------------------------------------------
>
>                 Key: VALIDATOR-235
>                 URL: https://issues.apache.org/jira/browse/VALIDATOR-235
>             Project: Commons Validator
>          Issue Type: Bug
>    Affects Versions: 1.3.1 Release
>            Reporter: Brian Preuß
>
> e.g. http://www.düsseldorf.de/



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
