[ http://nagoya.apache.org/jira/browse/XERCESC-541?page=history ]

Alberto Massari updated XERCESC-541:
------------------------------------

    Priority: Major

> Regular Expressions : \w incorrectly matching punctuation characters
> --------------------------------------------------------------------
>
>          Key: XERCESC-541
>          URL: http://nagoya.apache.org/jira/browse/XERCESC-541
>      Project: Xerces-C++
>         Type: Bug
>   Components: Validating Parser (Schema) (Xerces 1.5 or up only)
>     Versions: 1.7.0
>  Environment: Operating System: Other
> Platform: PC
>     Reporter: Richard Schofield
>     Assignee: Xerces-C Developers Mailing List

>
> The XML Schema Spec Part 2 (Appendix F) defines the multi-charcater escapes 
> which can be used in regular expression matching.
> \w should match all characters EXCEPT the set of "punctuation", "separator" 
> and "other" characters as defined by the unicode specification.
> However, \w sets up a range which matches all characters between x0020 and 
> xD7FF (gXMLChars). This range results in the punctuation, separator and other 
> characters being matched incorrectly.

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
   http://nagoya.apache.org/jira/secure/Administrators.jspa
-
If you want more information on JIRA, or have a bug to report see:
   http://www.atlassian.com/software/jira


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to