[ http://nagoya.apache.org/jira/browse/XERCESC-541?page=history ]
Alberto Massari updated XERCESC-541:
------------------------------------
Priority: Major
> Regular Expressions : \w incorrectly matching punctuation characters
> --------------------------------------------------------------------
>
> Key: XERCESC-541
> URL: http://nagoya.apache.org/jira/browse/XERCESC-541
> Project: Xerces-C++
> Type: Bug
> Components: Validating Parser (Schema) (Xerces 1.5 or up only)
> Versions: 1.7.0
> Environment: Operating System: Other
> Platform: PC
> Reporter: Richard Schofield
> Assignee: Xerces-C Developers Mailing List
>
> The XML Schema Spec Part 2 (Appendix F) defines the multi-charcater escapes
> which can be used in regular expression matching.
> \w should match all characters EXCEPT the set of "punctuation", "separator"
> and "other" characters as defined by the unicode specification.
> However, \w sets up a range which matches all characters between x0020 and
> xD7FF (gXMLChars). This range results in the punctuation, separator and other
> characters being matched incorrectly.
--
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
http://nagoya.apache.org/jira/secure/Administrators.jspa
-
If you want more information on JIRA, or have a bug to report see:
http://www.atlassian.com/software/jira
---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]