[ http://nagoya.apache.org/jira/browse/XERCESC-541?page=history ]
Alberto Massari updated XERCESC-541:
------------------------------------

    Priority: Major

> Regular Expressions : \w incorrectly matching punctuation characters
> --------------------------------------------------------------------
>
>          Key: XERCESC-541
>          URL: http://nagoya.apache.org/jira/browse/XERCESC-541
>      Project: Xerces-C++
>         Type: Bug
>   Components: Validating Parser (Schema) (Xerces 1.5 or up only)
>     Versions: 1.7.0
>  Environment: Operating System: Other
>               Platform: PC
>     Reporter: Richard Schofield
>     Assignee: Xerces-C Developers Mailing List
>
> The XML Schema Spec Part 2 (Appendix F) defines the multi-character escapes
> which can be used in regular expression matching.
> \w should match all characters EXCEPT the set of "punctuation", "separator"
> and "other" characters as defined by the Unicode specification.
> However, \w sets up a range which matches all characters between x0020 and
> xD7FF (gXMLChars). This range results in the punctuation, separator and other
> characters being matched incorrectly.
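
To illustrate the difference described in the report, here is a minimal C++ sketch (assuming C++14 or later) that contrasts the single range x0020..xD7FF with a category-based \w test. It is not Xerces-C++ code: the helper names are hypothetical, and the category table is hand-written and covers ASCII only. A real fix would need Unicode general-category data for the full character range.

```cpp
// Hypothetical helpers for illustration only; not part of Xerces-C++.

// The behaviour described in the report: \w treated as the single
// range x0020..xD7FF.
constexpr bool inReportedRange(char32_t c) {
    return c >= 0x0020 && c <= 0xD7FF;
}

// ASCII code points whose Unicode general category is P (punctuation).
// Characters such as '$', '+', '<', '=', '>', '^', '`', '|' and '~' are
// in category S (symbol), so they are deliberately absent here.
constexpr bool isAsciiPunctuation(char32_t c) {
    switch (c) {
        case U'!': case U'"': case U'#': case U'%': case U'&':
        case U'\'': case U'(': case U')': case U'*': case U',':
        case U'-': case U'.': case U'/': case U':': case U';':
        case U'?': case U'@': case U'[': case U'\\': case U']':
        case U'_': case U'{': case U'}':
            return true;
        default:
            return false;
    }
}

// ASCII separators (category Z: only the space) and ASCII "other"
// characters (category C: the control codes 0x00..0x1F and 0x7F).
constexpr bool isAsciiSeparatorOrOther(char32_t c) {
    return c <= 0x20 || c == 0x7F;
}

// \w as the report describes it (everything except punctuation,
// separator and other characters), restricted to ASCII input.
constexpr bool isSchemaWordCharAscii(char32_t c) {
    return c <= 0x7F && !isAsciiPunctuation(c) && !isAsciiSeparatorOrOther(c);
}

// The comma and the space fall inside the reported range, but \w
// should reject both.
static_assert(inReportedRange(U',') && !isSchemaWordCharAscii(U','),
              "comma is punctuation");
static_assert(inReportedRange(U' ') && !isSchemaWordCharAscii(U' '),
              "space is a separator");

// Letters, digits and symbols remain word characters.
static_assert(isSchemaWordCharAscii(U'a'), "letter");
static_assert(isSchemaWordCharAscii(U'7'), "digit");
static_assert(isSchemaWordCharAscii(U'+'), "math symbol, category S");
```

Under this definition the underscore is also rejected, because its general category is Pc (connector punctuation). This differs from the \w of many other regular expression dialects.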