[
https://issues.apache.org/jira/browse/XERCESC-1816?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12613584#action_12613584
]
John Snelson commented on XERCESC-1816:
---------------------------------------
Maybe mistakenly, I was under the impression that the non-XML Schema mode of
the RegularExpression class implemented the XPath 2.0 / XQuery regular
expression syntax:
http://www.w3.org/TR/xpath-functions/#regex-syntax
So my questions are:
1) Does the non-XML Schema mode implement Perl semantics or XQuery semantics?
As far as I can see, Xerces-C itself only ever uses XML Schema mode.
2) If non-XML Schema mode is meant to be Perl semantics, are there objections
to adding an XQuery mode?
> Multi-character escape classes don't work correctly in regular expressions
> --------------------------------------------------------------------------
>
> Key: XERCESC-1816
> URL: https://issues.apache.org/jira/browse/XERCESC-1816
> Project: Xerces-C++
> Issue Type: Bug
> Components: Validating Parser (XML Schema)
> Affects Versions: 2.8.0, 3.0.0
> Reporter: John Snelson
>
> The regular expressions "\i", "\I", "\c" and "\C" do not work as specified in
> the XML Schema specification:
> http://www.w3.org/TR/xmlschema-2/#nt-MultiCharEsc
> In fact, "\I" and "\C" cause an infinite loop during the parsing of the
> regular expression, "\i" seems to only match the letter "i", and "\c" gives
> the error:
> A character in U+0040-U+005f must follow '\c'.
> I'd be happy to attempt to fix this bug, but I need some guidance as to what
> the code for "\c" is actually meant to be doing.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]