[ 
https://issues.apache.org/jira/browse/XERCESJ-1383?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Richard Kelly updated XERCESJ-1383:
-----------------------------------

    Attachment: test.xml
                CharacterNormalizer.zip

This archive contains the Character Normalization XNI component, the ICU4J 
normalizer component and patches needed to use the component.

DOM Level 3 Core: "normalize-characters" [1] 
DOM Level 3 Core: "check-character-normalization" [2] 
SAX2: "unicode-normalization-checking" [3] 

Once compiled, the following commands can be used demonstrate this component 
with the supplied "test.xml" file:
 
java -classpath xercesSamples.jar;xercesImpl.jar;icu4j.jar sax.Counter -u 
test.xml
java -classpath xercesSamples.jar;xercesImpl.jar;icu4j.jar;xml-apis.jar 
dom.DOM3 test.xml

[1] 
http://www.w3.org/TR/DOM-Level-3-Core/core.html#parameter-normalize-characters 
[2] 
http://www.w3.org/TR/DOM-Level-3-Core/core.html#parameter-check-character-normalization
 
[3] http://www.saxproject.org/apidoc/org/xml/sax/package-summary.html 


> Adding Unicode Normalization support to Xerces2-J 
> --------------------------------------------------
>
>                 Key: XERCESJ-1383
>                 URL: https://issues.apache.org/jira/browse/XERCESJ-1383
>             Project: Xerces2-J
>          Issue Type: New Feature
>          Components: DOM (Level 3 Core), SAX
>    Affects Versions: 2.9.1
>         Environment: All
>            Reporter: Richard Kelly
>            Assignee: Michael Glavassevich
>         Attachments: CharacterNormalizer.java, CharacterNormalizer.patch, 
> CharacterNormalizer.zip, test.xml
>
>
> This feature will add support for Unicode character normalization and 
> normalization checking to Xerces.  Applications that use Xerces will be able 
> to produce fully normalized XML documents and verify that any XML documents 
> they process are fully normalised. 
> Adding this functionality will allow Xerces to meet the XML 1.1 W3C 
> Recommendation regarding character normalization and allow it to implement 
> the optional character normalization and normalization checking features 
> specified in the DOM Level 3 Core and SAX2.
> More specifically, the features to be implemented are:
> DOM Level 3 Core: "normalize-characters" [1]
> DOM Level 3 Core: "check-character-normalization" [2]
> SAX2: "unicode-normalization-checking" [3]
> [1] 
> http://www.w3.org/TR/DOM-Level-3-Core/core.html#parameter-normalize-characters
> [2] 
> http://www.w3.org/TR/DOM-Level-3-Core/core.html#parameter-check-character-normalization
> [3] http://www.saxproject.org/apidoc/org/xml/sax/package-summary.html

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to