On 08/01/2024 16:28, Eric J. Schwarzenbach wrote:
Hello, Does anybody have a patch for https://issues.apache.org/jira/browse/XALANJ-2560 That Xalan produces invalid XML with some utf-8 characters seems rather serious. I find putting 💻 or the literal character it represents into an XML document and running it through any XML-to-XML transform results in it being replaced with �� in the output which evidently makes the XML invalid. I tried a change to ToStream.java from https://issues.apache.org/jira/browse/XALANJ-2419 with the source of Xalan 2.7.3 but it did not help.
Use Saxon, perhaps, or see whether https://stackoverflow.com/a/74245232/252228 helps for patching Xalan.