This is an automated email from the ASF dual-hosted git repository.

tallison pushed a commit to branch branch_1x
in repository https://gitbox.apache.org/repos/asf/tika.git


The following commit(s) were added to refs/heads/branch_1x by this push:
     new 35d87d1  TIKA-2580 via Ewan Mellor.
35d87d1 is described below

commit 35d87d1f0a1594ea6c738dc8f50d51d0dad09501
Author: tballison <talli...@mitre.org>
AuthorDate: Thu Feb 22 09:32:35 2018 -0500

    TIKA-2580 via Ewan Mellor.
---
 tika-core/src/main/java/org/apache/tika/sax/SafeContentHandler.java | 3 ++-
 1 file changed, 2 insertions(+), 1 deletion(-)

diff --git a/tika-core/src/main/java/org/apache/tika/sax/SafeContentHandler.java b/tika-core/src/main/java/org/apache/tika/sax/SafeContentHandler.java
index d3152c6..9f2be69 100644
--- a/tika-core/src/main/java/org/apache/tika/sax/SafeContentHandler.java
+++ b/tika-core/src/main/java/org/apache/tika/sax/SafeContentHandler.java
@@ -31,7 +31,8 @@ import org.xml.sax.helpers.AttributesImpl;
  * ({@link #characters(char[], int, int)} or
  * {@link #ignorableWhitespace(char[], int, int)}) passed to the decorated
  * content handler contain only valid XML characters. All invalid characters
- * are replaced with spaces.
+ * are replaced with the Unicode replacement character U+FFFD (though a
+ * subclass may change this by overriding the {@link #writeReplacement(Output)}  method).
  * <p>
  * The XML standard defines the following Unicode character ranges as
  * valid XML characters:

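The Javadoc change above says that invalid XML characters are now replaced with U+FFFD rather than with spaces. As an illustration only, the following is a minimal, self-contained sketch of that idea. It is not Tika's implementation and does not use any Tika API: the class name XmlCharSanitizer and the methods isValidXml and sanitize are hypothetical, and the valid ranges are taken from the Char production of the XML 1.0 specification.

```java
// Hypothetical helper, not part of Tika: replaces every code point that is
// not a valid XML 1.0 character with the Unicode replacement character.
final class XmlCharSanitizer {

    // U+FFFD, the replacement character named in the updated Javadoc.
    static final char REPLACEMENT = (char) 0xFFFD;

    // Valid ranges per the XML 1.0 Char production:
    // #x9 | #xA | #xD | [#x20-#xD7FF] | [#xE000-#xFFFD] | [#x10000-#x10FFFF]
    static boolean isValidXml(int codePoint) {
        return codePoint == 0x9
                || codePoint == 0xA
                || codePoint == 0xD
                || (codePoint >= 0x20 && codePoint <= 0xD7FF)
                || (codePoint >= 0xE000 && codePoint <= 0xFFFD)
                || (codePoint >= 0x10000 && codePoint <= 0x10FFFF);
    }

    // Walks the input by code point so that a well-formed surrogate pair is
    // treated as one supplementary character, while an unpaired surrogate is
    // treated as invalid and replaced.
    static String sanitize(String input) {
        StringBuilder out = new StringBuilder(input.length());
        int i = 0;
        while (i < input.length()) {
            int codePoint = input.codePointAt(i);
            if (isValidXml(codePoint)) {
                out.appendCodePoint(codePoint);
            } else {
                out.append(REPLACEMENT);
            }
            i += Character.charCount(codePoint);
        }
        return out.toString();
    }

    public static void main(String[] args) {
        // A NUL character is not valid XML, so it is replaced.
        String cleaned = sanitize("a" + (char) 0 + "b");
        System.out.println(cleaned.charAt(1) == REPLACEMENT);
    }
}
```

In Tika itself the replacement is written by SafeContentHandler, and per the Javadoc a subclass can change it by overriding writeReplacement(Output); the sketch above only mirrors the default behaviour described in this commit.
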
-- 
To stop receiving notification emails like this one, please contact
talli...@apache.org.