This is an automated email from the ASF dual-hosted git repository.

lfcnassif pushed a commit to branch 2.x
in repository https://gitbox.apache.org/repos/asf/tika.git

The following commit(s) were added to refs/heads/2.x by this push:
       new  73147a2   update javadoc for Latin1StringsParser
73147a2 is described below

commit 73147a23913f777d6bc5dff62633ad41970f0a4c
Author: lfcnassif <[email protected]>
AuthorDate: Wed Apr 19 20:20:13 2017 -0300

    update javadoc for Latin1StringsParser
---
 .../apache/tika/parser/strings/Latin1StringsParser.java   | 15 ++++++++++-----
 1 file changed, 10 insertions(+), 5 deletions(-)

diff --git 
a/tika-parser-modules/tika-parser-text-module/src/main/java/org/apache/tika/parser/strings/Latin1StringsParser.java
 
b/tika-parser-modules/tika-parser-text-module/src/main/java/org/apache/tika/parser/strings/Latin1StringsParser.java
index 5c6fb46..b2ea40b 100644
--- 
a/tika-parser-modules/tika-parser-text-module/src/main/java/org/apache/tika/parser/strings/Latin1StringsParser.java
+++ 
b/tika-parser-modules/tika-parser-text-module/src/main/java/org/apache/tika/parser/strings/Latin1StringsParser.java
@@ -28,13 +28,18 @@ import org.xml.sax.ContentHandler;
 import org.xml.sax.SAXException;
 
 /**
- * Parser to extract printable Latin1 strings from arbitrary files with pure
- * java. Useful for binary or unknown files, for files without a specific 
parser
- * and for corrupted ones causing a TikaException as a fallback parser.
- * 
+ * Parser to extract printable Latin1 strings from arbitrary files with pure 
java
+ * without running any external process. Useful for binary or unknown files, 
for
+ * files without a specific parser and for corrupted ones causing a 
TikaException
+ * as a fallback parser. To enable the parsing of unknown or files without a
+ * specific parser with AutoDetectParser:
+ * <p>
+ * AutoDetectParser parser = new AutoDetectParser();
+ * parser.setFallback(new Latin1StringsParser());
+ * </p>
  * Currently the parser does a best effort to extract Latin1 strings, used by
  * Western European languages, encoded with ISO-8859-1, UTF-8 or UTF-16 
charsets
- * within the same file.
+ * mixed within the same file.
  * 
  * The implementation is optimized for fast parsing with only one pass.
  */

-- 
To stop receiving notification emails like this one, please contact
['"[email protected]" <[email protected]>'].

Reply via email to