[ https://issues.apache.org/jira/browse/TIKA-2273?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15881341#comment-15881341 ]
Hudson commented on TIKA-2273:
------------------------------
SUCCESS: Integrated in Jenkins build Tika-trunk #1209 (See [https://builds.apache.org/job/Tika-trunk/1209/])
TIKA-2273 -- two tests turned off temporarily in bundle. First draft of
(tallison: rev 6d022be03b5423f6c036e1aa45e4ce02a9678462)
* (edit) tika-core/src/main/java/org/apache/tika/detect/EncodingDetector.java
* (edit) tika-core/src/main/java/org/apache/tika/parser/DefaultParser.java
* (edit) tika-parsers/src/main/java/org/apache/tika/parser/microsoft/JackcessExtractor.java
* (edit) tika-core/src/main/java/org/apache/tika/config/TikaConfig.java
* (edit) tika-parsers/src/main/java/org/apache/tika/parser/envi/EnviHeaderParser.java
* (edit) tika-core/src/main/java/org/apache/tika/extractor/EmbeddedDocumentUtil.java
* (add) tika-core/src/main/java/org/apache/tika/parser/AbstractEncodingDetectorParser.java
* (add) tika-core/src/main/java/org/apache/tika/detect/DefaultEncodingDetector.java
* (edit) tika-parsers/src/main/java/org/apache/tika/parser/txt/Icu4jEncodingDetector.java
* (edit) tika-parsers/src/main/java/org/apache/tika/parser/html/HtmlParser.java
* (add) tika-core/src/main/java/org/apache/tika/detect/CompositeEncodingDetector.java
* (edit) tika-parsers/src/main/java/org/apache/tika/parser/isatab/ISATabUtils.java
* (edit) tika-parsers/src/main/java/org/apache/tika/parser/microsoft/OutlookExtractor.java
* (add) tika-parsers/src/test/resources/org/apache/tika/config/TIKA-2273-blacklist-encoding-detector-default.xml
* (edit) tika-parsers/src/main/java/org/apache/tika/parser/txt/TXTParser.java
* (edit) tika-bundle/src/test/java/org/apache/tika/bundle/BundleIT.java
* (add) tika-parsers/src/test/java/org/apache/tika/config/TikaEncodingDetectorTest.java
* (add) tika-parsers/src/test/resources/org/apache/tika/config/TIKA-2273-parameterize-encoding-detector.xml
* (edit) tika-core/src/main/java/org/apache/tika/detect/AutoDetectReader.java
* (add) tika-core/src/main/java/org/apache/tika/detect/NonDetectingEncodingDetector.java
* (edit) tika-core/src/test/java/org/apache/tika/TikaTest.java
* (edit) tika-parsers/src/test/java/org/apache/tika/parser/txt/TXTParserTest.java
* (add) tika-parsers/src/test/resources/org/apache/tika/config/TIKA-2273-no-icu4j-encoding-detector.xml
* (edit) tika-parsers/src/main/java/org/apache/tika/parser/chm/ChmParser.java
* (edit) tika-parsers/src/main/java/org/apache/tika/parser/code/SourceCodeParser.java
> Enable configuration of EncodingDetectors via TikaConfig
> --------------------------------------------------------
>
> Key: TIKA-2273
> URL: https://issues.apache.org/jira/browse/TIKA-2273
> Project: Tika
> Issue Type: Improvement
> Reporter: Tim Allison
> Priority: Minor
> Attachments: TIKA_2273_first_draft.patch
>
>
> It would be nice to allow easier configuration of encoding detectors. It
> should be straightforward to follow the example of detectors... (famous last
> words).
--
This message was sent by Atlassian JIRA (v6.3.15#6346)