This is an automated email from the ASF dual-hosted git repository.

tallison pushed a change to branch main
in repository https://gitbox.apache.org/repos/asf/tika.git


    from 1bf0255429 TIKA-4671 - git add
     new 1a236d2a4e Revert "TIKA-4671 - git add"
     new c4d67657a0 Revert "TIKA-4671 - language aware charset detection"

The 2 revisions listed above as "new" are entirely new to this
repository and will be described in separate emails.  The revisions
listed as "add" were already present in the repository and have only
been added to this reference.


Summary of changes:
 docs/modules/ROOT/nav.adoc                         |   1 -
 docs/modules/ROOT/pages/advanced/index.adoc        |   1 -
 docs/modules/ROOT/pages/advanced/tika-eval.adoc    | 294 ---------------------
 .../tika/detect/CompositeEncodingDetector.java     | 155 ++---------
 .../tika/detect/DefaultEncodingDetector.java       |  27 +-
 .../tika/detect/EncodingDetectorContext.java       | 105 --------
 .../apache/tika/detect/MetaEncodingDetector.java   |  39 ---
 .../tika/language/detect/LanguageResult.java       |  28 --
 .../apache/tika/metadata/TikaCoreProperties.java   |   8 -
 .../charsoup/CharSoupEncodingDetector.java         | 186 -------------
 .../charsoup/CharSoupLanguageDetector.java         |  70 +----
 .../charsoup/CharSoupEncodingDetectorTest.java     | 183 -------------
 .../tika-parsers-standard-package/pom.xml          |   6 -
 .../tika/config/TikaEncodingDetectorTest.java      |  31 +--
 .../testArabicMisleadingCharset.html               |  11 -
 15 files changed, 33 insertions(+), 1112 deletions(-)
 delete mode 100644 docs/modules/ROOT/pages/advanced/tika-eval.adoc
 delete mode 100644 
tika-core/src/main/java/org/apache/tika/detect/EncodingDetectorContext.java
 delete mode 100644 
tika-core/src/main/java/org/apache/tika/detect/MetaEncodingDetector.java
 delete mode 100644 
tika-langdetect/tika-langdetect-charsoup/src/main/java/org/apache/tika/langdetect/charsoup/CharSoupEncodingDetector.java
 delete mode 100644 
tika-langdetect/tika-langdetect-charsoup/src/test/java/org/apache/tika/langdetect/charsoup/CharSoupEncodingDetectorTest.java
 delete mode 100644 
tika-parsers/tika-parsers-standard/tika-parsers-standard-package/src/test/resources/test-documents/testArabicMisleadingCharset.html

Reply via email to