This is an automated email from the ASF dual-hosted git repository.
tallison pushed a change to branch TIKA-3226
in repository https://gitbox.apache.org/repos/asf/tika.git.
from 1dd15d9 TIKA-3226 -- WIP don't merge
add 4f90793 TIKA-3244: update spring
add 1141c5b TIKA-3244: update xerces, cxf and mockito
add 35ca575 TIKA-3244: update update joda, jackrabbit and jackson
add 8f66ce7 update copyright and ci management urls. thanks to lewismc
for identifying these.
add 24d801e Tika 3272 - Remove usage of rotation.py and Python dependency
(#397)
add 4768c67 TIKA-3244: update ctakes
new f8288e9 Merge remote-tracking branch 'origin/main' into TIKA-3226
new aedbbc2 TIKA-3226 -- unit tests pass
The 2 revisions listed above as "new" are entirely new to this
repository and will be described in separate emails. The revisions
listed as "add" were already present in the repository and have only
been added to this reference.
Summary of changes:
NOTICE.txt | 2 +-
pom.xml | 2 +-
tika-parent/pom.xml | 16 +-
.../apache/tika/parser/ocr/ImagePreprocessor.java | 240 ++++++++-------------
.../apache/tika/parser/ocr/TesseractOCRConfig.java | 31 +--
.../apache/tika/parser/ocr/TesseractOCRParser.java | 18 +-
.../apache/tika/parser/ocr/tess4j/ImageDeskew.java | 153 +++++++++++++
.../apache/tika/parser/ocr/tess4j/ImageUtil.java | 113 ++++++++++
.../tika/parser/ocr/TesseractOCRConfig.properties | 9 +-
.../org/apache/tika/parser/ocr/rotation.py | 73 -------
.../tika/parser/ocr/TesseractOCRParserTest.java | 23 +-
.../{testRotated.png => testRotated+10.png} | Bin
.../resources/test-documents/testRotated-10.png | Bin 0 -> 674259 bytes
.../tika/config/TikaTranslatorConfigTest.java | 5 +-
.../org/apache/tika/server/client/TestBasic.java | 28 ++-
...itter.xml => tika-config-simple-fs-emitter.xml} | 2 +-
.../tika/server/core/resource/TikaResource.java | 4 +-
.../apache/tika/server/core/StackTraceOffTest.java | 2 +-
18 files changed, 427 insertions(+), 294 deletions(-)
create mode 100644
tika-parsers/tika-parsers-classic/tika-parsers-classic-modules/tika-parser-ocr-module/src/main/java/org/apache/tika/parser/ocr/tess4j/ImageDeskew.java
create mode 100644
tika-parsers/tika-parsers-classic/tika-parsers-classic-modules/tika-parser-ocr-module/src/main/java/org/apache/tika/parser/ocr/tess4j/ImageUtil.java
delete mode 100644
tika-parsers/tika-parsers-classic/tika-parsers-classic-modules/tika-parser-ocr-module/src/main/resources/org/apache/tika/parser/ocr/rotation.py
rename
tika-parsers/tika-parsers-classic/tika-parsers-classic-modules/tika-parser-ocr-module/src/test/resources/test-documents/{testRotated.png
=> testRotated+10.png} (100%)
create mode 100644
tika-parsers/tika-parsers-classic/tika-parsers-classic-modules/tika-parser-ocr-module/src/test/resources/test-documents/testRotated-10.png
rename
tika-server/tika-server-client/src/test/resources/{tika-config-simple-solr-emitter.xml
=> tika-config-simple-fs-emitter.xml} (97%)