This is an automated email from the ASF dual-hosted git repository.
tallison pushed a change to branch TIKA-4522
in repository https://gitbox.apache.org/repos/asf/tika.git
from b06877bce TIKA-4512 -- clean up dependencies in file-system-fetcher
add a9854028b TIKA-4327: update aws, netty
add f370d845f TIKA-4327: revert netty update
add b8b869e89 TIKA-4327: update azure-identity, reactor
add 8b0dd1fa1 TIKA-4517 -- fix windows
add 17826b53e TIKA-4518 (#2366)
add e4da37eba TIKA-4518 -- escape quotes in filenames for better cross
platform robustness
add 0983c54e3 TIKA-4327: update aws, netty
add cc45d250b TIKA-4327: update junrar
add 307248b48 TIKA-4327: update junrar
add 64c7d9020 TIKA-4327: update aws
add 0bc03abab TIKA-4524: migrate to aws v2
add 7a614f669 TIKA-4524: try using requestChecksumCalculation instead of
checksum
add f0f109cb2 TIKA-4327: update websockets
add b4314bdf6 TIKA-4327: update aws
add 6e3bdd9ba TIKA-4524: remove commented code
add 5af2992bd TIKA-4327: update rat; prepare for testcontainers 2
add 8d4b6e824 TIKA-4504: add exclusions
add d217fa897 TIKA-4504: add exclusions
add 7c5260f3f TIKA-4488: avoid shaded testcontainers classes that are gone
in v2
add a76b0f591 TIKA-4504: add exclusions
add bf023d288 TIKA-4327: update testcontainers to v2
add a78ad6048 TIKA-4327: update spring
add 879d1a34d TIKA-4488: avoid shaded testcontainers classes
add 6800faa44 TIKA-4525: migrate to aws v2
add d21a66fb7 TIKA-4525: migrate to aws v2
add db392986f TIKA-4525: migrate to aws v2
add efb956ad8 Bump pdfbox.version from 3.0.5 to 3.0.6 (#2372)
add 764d1af5e Bump org.codehaus.mojo:exec-maven-plugin from 3.6.1 to 3.6.2
(#2371)
add 1dfd9cbb2 Bump com.puppycrawl.tools:checkstyle from 12.0.1 to 12.1.0
(#2369)
add 5d6ede477 TIKA-4525: minor cosmetic fixes
add 7f10ce33c TIKA-4525: migrate to aws v2
add 3c0c67cdd TIKA-4525: migrate to aws v2
add af0990ae3 TIKA-4525: restore exception handling (should have read the
comment!)
add 7223eebaf TIKA-4525: migrate to aws v2
add 2ddcb0e5b TIKA-4525: remove aws v1
add 30fd3cd93 TIKA-4525: restore aws v1
add f24204d28 TIKA-4526: Fix nondeterministic failures in
TranslateResourceTest by splitting @PUT/@POST handlers (#2368)
add 9d30530d3 TIKA-4525: restore aws v1
add 8ba427a4c clarify warning in PipesClient
add d659dfc90 remove shade-plugin where possible (#2373)
add c9b9770ac TIKA-4515 bug fixes (#2374)
add dbcc6ac08 TIKA-4327: update aws, google cloud, azure, zstd,
error_prone_annotations, maven antrun, opennlp
add 1acafa29f TIKA-4327: update aws, spring
add 23eee9221 TIKA-4525: close inputstream
add 6af7a55e1 TIKA-4525: remove workaround that is only needed for
putObject
add fb76290e3 TIKA-4525: migrate to aws v2
add 6b0548f1b TIKA-4525: simplify getObject, as suggested by ChatGPT
add f53ab820e TIKA-4525: set default values, remove <pipesIterators>, add
credentialsProvider
add 612d0ed0e TIKA-4525: set default values, remove <pipesIterators>, add
credentialsProvider, fix name
add 5c3c2d581 TIKA-4525: set default values, add comments
add 22187709d TIKA-4327: update microsoft graph
add 08b48fa16 TIKA-4327: update aws
add 452ff85b0 TIKA-4327: update quartz, plugin annotations
add 55243349d TIKA-4525: remove obsolete entry
add 14b0c94ad TIKA-4327: update jackcess
add a1f004fcb Bump com.puppycrawl.tools:checkstyle from 12.1.0 to 12.1.1
(#2375)
add e15e9f22e avoid npe when extracting javascript from names tree
new d173308da Merge branch 'main' into TIKA-4519
The 1 revisions listed above as "new" are entirely new to this
repository and will be described in separate emails. The revisions
listed as "add" were already present in the repository and have only
been added to this reference.
Summary of changes:
CHANGES.txt | 14 +--
.../src/main/java/org/apache/tika/cli/TikaCLI.java | 4 +-
.../test/java/org/apache/tika/cli/TikaCLITest.java | 31 ++++-
.../src/test/resources/test-data}/testPST.pst | Bin
.../extractor/DefaultEmbeddedStreamTranslator.java | 21 ++--
.../tika/extractor/EmbeddedStreamTranslator.java | 8 +-
.../apache/tika/extractor/RUnpackExtractor.java | 36 ++++--
.../java/org/apache/tika/io/FilenameUtils.java | 31 ++++-
.../java/org/apache/tika/io/FilenameUtilsTest.java | 8 +-
tika-detectors/tika-detector-magika/pom.xml | 36 ------
tika-detectors/tika-detector-siegfried/pom.xml | 36 ------
.../tika/pipes/kafka/tests/TikaPipesKafkaTest.java | 2 +-
.../tika-pipes-s3-integration-tests/pom.xml | 5 +
.../tika/pipes/s3/tests/PipeIntegrationTests.java | 49 +++++---
.../tika/pipes/s3/tests/S3PipeIntegrationTest.java | 77 +++++++------
.../src/test/resources/tika-config-s3ToFs.xml | 21 ++--
.../src/test/resources/tika-config-s3Tos3.xml | 23 ++--
.../pipes/solr/tests/TikaPipesSolrTestBase.java | 2 +-
tika-parent/pom.xml | 66 ++++++-----
tika-parsers/pom.xml | 2 +-
.../tika-parser-scientific-package/pom.xml | 47 --------
.../tika-parser-sqlite3-package/pom.xml | 41 -------
tika-parsers/tika-parsers-ml/pom.xml | 2 +
.../microsoft/MSEmbeddedStreamTranslator.java | 39 +++----
.../microsoft/PSTEmailStreamTranslator.java | 55 +++++++++
....apache.tika.extractor.EmbeddedStreamTranslator | 3 +-
.../apache/tika/parser/pdf/AbstractPDF2XHTML.java | 3 +
.../org/apache/tika/async/cli/TikaAsyncCLI.java | 2 +-
.../tika/async/cli/TikaConfigAsyncWriter.java | 8 +-
.../tika-emitters/tika-emitter-az-blob/pom.xml | 43 -------
tika-pipes/tika-emitters/tika-emitter-gcs/pom.xml | 43 -------
.../tika-emitters/tika-emitter-kafka/pom.xml | 43 -------
.../tika-emitters/tika-emitter-opensearch/pom.xml | 43 -------
tika-pipes/tika-emitters/tika-emitter-s3/pom.xml | 51 +--------
.../apache/tika/pipes/emitter/s3/S3Emitter.java | 111 +++++++++++-------
tika-pipes/tika-emitters/tika-emitter-solr/pom.xml | 43 -------
.../tika-fetchers/tika-fetcher-az-blob/pom.xml | 43 -------
tika-pipes/tika-fetchers/tika-fetcher-gcs/pom.xml | 43 -------
tika-pipes/tika-fetchers/tika-fetcher-http/pom.xml | 42 -------
.../tika-fetcher-microsoft-graph/pom.xml | 45 +-------
tika-pipes/tika-fetchers/tika-fetcher-s3/pom.xml | 55 ++-------
.../apache/tika/pipes/fetcher/s3/S3Fetcher.java | 127 ++++++++++++---------
.../org/apache/tika/pipes/core/PipesClient.java | 2 +-
.../AbstractEmbeddedDocumentBytesHandler.java | 37 +-----
.../tika-pipes-iterator-az-blob/pom.xml | 43 -------
.../tika-pipes-iterator-csv/pom.xml | 43 -------
.../tika-pipes-iterator-gcs/pom.xml | 43 -------
.../tika-pipes-iterator-jdbc/pom.xml | 43 -------
.../tika-pipes-iterator-json/pom.xml | 43 -------
.../tika-pipes-iterator-kafka/pom.xml | 43 -------
.../tika-pipes-iterator-s3/pom.xml | 51 +--------
.../pipes/pipesiterator/s3/S3PipesIterator.java | 97 ++++++++++------
.../tika-pipes-iterator-solr/pom.xml | 43 -------
.../tika-pipes-reporter-fs-status/pom.xml | 43 -------
.../tika-pipes-reporter-jdbc/pom.xml | 43 -------
.../tika-pipes-reporter-opensearch/pom.xml | 43 -------
tika-server/tika-server-client/pom.xml | 47 --------
.../server/core/resource/TranslateResource.java | 50 +++++---
.../server/core/resource/UnpackerResource.java | 27 ++---
tika-translate/pom.xml | 2 +-
60 files changed, 599 insertions(+), 1548 deletions(-)
copy
{tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-microsoft-module/src/test/resources/test-documents
=> tika-app/src/test/resources/test-data}/testPST.pst (100%)
create mode 100644
tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-microsoft-module/src/main/java/org/apache/tika/extractor/microsoft/PSTEmailStreamTranslator.java