(tika) branch dependabot/maven/org.xerial-sqlite-jdbc-3.45.2.0 deleted (was e65d52cb5)
This is an automated email from the ASF dual-hosted git repository. github-bot pushed a change to branch dependabot/maven/org.xerial-sqlite-jdbc-3.45.2.0 in repository https://gitbox.apache.org/repos/asf/tika.git was e65d52cb5 Bump org.xerial:sqlite-jdbc from 3.45.1.0 to 3.45.2.0 The revisions that were on this branch are still contained in other references; therefore, this change does not discard any commits from the repository.
(tika) branch main updated (49064dbe2 -> 846f3a080)
This is an automated email from the ASF dual-hosted git repository. tilman pushed a change to branch main in repository https://gitbox.apache.org/repos/asf/tika.git from 49064dbe2 Merge pull request #1652 from apache/dependabot/maven/aws.version-1.12.678 add e65d52cb5 Bump org.xerial:sqlite-jdbc from 3.45.1.0 to 3.45.2.0 add 846f3a080 Merge pull request #1655 from apache/dependabot/maven/org.xerial-sqlite-jdbc-3.45.2.0 No new revisions were added by this update. Summary of changes: tika-parent/pom.xml | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-)
(tika) branch dependabot/maven/org.xerial-sqlite-jdbc-3.45.2.0 created (now e65d52cb5)
This is an automated email from the ASF dual-hosted git repository. github-bot pushed a change to branch dependabot/maven/org.xerial-sqlite-jdbc-3.45.2.0 in repository https://gitbox.apache.org/repos/asf/tika.git at e65d52cb5 Bump org.xerial:sqlite-jdbc from 3.45.1.0 to 3.45.2.0 No new revisions were added by this update.
(tika) branch dependabot/maven/io.projectreactor-reactor-core-3.6.4 created (now 0dd5861ee)
This is an automated email from the ASF dual-hosted git repository. github-bot pushed a change to branch dependabot/maven/io.projectreactor-reactor-core-3.6.4 in repository https://gitbox.apache.org/repos/asf/tika.git at 0dd5861ee Bump io.projectreactor:reactor-core from 3.6.2 to 3.6.4 No new revisions were added by this update.
(tika) branch dependabot/maven/reactor.netty.version-1.1.17 created (now 23d26d770)
This is an automated email from the ASF dual-hosted git repository. github-bot pushed a change to branch dependabot/maven/reactor.netty.version-1.1.17 in repository https://gitbox.apache.org/repos/asf/tika.git at 23d26d770 Bump reactor.netty.version from 1.1.15 to 1.1.17 No new revisions were added by this update.
(tika) branch dependabot/maven/aws.version-1.12.678 deleted (was 3ffadd5a3)
This is an automated email from the ASF dual-hosted git repository. github-bot pushed a change to branch dependabot/maven/aws.version-1.12.678 in repository https://gitbox.apache.org/repos/asf/tika.git was 3ffadd5a3 Bump aws.version from 1.12.677 to 1.12.678 The revisions that were on this branch are still contained in other references; therefore, this change does not discard any commits from the repository.
(tika) branch main updated (3a7bbc50d -> 49064dbe2)
This is an automated email from the ASF dual-hosted git repository. tilman pushed a change to branch main in repository https://gitbox.apache.org/repos/asf/tika.git from 3a7bbc50d Merge pull request #1651 from apache/dependabot/maven/aws.version-1.12.677 add 3ffadd5a3 Bump aws.version from 1.12.677 to 1.12.678 add 49064dbe2 Merge pull request #1652 from apache/dependabot/maven/aws.version-1.12.678 No new revisions were added by this update. Summary of changes: tika-parent/pom.xml | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-)
(tika) branch dependabot/maven/com.fasterxml.jackson-jackson-bom-2.17.0 created (now be7640d53)
This is an automated email from the ASF dual-hosted git repository. github-bot pushed a change to branch dependabot/maven/com.fasterxml.jackson-jackson-bom-2.17.0 in repository https://gitbox.apache.org/repos/asf/tika.git at be7640d53 Bump com.fasterxml.jackson:jackson-bom from 2.16.2 to 2.17.0 No new revisions were added by this update.
(tika) branch dependabot/maven/aws.version-1.12.678 created (now 3ffadd5a3)
This is an automated email from the ASF dual-hosted git repository. github-bot pushed a change to branch dependabot/maven/aws.version-1.12.678 in repository https://gitbox.apache.org/repos/asf/tika.git at 3ffadd5a3 Bump aws.version from 1.12.677 to 1.12.678 No new revisions were added by this update.
(tika) branch TIKA-4207 updated: TIKA-4207 basically working
This is an automated email from the ASF dual-hosted git repository. tallison pushed a commit to branch TIKA-4207 in repository https://gitbox.apache.org/repos/asf/tika.git The following commit(s) were added to refs/heads/TIKA-4207 by this push: new 3e18b889d TIKA-4207 basically working 3e18b889d is described below commit 3e18b889ded54e746c4dbd25580d9ac8f73720cf Author: tallison AuthorDate: Tue Mar 12 19:52:26 2024 -0400 TIKA-4207 basically working --- .../java/org/apache/tika/cli/TikaCLIAsyncTest.java | 16 ++ .../test/java/org/apache/tika/cli/TikaCLITest.java | 2 - .../AbstractEmbeddedDocumentByteStore.java | 17 ++- .../extractor/ParsingAndEmbeddedDocExtractor.java | 162 - .../ParsingAndEmbeddedDocExtractorFactory.java | 40 - .../ParsingEmbeddedDocumentExtractor.java | 41 +- .../apache/tika/parser/RecursiveParserWrapper.java | 2 + .../java/org/apache/tika/pipes/FetchEmitTuple.java | 2 +- .../java/org/apache/tika/pipes/PipesServer.java| 144 +- .../extractor/EmbeddedDocumentBytesConfig.java | 73 +- .../extractor/EmbeddedDocumentEmitterStore.java| 18 ++- .../org/apache/tika/parser/mock/MockParser.java| 26 +--- .../org/apache/tika/pipes/PipesServerTest.java | 60 +++- ...rocessorTest.java => AsyncChaosMonkeyTest.java} | 2 +- .../resources/org/apache/tika/pipes/TIKA-4207.xml | 30 tika-pipes/tika-async-cli/pom.xml | 7 + .../apache/tika/async/cli/AsyncProcessorTest.java | 138 ++ .../apache/tika/async/cli/TikaAsyncCLITest.java| 2 +- .../test/resources/configs/TIKA-4207-emitter.xml | 35 + .../resources/{ => configs}/tika-config-broken.xml | 0 .../basic_embedded.xml}| 29 ++-- tika-pipes/tika-pipes-iterators/pom.xml| 1 + .../tika-pipes-iterator-json}/pom.xml | 43 +++--- .../pipesiterator/json/JsonPipesIterator.java | 65 + .../pipesiterator/json/TestJsonPipesIterator.java | 85 +++ .../test-documents/test-with-embedded-bytes.json | 100 + .../src/test/resources/test-documents/test.json| 100 + .../metadata/serialization/JsonFetchEmitTuple.java | 42 +- .../serialization/JsonFetchEmitTupleTest.java | 20 +++ 29 files changed, 940 insertions(+), 362 deletions(-) diff --git a/tika-app/src/test/java/org/apache/tika/cli/TikaCLIAsyncTest.java b/tika-app/src/test/java/org/apache/tika/cli/TikaCLIAsyncTest.java index 1f6c8fc2c..d9f6d053f 100644 --- a/tika-app/src/test/java/org/apache/tika/cli/TikaCLIAsyncTest.java +++ b/tika-app/src/test/java/org/apache/tika/cli/TikaCLIAsyncTest.java @@ -1,3 +1,19 @@ +/* + * Licensed to the Apache Software Foundation (ASF) under one or more + * contributor license agreements. See the NOTICE file distributed with + * this work for additional information regarding copyright ownership. + * The ASF licenses this file to You under the Apache License, Version 2.0 + * (the "License"); you may not use this file except in compliance with + * the License. You may obtain a copy of the License at + * + * http://www.apache.org/licenses/LICENSE-2.0 + * + * Unless required by applicable law or agreed to in writing, software + * distributed under the License is distributed on an "AS IS" BASIS, + * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. + * See the License for the specific language governing permissions and + * limitations under the License. + */ package org.apache.tika.cli; import static java.nio.charset.StandardCharsets.UTF_8; diff --git a/tika-app/src/test/java/org/apache/tika/cli/TikaCLITest.java b/tika-app/src/test/java/org/apache/tika/cli/TikaCLITest.java index c160db396..fa16e124a 100644 --- a/tika-app/src/test/java/org/apache/tika/cli/TikaCLITest.java +++ b/tika-app/src/test/java/org/apache/tika/cli/TikaCLITest.java @@ -30,9 +30,7 @@ import java.net.URI; import java.nio.file.Files; import java.nio.file.Path; -import org.apache.commons.io.FileUtils; import org.junit.jupiter.api.AfterEach; -import org.junit.jupiter.api.BeforeAll; import org.junit.jupiter.api.BeforeEach; import org.junit.jupiter.api.Test; import org.junit.jupiter.api.io.TempDir; diff --git a/tika-core/src/main/java/org/apache/tika/extractor/AbstractEmbeddedDocumentByteStore.java b/tika-core/src/main/java/org/apache/tika/extractor/AbstractEmbeddedDocumentByteStore.java index c435a3e6e..cbc1f3411 100644 --- a/tika-core/src/main/java/org/apache/tika/extractor/AbstractEmbeddedDocumentByteStore.java +++ b/tika-core/src/main/java/org/apache/tika/extractor/AbstractEmbeddedDocumentByteStore.java @@ -30,30 +30,31 @@ public abstract class AbstractEmbeddedDocumentByteStore implements EmbeddedDocum List ids = new ArrayList<>(); -public String getFetchKey(String containerFetchKey, int embeddedId, -
(tika) branch dependabot/maven/org.apache.zookeeper-zookeeper-3.9.2 deleted (was 5c915e42b)
This is an automated email from the ASF dual-hosted git repository. github-bot pushed a change to branch dependabot/maven/org.apache.zookeeper-zookeeper-3.9.2 in repository https://gitbox.apache.org/repos/asf/tika.git was 5c915e42b Bump org.apache.zookeeper:zookeeper from 3.8.3 to 3.9.2 The revisions that were on this branch are still contained in other references; therefore, this change does not discard any commits from the repository.
(tika) branch dependabot/maven/aws.version-1.12.677 deleted (was 84f0a5b7f)
This is an automated email from the ASF dual-hosted git repository. github-bot pushed a change to branch dependabot/maven/aws.version-1.12.677 in repository https://gitbox.apache.org/repos/asf/tika.git was 84f0a5b7f Bump aws.version from 1.12.676 to 1.12.677 The revisions that were on this branch are still contained in other references; therefore, this change does not discard any commits from the repository.
(tika) branch main updated (bdb6a4656 -> 3a7bbc50d)
This is an automated email from the ASF dual-hosted git repository. tilman pushed a change to branch main in repository https://gitbox.apache.org/repos/asf/tika.git from bdb6a4656 Merge pull request #1647 from apache/dependabot/maven/com.fasterxml.jackson-jackson-bom-2.16.2 add 84f0a5b7f Bump aws.version from 1.12.676 to 1.12.677 add 3a7bbc50d Merge pull request #1651 from apache/dependabot/maven/aws.version-1.12.677 No new revisions were added by this update. Summary of changes: tika-parent/pom.xml | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-)