(tika) branch dependabot/maven/aws.version-1.12.655 deleted (was 1cd7a7b3c)
This is an automated email from the ASF dual-hosted git repository. github-bot pushed a change to branch dependabot/maven/aws.version-1.12.655 in repository https://gitbox.apache.org/repos/asf/tika.git was 1cd7a7b3c Bump aws.version from 1.12.654 to 1.12.655 The revisions that were on this branch are still contained in other references; therefore, this change does not discard any commits from the repository.
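The note that deleting the branch "does not discard any commits" follows from reachability: a commit survives as long as some remaining reference can reach it through parent links. Below is a minimal sketch of that check, using the hashes reported in the merge notification for 87d3178b5. The class `BranchDeleteSketch`, its methods, and the map-based graph are hypothetical illustrations, not part of git or gitbox.

```java
import java.util.ArrayDeque;
import java.util.Arrays;
import java.util.Collection;
import java.util.Collections;
import java.util.Deque;
import java.util.HashMap;
import java.util.HashSet;
import java.util.List;
import java.util.Map;
import java.util.Set;

// Hypothetical model of a commit graph; this is not how git itself is implemented.
class BranchDeleteSketch {

    // Returns every commit reachable from the given ref tips by following parent links.
    static Set<String> reachable(Map<String, List<String>> parents, Collection<String> tips) {
        Set<String> seen = new HashSet<>();
        Deque<String> todo = new ArrayDeque<>(tips);
        while (!todo.isEmpty()) {
            String commit = todo.pop();
            if (seen.add(commit)) {
                // Commits without an entry are treated as having no known parents.
                todo.addAll(parents.getOrDefault(commit, Collections.emptyList()));
            }
        }
        return seen;
    }

    // The small graph from the notifications: 87d3178b5 merges d461ffc61 and 1cd7a7b3c.
    // Older history is left out of this sketch.
    static Map<String, List<String>> exampleGraph() {
        Map<String, List<String>> parents = new HashMap<>();
        parents.put("87d3178b5", Arrays.asList("d461ffc61", "1cd7a7b3c"));
        return parents;
    }

    public static void main(String[] args) {
        // After the dependabot branch is deleted, only main remains in this sketch.
        List<String> remainingTips = Collections.singletonList("87d3178b5");
        Set<String> kept = reachable(exampleGraph(), remainingTips);
        System.out.println("1cd7a7b3c still reachable: " + kept.contains("1cd7a7b3c"));
    }
}
```

Because main's tip is the merge commit, the former branch tip stays reachable after the branch ref is gone.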
(tika) branch main updated (d461ffc61 -> 87d3178b5)
This is an automated email from the ASF dual-hosted git repository.

tilman pushed a change to branch main
in repository https://gitbox.apache.org/repos/asf/tika.git

    from d461ffc61 Merge pull request #1584 from apache/dependabot/maven/org.apache.solr-solr-solrj-8.11.3
     add 1cd7a7b3c Bump aws.version from 1.12.654 to 1.12.655
     new 87d3178b5 Merge pull request #1583 from apache/dependabot/maven/aws.version-1.12.655

The 1 revisions listed above as "new" are entirely new to this repository and will be described in separate emails. The revisions listed as "add" were already present in the repository and have only been added to this reference.

Summary of changes:
 tika-parent/pom.xml | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)
(tika) 01/01: Merge pull request #1583 from apache/dependabot/maven/aws.version-1.12.655
This is an automated email from the ASF dual-hosted git repository.

tilman pushed a commit to branch main
in repository https://gitbox.apache.org/repos/asf/tika.git

commit 87d3178b51d17232d447a13c5453bf8e0cb8b2e5
Merge: d461ffc61 1cd7a7b3c
Author: Tilman Hausherr
AuthorDate: Fri Feb 9 07:57:38 2024 +0100

    Merge pull request #1583 from apache/dependabot/maven/aws.version-1.12.655

    Bump aws.version from 1.12.654 to 1.12.655

 tika-parent/pom.xml | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)
(tika) branch dependabot/maven/org.apache.solr-solr-solrj-8.11.3 deleted (was cb572342b)
This is an automated email from the ASF dual-hosted git repository. github-bot pushed a change to branch dependabot/maven/org.apache.solr-solr-solrj-8.11.3 in repository https://gitbox.apache.org/repos/asf/tika.git was cb572342b Bump org.apache.solr:solr-solrj from 8.11.2 to 8.11.3 The revisions that were on this branch are still contained in other references; therefore, this change does not discard any commits from the repository.
(tika) branch main updated (51605eeb4 -> d461ffc61)
This is an automated email from the ASF dual-hosted git repository.

tilman pushed a change to branch main
in repository https://gitbox.apache.org/repos/asf/tika.git

    from 51605eeb4 Merge pull request #1585 from apache/dependabot/maven/commons-codec-commons-codec-1.16.1
     add cb572342b Bump org.apache.solr:solr-solrj from 8.11.2 to 8.11.3
     add d461ffc61 Merge pull request #1584 from apache/dependabot/maven/org.apache.solr-solr-solrj-8.11.3

No new revisions were added by this update.

Summary of changes:
 tika-parent/pom.xml | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)
(tika) branch dependabot/maven/commons-codec-commons-codec-1.16.1 deleted (was bdc8fa1e2)
This is an automated email from the ASF dual-hosted git repository. github-bot pushed a change to branch dependabot/maven/commons-codec-commons-codec-1.16.1 in repository https://gitbox.apache.org/repos/asf/tika.git was bdc8fa1e2 Bump commons-codec:commons-codec from 1.16.0 to 1.16.1 The revisions that were on this branch are still contained in other references; therefore, this change does not discard any commits from the repository.
(tika) 01/01: Merge pull request #1585 from apache/dependabot/maven/commons-codec-commons-codec-1.16.1
This is an automated email from the ASF dual-hosted git repository.

tilman pushed a commit to branch main
in repository https://gitbox.apache.org/repos/asf/tika.git

commit 51605eeb41a084afc1bbb0466ce5cb3524f37858
Merge: 7925830b8 bdc8fa1e2
Author: Tilman Hausherr
AuthorDate: Fri Feb 9 07:57:16 2024 +0100

    Merge pull request #1585 from apache/dependabot/maven/commons-codec-commons-codec-1.16.1

    Bump commons-codec:commons-codec from 1.16.0 to 1.16.1

 tika-parent/pom.xml | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)
(tika) branch main updated (7925830b8 -> 51605eeb4)
This is an automated email from the ASF dual-hosted git repository.

tilman pushed a change to branch main
in repository https://gitbox.apache.org/repos/asf/tika.git

    from 7925830b8 Merge pull request #1586 from apache/dependabot/maven/org.testcontainers-testcontainers-bom-1.19.5
     add bdc8fa1e2 Bump commons-codec:commons-codec from 1.16.0 to 1.16.1
     new 51605eeb4 Merge pull request #1585 from apache/dependabot/maven/commons-codec-commons-codec-1.16.1

The 1 revisions listed above as "new" are entirely new to this repository and will be described in separate emails. The revisions listed as "add" were already present in the repository and have only been added to this reference.

Summary of changes:
 tika-parent/pom.xml | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)
(tika) branch dependabot/maven/org.testcontainers-testcontainers-bom-1.19.5 deleted (was 2e92b79dc)
This is an automated email from the ASF dual-hosted git repository. github-bot pushed a change to branch dependabot/maven/org.testcontainers-testcontainers-bom-1.19.5 in repository https://gitbox.apache.org/repos/asf/tika.git was 2e92b79dc Bump org.testcontainers:testcontainers-bom from 1.19.4 to 1.19.5 The revisions that were on this branch are still contained in other references; therefore, this change does not discard any commits from the repository.
(tika) branch main updated (16e1bc9c8 -> 7925830b8)
This is an automated email from the ASF dual-hosted git repository.

tilman pushed a change to branch main
in repository https://gitbox.apache.org/repos/asf/tika.git

    from 16e1bc9c8 TIKA-4193 -- add num common tokens to TikaEvalMetadataFilter (#1582)
     add 2e92b79dc Bump org.testcontainers:testcontainers-bom from 1.19.4 to 1.19.5
     add 7925830b8 Merge pull request #1586 from apache/dependabot/maven/org.testcontainers-testcontainers-bom-1.19.5

No new revisions were added by this update.

Summary of changes:
 tika-parent/pom.xml | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)
(tika) branch dependabot/maven/org.testcontainers-testcontainers-bom-1.19.5 created (now 2e92b79dc)
This is an automated email from the ASF dual-hosted git repository. github-bot pushed a change to branch dependabot/maven/org.testcontainers-testcontainers-bom-1.19.5 in repository https://gitbox.apache.org/repos/asf/tika.git at 2e92b79dc Bump org.testcontainers:testcontainers-bom from 1.19.4 to 1.19.5 No new revisions were added by this update.
(tika) branch dependabot/maven/commons-codec-commons-codec-1.16.1 created (now bdc8fa1e2)
This is an automated email from the ASF dual-hosted git repository. github-bot pushed a change to branch dependabot/maven/commons-codec-commons-codec-1.16.1 in repository https://gitbox.apache.org/repos/asf/tika.git at bdc8fa1e2 Bump commons-codec:commons-codec from 1.16.0 to 1.16.1 No new revisions were added by this update.
(tika) branch dependabot/maven/org.apache.solr-solr-solrj-8.11.3 created (now cb572342b)
This is an automated email from the ASF dual-hosted git repository. github-bot pushed a change to branch dependabot/maven/org.apache.solr-solr-solrj-8.11.3 in repository https://gitbox.apache.org/repos/asf/tika.git at cb572342b Bump org.apache.solr:solr-solrj from 8.11.2 to 8.11.3 No new revisions were added by this update.
(tika) branch dependabot/maven/aws.version-1.12.655 created (now 1cd7a7b3c)
This is an automated email from the ASF dual-hosted git repository. github-bot pushed a change to branch dependabot/maven/aws.version-1.12.655 in repository https://gitbox.apache.org/repos/asf/tika.git at 1cd7a7b3c Bump aws.version from 1.12.654 to 1.12.655 No new revisions were added by this update.
(tika) branch TIKA-4193 deleted (was d07fb16b1)
This is an automated email from the ASF dual-hosted git repository. tallison pushed a change to branch TIKA-4193 in repository https://gitbox.apache.org/repos/asf/tika.git was d07fb16b1 TIKA-4193 -- add num common tokens to TikaEvalMetadataFilter The revisions that were on this branch are still contained in other references; therefore, this change does not discard any commits from the repository.
(tika) branch main updated: TIKA-4193 -- add num common tokens to TikaEvalMetadataFilter (#1582)
This is an automated email from the ASF dual-hosted git repository.

tallison pushed a commit to branch main
in repository https://gitbox.apache.org/repos/asf/tika.git

The following commit(s) were added to refs/heads/main by this push:
     new 16e1bc9c8 TIKA-4193 -- add num common tokens to TikaEvalMetadataFilter (#1582)
16e1bc9c8 is described below

commit 16e1bc9c8e4f5e253fc519a477da92410730d060
Author: Tim Allison
AuthorDate: Thu Feb 8 15:05:02 2024 -0500

    TIKA-4193 -- add num common tokens to TikaEvalMetadataFilter (#1582)
---
 .../org/apache/tika/eval/core/metadata/TikaEvalMetadataFilter.java | 4 ++++
 .../apache/tika/eval/core/metadata/TikaEvalMetadataFilterTest.java | 1 +
 2 files changed, 5 insertions(+)

diff --git a/tika-eval/tika-eval-core/src/main/java/org/apache/tika/eval/core/metadata/TikaEvalMetadataFilter.java b/tika-eval/tika-eval-core/src/main/java/org/apache/tika/eval/core/metadata/TikaEvalMetadataFilter.java
index 0ac65d240..811958af4 100644
--- a/tika-eval/tika-eval-core/src/main/java/org/apache/tika/eval/core/metadata/TikaEvalMetadataFilter.java
+++ b/tika-eval/tika-eval-core/src/main/java/org/apache/tika/eval/core/metadata/TikaEvalMetadataFilter.java
@@ -48,6 +48,9 @@ public class TikaEvalMetadataFilter extends MetadataFilter {
     public static Property NUM_ALPHA_TOKENS =
             Property.externalInteger(TIKA_EVAL_NS + "numAlphaTokens");

+    public static Property NUM_COMMON_TOKENS =
+            Property.externalInteger(TIKA_EVAL_NS + "numCommonTokens");
+
     public static Property NUM_UNIQUE_ALPHA_TOKENS =
             Property.externalInteger(TIKA_EVAL_NS + "numUniqueAlphaTokens");

@@ -90,6 +93,7 @@ public class TikaEvalMetadataFilter extends MetadataFilter {
         CommonTokenResult commonTokenResult = (CommonTokenResult) results.get(CommonTokens.class);
         metadata.set(NUM_ALPHA_TOKENS, commonTokenResult.getAlphabeticTokens());
         metadata.set(NUM_UNIQUE_ALPHA_TOKENS, commonTokenResult.getUniqueAlphabeticTokens());
+        metadata.set(NUM_COMMON_TOKENS, commonTokenResult.getCommonTokens());
         if (commonTokenResult.getAlphabeticTokens() > 0) {
             metadata.set(OUT_OF_VOCABULARY, commonTokenResult.getOOV());
         } else {
diff --git a/tika-eval/tika-eval-core/src/test/java/org/apache/tika/eval/core/metadata/TikaEvalMetadataFilterTest.java b/tika-eval/tika-eval-core/src/test/java/org/apache/tika/eval/core/metadata/TikaEvalMetadataFilterTest.java
index 1961698b4..f1fd21c21 100644
--- a/tika-eval/tika-eval-core/src/test/java/org/apache/tika/eval/core/metadata/TikaEvalMetadataFilterTest.java
+++ b/tika-eval/tika-eval-core/src/test/java/org/apache/tika/eval/core/metadata/TikaEvalMetadataFilterTest.java
@@ -42,6 +42,7 @@ public class TikaEvalMetadataFilterTest {
         assertEquals(11, (int) metadata.getInt(TikaEvalMetadataFilter.NUM_UNIQUE_TOKENS));
         assertEquals(10, (int) metadata.getInt(TikaEvalMetadataFilter.NUM_ALPHA_TOKENS));
         assertEquals(9, (int) metadata.getInt(TikaEvalMetadataFilter.NUM_UNIQUE_ALPHA_TOKENS));
+        assertEquals(9, (int) metadata.getInt(TikaEvalMetadataFilter.NUM_COMMON_TOKENS));
         assertEquals(0.0999,
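To illustrate what the new `numCommonTokens` value might represent, here is a minimal standalone sketch, not the tika-eval implementation. It assumes that a "common token" is an alphabetic token found in a list of common words, and that the out-of-vocabulary ratio is 1 - common/alphabetic. The latter is consistent with the test values in the diff (9 common out of 10 alphabetic, about 0.1) but is not stated there. The class `CommonTokenSketch`, its word list, and the whitespace tokenizer are hypothetical.

```java
import java.util.Arrays;
import java.util.HashSet;
import java.util.Locale;
import java.util.Set;

// Hypothetical sketch; tika-eval's real tokenizer and word lists are not shown in the diff.
class CommonTokenSketch {

    // Assumed stand-in for a per-language list of common words.
    static final Set<String> COMMON_WORDS =
            new HashSet<>(Arrays.asList("the", "quick", "brown", "fox"));

    // A token counts as alphabetic here if it consists only of Unicode letters.
    static boolean isAlphabetic(String token) {
        return token.matches("\\p{L}+");
    }

    static int countAlphabeticTokens(String text) {
        int count = 0;
        for (String token : text.trim().split("\\s+")) {
            if (isAlphabetic(token)) {
                count++;
            }
        }
        return count;
    }

    static int countCommonTokens(String text) {
        int count = 0;
        for (String token : text.trim().split("\\s+")) {
            if (isAlphabetic(token) && COMMON_WORDS.contains(token.toLowerCase(Locale.ROOT))) {
                count++;
            }
        }
        return count;
    }

    // Assumed formula: share of alphabetic tokens that are not in the common-word list.
    static double outOfVocabulary(int alphabeticTokens, int commonTokens) {
        return 1.0 - (double) commonTokens / alphabeticTokens;
    }

    public static void main(String[] args) {
        String text = "The quick brown fox zzyzx 42";
        int alpha = countAlphabeticTokens(text);
        int common = countCommonTokens(text);
        System.out.println("alphabetic=" + alpha + " common=" + common);
        // Guard against division by zero, as the filter in the diff does before setting OOV.
        if (alpha > 0) {
            System.out.println("oov=" + outOfVocabulary(alpha, common));
        }
    }
}
```

Exposing the raw common-token count alongside the ratio lets a consumer recompute or aggregate the ratio across documents instead of averaging per-document percentages.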
(tika) branch TIKA-4193 created (now d07fb16b1)
This is an automated email from the ASF dual-hosted git repository. tallison pushed a change to branch TIKA-4193 in repository https://gitbox.apache.org/repos/asf/tika.git at d07fb16b1 TIKA-4193 -- add num common tokens to TikaEvalMetadataFilter This branch includes the following new commits: new d07fb16b1 TIKA-4193 -- add num common tokens to TikaEvalMetadataFilter The 1 revisions listed above as "new" are entirely new to this repository and will be described in separate emails. The revisions listed as "add" were already present in the repository and have only been added to this reference.
(tika) 01/01: TIKA-4193 -- add num common tokens to TikaEvalMetadataFilter
This is an automated email from the ASF dual-hosted git repository.

tallison pushed a commit to branch TIKA-4193
in repository https://gitbox.apache.org/repos/asf/tika.git

commit d07fb16b132294ced01a9ce64ae7f8263149f3d8
Author: tallison
AuthorDate: Thu Feb 8 14:38:30 2024 -0500

    TIKA-4193 -- add num common tokens to TikaEvalMetadataFilter
---
 .../org/apache/tika/eval/core/metadata/TikaEvalMetadataFilter.java | 4 ++++
 .../apache/tika/eval/core/metadata/TikaEvalMetadataFilterTest.java | 1 +
 2 files changed, 5 insertions(+)

diff --git a/tika-eval/tika-eval-core/src/main/java/org/apache/tika/eval/core/metadata/TikaEvalMetadataFilter.java b/tika-eval/tika-eval-core/src/main/java/org/apache/tika/eval/core/metadata/TikaEvalMetadataFilter.java
index 0ac65d240..811958af4 100644
--- a/tika-eval/tika-eval-core/src/main/java/org/apache/tika/eval/core/metadata/TikaEvalMetadataFilter.java
+++ b/tika-eval/tika-eval-core/src/main/java/org/apache/tika/eval/core/metadata/TikaEvalMetadataFilter.java
@@ -48,6 +48,9 @@ public class TikaEvalMetadataFilter extends MetadataFilter {
     public static Property NUM_ALPHA_TOKENS =
             Property.externalInteger(TIKA_EVAL_NS + "numAlphaTokens");

+    public static Property NUM_COMMON_TOKENS =
+            Property.externalInteger(TIKA_EVAL_NS + "numCommonTokens");
+
     public static Property NUM_UNIQUE_ALPHA_TOKENS =
             Property.externalInteger(TIKA_EVAL_NS + "numUniqueAlphaTokens");

@@ -90,6 +93,7 @@ public class TikaEvalMetadataFilter extends MetadataFilter {
         CommonTokenResult commonTokenResult = (CommonTokenResult) results.get(CommonTokens.class);
         metadata.set(NUM_ALPHA_TOKENS, commonTokenResult.getAlphabeticTokens());
         metadata.set(NUM_UNIQUE_ALPHA_TOKENS, commonTokenResult.getUniqueAlphabeticTokens());
+        metadata.set(NUM_COMMON_TOKENS, commonTokenResult.getCommonTokens());
         if (commonTokenResult.getAlphabeticTokens() > 0) {
             metadata.set(OUT_OF_VOCABULARY, commonTokenResult.getOOV());
         } else {
diff --git a/tika-eval/tika-eval-core/src/test/java/org/apache/tika/eval/core/metadata/TikaEvalMetadataFilterTest.java b/tika-eval/tika-eval-core/src/test/java/org/apache/tika/eval/core/metadata/TikaEvalMetadataFilterTest.java
index 1961698b4..f1fd21c21 100644
--- a/tika-eval/tika-eval-core/src/test/java/org/apache/tika/eval/core/metadata/TikaEvalMetadataFilterTest.java
+++ b/tika-eval/tika-eval-core/src/test/java/org/apache/tika/eval/core/metadata/TikaEvalMetadataFilterTest.java
@@ -42,6 +42,7 @@ public class TikaEvalMetadataFilterTest {
         assertEquals(11, (int) metadata.getInt(TikaEvalMetadataFilter.NUM_UNIQUE_TOKENS));
         assertEquals(10, (int) metadata.getInt(TikaEvalMetadataFilter.NUM_ALPHA_TOKENS));
         assertEquals(9, (int) metadata.getInt(TikaEvalMetadataFilter.NUM_UNIQUE_ALPHA_TOKENS));
+        assertEquals(9, (int) metadata.getInt(TikaEvalMetadataFilter.NUM_COMMON_TOKENS));
         assertEquals(0.0999,
(tika) branch branch_2x updated: TIKA-4162: update jackrabbit, google cloud, aws, junit, sqlite, fastutil
This is an automated email from the ASF dual-hosted git repository.

tilman pushed a commit to branch branch_2x
in repository https://gitbox.apache.org/repos/asf/tika.git

The following commit(s) were added to refs/heads/branch_2x by this push:
     new 29d3598a6 TIKA-4162: update jackrabbit, google cloud, aws, junit, sqlite, fastutil
29d3598a6 is described below

commit 29d3598a64ec78394c0004271526951917c4cece
Author: Tilman Hausherr
AuthorDate: Thu Feb 8 12:07:38 2024 +0100

    TIKA-4162: update jackrabbit, google cloud, aws, junit, sqlite, fastutil
---
 tika-parent/pom.xml | 12 ++--
 1 file changed, 6 insertions(+), 6 deletions(-)

diff --git a/tika-parent/pom.xml b/tika-parent/pom.xml
index b643ef0a8..69cf3c039 100644
--- a/tika-parent/pom.xml
+++ b/tika-parent/pom.xml
@@ -306,8 +306,8 @@
 0.16.1
-2.31.0
-1.12.651
+2.33.0
+1.12.654
 62.2
 1.4.0
-2.21.22
+2.21.23
 2.16.1
 1.3.2
 2.0
@@ -361,7 +361,7 @@
 5.14.0
 1.1.1
 4.13.2
-5.10.1
+5.10.2
 7.5.5
 0.9.3
 2.20.0
@@ -394,7 +394,7 @@
 8.11.2
 5.3.31
-3.45.0.0
+3.45.1.0
 1.2.1
 1.19.4
@@ -685,7 +685,7 @@
 it.unimi.dsi
 fastutil
-8.5.12
+8.5.13
 javax.annotation
(tika) branch main updated: TIKA-4166: update jackrabbit
This is an automated email from the ASF dual-hosted git repository.

tilman pushed a commit to branch main
in repository https://gitbox.apache.org/repos/asf/tika.git

The following commit(s) were added to refs/heads/main by this push:
     new 255cc7936 TIKA-4166: update jackrabbit
255cc7936 is described below

commit 255cc7936813388024f1833cfb854f7ce377529f
Author: Tilman Hausherr
AuthorDate: Thu Feb 8 11:31:25 2024 +0100

    TIKA-4166: update jackrabbit
---
 tika-parent/pom.xml | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/tika-parent/pom.xml b/tika-parent/pom.xml
index 140756327..640df7120 100644
--- a/tika-parent/pom.xml
+++ b/tika-parent/pom.xml
@@ -351,7 +351,7 @@
 4.4.16
 74.2
 1.4.0
-2.21.22
+2.21.23
 2.16.1
 4.0.5
 4.0.2