[tika] branch main updated (9488d076e -> 02f0d0441)

2023-06-08 Thread nick
This is an automated email from the ASF dual-hosted git repository. nick pushed a change to branch main in repository https://gitbox.apache.org/repos/asf/tika.git from 9488d076e bump OpenSearch version to latest add 500900d67 TIKA-4060 Test AAC files, based on testWAV.wav, one without

[tika] 01/01: Merge branch 'main' into TIKA-4060

2023-06-08 Thread nick
This is an automated email from the ASF dual-hosted git repository. nick pushed a commit to branch TIKA-4060 in repository https://gitbox.apache.org/repos/asf/tika.git commit 02f0d0441b8c380faebcf8bb14d6f91b0252f058 Merge: 04021e427 9488d076e Author: Nick Burch AuthorDate: Thu Jun 8 22:12:00

[tika] branch TIKA-4060 updated (04021e427 -> 02f0d0441)

2023-06-08 Thread nick
This is an automated email from the ASF dual-hosted git repository. nick pushed a change to branch TIKA-4060 in repository https://gitbox.apache.org/repos/asf/tika.git from 04021e427 Hex values in a match regex need escaping to be treated as hex add d72077833 Bump aws.version from

[tika] branch TIKA-4060 updated: Hex values in a match regex need escaping to be treated as hex

2023-06-08 Thread nick
This is an automated email from the ASF dual-hosted git repository. nick pushed a commit to branch TIKA-4060 in repository https://gitbox.apache.org/repos/asf/tika.git The following commit(s) were added to refs/heads/TIKA-4060 by this push: new 04021e427 Hex values in a match regex need

[tika] 03/03: AAC detection tests, ID3 one currently failing...

2023-06-07 Thread nick
This is an automated email from the ASF dual-hosted git repository. nick pushed a commit to branch TIKA-4060 in repository https://gitbox.apache.org/repos/asf/tika.git commit 8f838c512c6880ba21d2d6df36f592614710aba8 Author: Nick Burch AuthorDate: Wed Jun 7 23:58:11 2023 +0100 AAC detection

[tika] branch TIKA-4060 created (now 8f838c512)

2023-06-07 Thread nick
This is an automated email from the ASF dual-hosted git repository. nick pushed a change to branch TIKA-4060 in repository https://gitbox.apache.org/repos/asf/tika.git at 8f838c512 AAC detection tests, ID3 one currently failing... This branch includes the following new commits: new

[tika] 02/03: AAC magic, based on PRONOM patterns found by Gregory Lepore

2023-06-07 Thread nick
This is an automated email from the ASF dual-hosted git repository. nick pushed a commit to branch TIKA-4060 in repository https://gitbox.apache.org/repos/asf/tika.git commit ae85b9e4e4fb897ec901779fa7301c9316fb9a79 Author: Nick Burch AuthorDate: Wed Jun 7 23:57:46 2023 +0100 AAC magic

[tika] 01/03: TIKA-4060 Test AAC files, based on testWAV.wav, one without ID3, one with dummy ID3 values

2023-06-07 Thread nick
This is an automated email from the ASF dual-hosted git repository. nick pushed a commit to branch TIKA-4060 in repository https://gitbox.apache.org/repos/asf/tika.git commit 500900d67ede02e87440caa9f67501d3fe59b770 Author: Nick Burch AuthorDate: Wed Jun 7 23:56:55 2023 +0100 TIKA-4060

[tika] branch main updated (0d7a42f34 -> fc887690a)

2022-07-05 Thread nick
This is an automated email from the ASF dual-hosted git repository. nick pushed a change to branch main in repository https://gitbox.apache.org/repos/asf/tika.git from 0d7a42f34 TIKA-3795: update protobuf new 9d928bbf9 TIKA-3810 VTT with UTF-8 BOM new ec4cb612d WebVTT is text

[tika] 03/03: Merge branch 'main' of https://github.com/apache/tika into main

2022-07-05 Thread nick
This is an automated email from the ASF dual-hosted git repository. nick pushed a commit to branch main in repository https://gitbox.apache.org/repos/asf/tika.git commit fc887690a91a4b689a40a0be11d68dcdeb45a66f Merge: ec4cb612d 0d7a42f34 Author: Nick Burch AuthorDate: Tue Jul 5 11:32:57 2022

[tika] 01/03: TIKA-3810 VTT with UTF-8 BOM

2022-07-05 Thread nick
This is an automated email from the ASF dual-hosted git repository. nick pushed a commit to branch main in repository https://gitbox.apache.org/repos/asf/tika.git commit 9d928bbf9e93131d5021d4e5afddb4ba18df6531 Author: Nick Burch AuthorDate: Tue Jul 5 11:21:17 2022 +0100 TIKA-3810 VTT

[tika] 02/03: WebVTT is text based, so check for both line endings on the BOM cases like we do for no-BOM

2022-07-05 Thread nick
This is an automated email from the ASF dual-hosted git repository. nick pushed a commit to branch main in repository https://gitbox.apache.org/repos/asf/tika.git commit ec4cb612d1cda09907c88f2c5a06cc3cb7a839ef Author: Nick Burch AuthorDate: Tue Jul 5 11:22:59 2022 +0100 WebVTT is text

[tika] 01/02: Crypto test files - Encrypted version of testRSAKEY.pem, and a PKCS12 wrapped version

2022-06-05 Thread nick
This is an automated email from the ASF dual-hosted git repository. nick pushed a commit to branch main in repository https://gitbox.apache.org/repos/asf/tika.git commit 8ef8d636f87dd571a8dc844d1d7ac503522b13ed Author: Nick Burch AuthorDate: Sun Jun 5 15:34:54 2022 +0100 Crypto test files

[tika] branch main updated (5e3dab7ae -> 6bf9ee120)

2022-06-05 Thread nick
This is an automated email from the ASF dual-hosted git repository. nick pushed a change to branch main in repository https://gitbox.apache.org/repos/asf/tika.git from 5e3dab7ae TIKA-3751: update aws new 8ef8d636f Crypto test files - Encrypted version of testRSAKEY.pem, and a PKCS12

[tika] 02/02: Tests for encrypted RSA keys in PEM and DER, plus a disabled PKCS12 test pending TIKA-3784

2022-06-05 Thread nick
This is an automated email from the ASF dual-hosted git repository. nick pushed a commit to branch main in repository https://gitbox.apache.org/repos/asf/tika.git commit 6bf9ee120c2845ccdf61207322dcea2373388e75 Author: Nick Burch AuthorDate: Sun Jun 5 15:48:36 2022 +0100 Tests

[tika] branch main updated: PDP-11 style "Middle Endian" 32 bit read util, as used in the DGN file format

2022-04-28 Thread nick
This is an automated email from the ASF dual-hosted git repository. nick pushed a commit to branch main in repository https://gitbox.apache.org/repos/asf/tika.git The following commit(s) were added to refs/heads/main by this push: new f33d8930e PDP-11 style "Middle Endian" 32 bit

[tika] 01/03: TIKA-3694 Additional details in HTML on mime type, and per-type json

2022-03-07 Thread nick
This is an automated email from the ASF dual-hosted git repository. nick pushed a commit to branch main in repository https://gitbox.apache.org/repos/asf/tika.git commit 7768c87b467bb6cc9d01f6b92c45131af3d44fef Author: Nick Burch AuthorDate: Mon Mar 7 22:49:22 2022 +0000 TIKA-3694

[tika] branch main updated (eda4427 -> d583973)

2022-03-07 Thread nick
This is an automated email from the ASF dual-hosted git repository. nick pushed a change to branch main in repository https://gitbox.apache.org/repos/asf/tika.git. from eda4427 Merge branch 'TIKA-3689' into main new 7768c87 TIKA-3694 Additional details in HTML on mime type, and per

[tika] 03/03: TIKA-3694 Unit test for type-specific page

2022-03-07 Thread nick
This is an automated email from the ASF dual-hosted git repository. nick pushed a commit to branch main in repository https://gitbox.apache.org/repos/asf/tika.git commit d583973f829aa2b48b8a08cb8c46927a3446ca7a Author: Nick Burch AuthorDate: Mon Mar 7 23:30:15 2022 +0000 TIKA-3694 Unit

[tika] branch branch_1x updated: TIKA-3373 Add the *.yml extension for YAML, which is commonly used, along with aliases for popular alternate mimetypes for it

2021-04-27 Thread nick
This is an automated email from the ASF dual-hosted git repository. nick pushed a commit to branch branch_1x in repository https://gitbox.apache.org/repos/asf/tika.git The following commit(s) were added to refs/heads/branch_1x by this push: new f7d5119 TIKA-3373 Add the *.yml extension

[tika] branch main updated: TIKA-3373 Add the *.yml extension for YAML, which is commonly used, along with aliases for popular alternate mimetypes for it

2021-04-27 Thread nick
This is an automated email from the ASF dual-hosted git repository. nick pushed a commit to branch main in repository https://gitbox.apache.org/repos/asf/tika.git The following commit(s) were added to refs/heads/main by this push: new 60c0aae TIKA-3373 Add the *.yml extension for YAML

[tika] branch branch_1x updated: Changelog update

2021-03-14 Thread nick
This is an automated email from the ASF dual-hosted git repository. nick pushed a commit to branch branch_1x in repository https://gitbox.apache.org/repos/asf/tika.git The following commit(s) were added to refs/heads/branch_1x by this push: new a1ec3fd Changelog update a1ec3fd

[tika] branch branch_1x updated: Backport to 1.x - TIKA-3310 Check if MP4 file's compatible brands match any of the expected values, from Peter Kronenberg

2021-03-14 Thread nick
This is an automated email from the ASF dual-hosted git repository. nick pushed a commit to branch branch_1x in repository https://gitbox.apache.org/repos/asf/tika.git The following commit(s) were added to refs/heads/branch_1x by this push: new b0242ee Backport to 1.x - TIKA-3310 Check

[tika] branch main updated (356cf44 -> 4bd931d)

2021-03-14 Thread nick
This is an automated email from the ASF dual-hosted git repository. nick pushed a change to branch main in repository https://gitbox.apache.org/repos/asf/tika.git. from 356cf44 TIKA-3318 Document the units of xmpDM:duration as seconds by default new d80dc36 TIKA-3310 Check if MP4

[tika] branch branch_1x updated: Changelog update

2021-03-14 Thread nick
This is an automated email from the ASF dual-hosted git repository. nick pushed a commit to branch branch_1x in repository https://gitbox.apache.org/repos/asf/tika.git The following commit(s) were added to refs/heads/branch_1x by this push: new a4c9257 Changelog update a4c9257

[tika] 02/02: TIKA-3318 MP3 parser should output the xmpDM:duration metadata as seconds not milliseconds

2021-03-14 Thread nick
This is an automated email from the ASF dual-hosted git repository. nick pushed a commit to branch branch_1x in repository https://gitbox.apache.org/repos/asf/tika.git commit 21b3cf8b5a209ab6cf0176d8bc55e640fdc8c351 Author: Nick Burch AuthorDate: Sun Mar 14 20:20:14 2021 +0000 TIKA-3318

[tika] branch branch_1x updated (02ed830 -> 21b3cf8)

2021-03-14 Thread nick
This is an automated email from the ASF dual-hosted git repository. nick pushed a change to branch branch_1x in repository https://gitbox.apache.org/repos/asf/tika.git. from 02ed830 TIKA-3244: update pax-url-aether new 8081e6d TIKA-3318 Document the units of xmpDM:duration as seconds

[tika] 01/02: TIKA-3318 Document the units of xmpDM:duration as seconds by default

2021-03-14 Thread nick
This is an automated email from the ASF dual-hosted git repository. nick pushed a commit to branch branch_1x in repository https://gitbox.apache.org/repos/asf/tika.git commit 8081e6da8d34ef9675638699eb2ec6d6145c89d4 Author: Nick Burch AuthorDate: Sun Mar 14 19:24:43 2021 +0000 TIKA-3318

[tika] branch main updated: TIKA-3318 Document the units of xmpDM:duration as seconds by default

2021-03-14 Thread nick
This is an automated email from the ASF dual-hosted git repository. nick pushed a commit to branch main in repository https://gitbox.apache.org/repos/asf/tika.git The following commit(s) were added to refs/heads/main by this push: new 356cf44 TIKA-3318 Document the units of xmpDM:duration

[tika] branch main updated: TIKA-3318 MP3 parser should output the xmpDM:duration metadata as seconds not milliseconds

2021-03-14 Thread nick
This is an automated email from the ASF dual-hosted git repository. nick pushed a commit to branch main in repository https://gitbox.apache.org/repos/asf/tika.git The following commit(s) were added to refs/heads/main by this push: new 31da853 TIKA-3318 MP3 parser should output

[tika] 04/05: Split the Certificate and Key mimetypes into DER and PEM subtypes, add test EC files. TIKA-3205

2020-09-30 Thread nick
This is an automated email from the ASF dual-hosted git repository. nick pushed a commit to branch branch_1x in repository https://gitbox.apache.org/repos/asf/tika.git commit 51829d630360060d2fff84e8dc2b1346834ecfda Author: Nick Burch AuthorDate: Tue Sep 29 16:48:40 2020 +0100 Split

[tika] branch branch_1x updated (9736af8 -> 1fce089)

2020-09-30 Thread nick
This is an automated email from the ASF dual-hosted git repository. nick pushed a change to branch branch_1x in repository https://gitbox.apache.org/repos/asf/tika.git. from 9736af8 Fix TIKA-3196 (#364) new 5c2c4a2 Add test certificate and key for TIKA-3205 new 28ec71d TIKA

[tika] 02/05: TIKA-3205 Add magic for X509 PEM certificate, and tweak default type

2020-09-30 Thread nick
This is an automated email from the ASF dual-hosted git repository. nick pushed a commit to branch branch_1x in repository https://gitbox.apache.org/repos/asf/tika.git commit 28ec71d2d3afa52e84fa16ee5df289dd696980ed Author: Nick Burch AuthorDate: Tue Sep 29 15:49:14 2020 +0100 TIKA-3205

[tika] 03/05: Add some more DER magic for certificates, and add tests TIKA-3205

2020-09-30 Thread nick
This is an automated email from the ASF dual-hosted git repository. nick pushed a commit to branch branch_1x in repository https://gitbox.apache.org/repos/asf/tika.git commit e95287761da40c72f45390d1b892d8cdef33c216 Author: Nick Burch AuthorDate: Tue Sep 29 16:23:08 2020 +0100 Add some

[tika] 05/05: Make the DER private key mostly-match a bit more specific

2020-09-30 Thread nick
This is an automated email from the ASF dual-hosted git repository. nick pushed a commit to branch branch_1x in repository https://gitbox.apache.org/repos/asf/tika.git commit 1fce08921cea11bc79c708d4f72b9e4bf70b8c2c Author: Nick Burch AuthorDate: Tue Sep 29 16:51:19 2020 +0100 Make the DER

[tika] 01/05: Add test certificate and key for TIKA-3205

2020-09-30 Thread nick
This is an automated email from the ASF dual-hosted git repository. nick pushed a commit to branch branch_1x in repository https://gitbox.apache.org/repos/asf/tika.git commit 5c2c4a2fb91cc160eaf007b71efcd854402e1624 Author: Nick Burch AuthorDate: Tue Sep 29 15:26:48 2020 +0100 Add test

[tika] branch main updated: Move new test files to 2.x folder, doh!

2020-09-30 Thread nick
This is an automated email from the ASF dual-hosted git repository. nick pushed a commit to branch main in repository https://gitbox.apache.org/repos/asf/tika.git The following commit(s) were added to refs/heads/main by this push: new 0844dce Move new test files to 2.x folder, doh

[tika] 02/05: TIKA-3205 Add magic for X509 PEM certificate, and tweak default type

2020-09-29 Thread nick
This is an automated email from the ASF dual-hosted git repository. nick pushed a commit to branch main in repository https://gitbox.apache.org/repos/asf/tika.git commit ecd1d62ad9e4d2ddd53abf204539e5d765e6c624 Author: Nick Burch AuthorDate: Tue Sep 29 15:49:14 2020 +0100 TIKA-3205 Add

[tika] 04/05: Split the Certificate and Key mimetypes into DER and PEM subtypes, add test EC files. TIKA-3205

2020-09-29 Thread nick
This is an automated email from the ASF dual-hosted git repository. nick pushed a commit to branch main in repository https://gitbox.apache.org/repos/asf/tika.git commit fa1b2ef87157f51797d0dcaed36ebc990e538910 Author: Nick Burch AuthorDate: Tue Sep 29 16:48:40 2020 +0100 Split

[tika] 01/05: Add test certificate and key for TIKA-3205

2020-09-29 Thread nick
This is an automated email from the ASF dual-hosted git repository. nick pushed a commit to branch main in repository https://gitbox.apache.org/repos/asf/tika.git commit ad0d98b9a155e483b815eb01e36ebd02a101695a Author: Nick Burch AuthorDate: Tue Sep 29 15:26:48 2020 +0100 Add test

[tika] 05/05: Make the DER private key mostly-match a bit more specific

2020-09-29 Thread nick
This is an automated email from the ASF dual-hosted git repository. nick pushed a commit to branch main in repository https://gitbox.apache.org/repos/asf/tika.git commit 75c2ff5686a70c0fb15c4b52534c1be09669af1b Author: Nick Burch AuthorDate: Tue Sep 29 16:51:19 2020 +0100 Make the DER

[tika] 03/05: Add some more DER magic for certificates, and add tests TIKA-3205

2020-09-29 Thread nick
This is an automated email from the ASF dual-hosted git repository. nick pushed a commit to branch main in repository https://gitbox.apache.org/repos/asf/tika.git commit b0ae63a1c59ef60ac6b134cadf2053f2e73152d4 Author: Nick Burch AuthorDate: Tue Sep 29 16:23:08 2020 +0100 Add some more DER

[tika] branch main updated (6591b32 -> 75c2ff5)

2020-09-29 Thread nick
This is an automated email from the ASF dual-hosted git repository. nick pushed a change to branch main in repository https://gitbox.apache.org/repos/asf/tika.git. from 6591b32 TIKA-3196 -- ensure that entryCnt is thread-safe across parses; add integration test; clean up existing unused

[tika] 04/05: Split the Certificate and Key mimetypes into DER and PEM subtypes, add test EC files. TIKA-3205

2020-09-29 Thread nick
This is an automated email from the ASF dual-hosted git repository. nick pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/tika.git commit c6b30c578e98373496f895cd7caa8317f4212d51 Author: Nick Burch AuthorDate: Tue Sep 29 16:48:40 2020 +0100 Split

[tika] 03/05: Add some more DER magic for certificates, and add tests TIKA-3205

2020-09-29 Thread nick
This is an automated email from the ASF dual-hosted git repository. nick pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/tika.git commit eaa712f89d5de9ad06647fa29d10ac1baa47a4c0 Author: Nick Burch AuthorDate: Tue Sep 29 16:23:08 2020 +0100 Add some more

[tika] branch master updated (62fe4ad -> 6183452)

2020-09-29 Thread nick
This is an automated email from the ASF dual-hosted git repository. nick pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/tika.git. from 62fe4ad TIKA-3104 -- add detection and parsing for xml based plist files new 5fdb70a Add test certificate

[tika] 01/05: Add test certificate and key for TIKA-3205

2020-09-29 Thread nick
This is an automated email from the ASF dual-hosted git repository. nick pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/tika.git commit 5fdb70ae4770301d6b101e9007a1058e15abac94 Author: Nick Burch AuthorDate: Tue Sep 29 15:26:48 2020 +0100 Add test

[tika] 05/05: Make the DER private key mostly-match a bit more specific

2020-09-29 Thread nick
This is an automated email from the ASF dual-hosted git repository. nick pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/tika.git commit 618345263ee41108e1a225dbcdbb8db16b2aae28 Author: Nick Burch AuthorDate: Tue Sep 29 16:51:19 2020 +0100 Make the DER

[tika] 02/05: TIKA-3205 Add magic for X509 PEM certificate, and tweak default type

2020-09-29 Thread nick
This is an automated email from the ASF dual-hosted git repository. nick pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/tika.git commit c3fff83c7e955ff5de0e4cb9098b06a15ee2cf7e Author: Nick Burch AuthorDate: Tue Sep 29 15:49:14 2020 +0100 TIKA-3205 Add

[tika] branch master updated: Tweak whitespace to be consistent

2020-05-28 Thread nick
This is an automated email from the ASF dual-hosted git repository. nick pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/tika.git The following commit(s) were added to refs/heads/master by this push: new f233f3b Tweak whitespace to be consistent f233f3b

[tika] branch master updated (0bf11ae -> 1140091)

2020-05-28 Thread nick
This is an automated email from the ASF dual-hosted git repository. nick pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/tika.git. from 0bf11ae TIKA-2961 Make the CAF mime magic more specific to avoid false positives, by checking for a version number

[tika] 01/02: Make the bplist magic more specific where possible, keep version catch-all as now otherwise

2020-05-28 Thread nick
This is an automated email from the ASF dual-hosted git repository. nick pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/tika.git commit e9d62d24c19250053aee07a59c9e4de5197f2f42 Author: Nick Burch AuthorDate: Thu May 28 07:05:30 2020 +0100 Make the bplist

[tika] 02/02: Add glob for Xcode Memgraph files, which are bplist-based

2020-05-28 Thread nick
This is an automated email from the ASF dual-hosted git repository. nick pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/tika.git commit 114009165410c91b57b91fc4eaddb089a8559451 Author: Nick Burch AuthorDate: Thu May 28 07:06:14 2020 +0100 Add glob

[tika] branch master updated: TIKA-2961 Make the CAF mime magic more specific to avoid false positives, by checking for a version number after the "caff" header text

2020-05-17 Thread nick
This is an automated email from the ASF dual-hosted git repository. nick pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/tika.git The following commit(s) were added to refs/heads/master by this push: new 0bf11ae TIKA-2961 Make the CAF mime magic more

[tika] branch master updated: TIKA-3023 Make the SGI Movie mime magic more specific to avoid false positives on text files starting with MOVI

2020-02-06 Thread nick
This is an automated email from the ASF dual-hosted git repository. nick pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/tika.git The following commit(s) were added to refs/heads/master by this push: new 0d259bc TIKA-3023 Make the SGI Movie mime magic

[tika] branch master updated: TIKA-3034 Mathematica files don't have a unique magic, but try to detect based on the file starting with a Mathematica-style comment as all we can do. Also add the newer

2020-02-04 Thread nick
This is an automated email from the ASF dual-hosted git repository. nick pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/tika.git The following commit(s) were added to refs/heads/master by this push: new f5571fa TIKA-3034 Mathematica files don't have

[tika] 04/05: HEIF detection unit test. When tooling improves, should ideally create another HEIF test file with another codec too

2019-11-18 Thread nick
This is an automated email from the ASF dual-hosted git repository. nick pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/tika.git commit 433a8c1625d302bf1a9d81f2ad1223df7bf83d31 Author: Nick Burch AuthorDate: Mon Nov 18 14:57:09 2019 +0000 HEIF detection

[tika] 02/05: Test file uses the HEVC codec, so switch to the more specific extension

2019-11-18 Thread nick
This is an automated email from the ASF dual-hosted git repository. nick pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/tika.git commit efd071aa595d76d094f549f25db856229baace5d Author: Nick Burch AuthorDate: Mon Nov 18 14:54:42 2019 +0000 Test file uses

[tika] branch master updated (f6a5749 -> 1bb1895)

2019-11-18 Thread nick
This is an automated email from the ASF dual-hosted git repository. nick pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/tika.git. from f6a5749 TIKA-2982 -- don't require 'DataSpaces' in ooxml-encrypted detection new 8cfacfe Test HEIF file

[tika] 05/05: Changelog update

2019-11-18 Thread nick
This is an automated email from the ASF dual-hosted git repository. nick pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/tika.git commit 1bb1895a30b722a9780122a6447598dd29e75ca7 Author: Nick Burch AuthorDate: Mon Nov 18 15:00:33 2019 +0000 Changelog

[tika] 03/05: Add mimetypes for the HEIF (High Efficiency Image File) format family - TIKA-2942

2019-11-18 Thread nick
This is an automated email from the ASF dual-hosted git repository. nick pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/tika.git commit 0758598bece92f97418f88d0c443e8d9cff7a7ee Author: Nick Burch AuthorDate: Mon Nov 18 14:55:45 2019 +0000 Add mimetypes

svn commit: r1869088 - /tika/site/src/site/resources/doap.rdf

2019-10-28 Thread nick
Author: nick Date: Mon Oct 28 21:35:45 2019 New Revision: 1869088 URL: http://svn.apache.org/viewvc?rev=1869088&view=rev Log: Correct the RDF link for the projects category, and add us to Content too Modified: tika/site/src/site/resources/doap.rdf Modified: tika/site/src/site/resources/doap.rdf

svn commit: r1867123 - in /tika/site: pom.xml publish/.htaccess publish/source-repository.html

2019-09-18 Thread nick
Author: nick Date: Wed Sep 18 13:56:36 2019 New Revision: 1867123 URL: http://svn.apache.org/viewvc?rev=1867123&view=rev Log: TIKA-2947 Update the source control details in the site pom, so the auto-generated source repo file is correct Added: tika/site/publish/source-repository.html Removed

svn commit: r1867122 [2/2] - in /tika/site/publish: 0.10/ 0.5/ 0.6/ 0.7/ 0.8/ 0.9/ 1.0/ 1.1/ 1.10/ 1.11/ 1.12/ 1.13/ 1.14/ 1.15/ 1.16/ 1.17/ 1.18/ 1.19.1/ 1.19/ 1.2/ 1.20/ 1.21/ 1.22/ 1.3/ 1.4/ 1.5/ 1

2019-09-18 Thread nick
Modified: tika/site/publish/1.18/gettingstarted.html URL: http://svn.apache.org/viewvc/tika/site/publish/1.18/gettingstarted.html?rev=1867122&r1=1867121&r2=1867122&view=diff == --- tika/site/publish/1.18/gettingstarted.html

svn commit: r1867122 [1/2] - in /tika/site/publish: 0.10/ 0.5/ 0.6/ 0.7/ 0.8/ 0.9/ 1.0/ 1.1/ 1.10/ 1.11/ 1.12/ 1.13/ 1.14/ 1.15/ 1.16/ 1.17/ 1.18/ 1.19.1/ 1.19/ 1.2/ 1.20/ 1.21/ 1.22/ 1.3/ 1.4/ 1.5/ 1

2019-09-18 Thread nick
Author: nick Date: Wed Sep 18 13:50:40 2019 New Revision: 1867122 URL: http://svn.apache.org/viewvc?rev=1867122&view=rev Log: TIKA-2947 Update source code link Modified: tika/site/publish/0.10/gettingstarted.html tika/site/publish/0.5/gettingstarted.html tika/site/publish/0.6

svn commit: r1867120 - /tika/site/publish/.htaccess

2019-09-18 Thread nick
Author: nick Date: Wed Sep 18 13:43:41 2019 New Revision: 1867120 URL: http://svn.apache.org/viewvc?rev=1867120&view=rev Log: Remove the old source code page, redirect to the new one Modified: tika/site/publish/.htaccess Modified: tika/site/publish/.htaccess URL: http://svn.apache.org/viewvc

svn commit: r1867119 - /tika/site/publish/.htaccess

2019-09-18 Thread nick
Author: nick Date: Wed Sep 18 13:42:35 2019 New Revision: 1867119 URL: http://svn.apache.org/viewvc?rev=1867119&view=rev Log: Remove the old source code page, redirect to the new one Modified: tika/site/publish/.htaccess Modified: tika/site/publish/.htaccess URL: http://svn.apache.org/viewvc

svn commit: r1867118 - in /tika/site/publish: .htaccess source-repository.html

2019-09-18 Thread nick
Author: nick Date: Wed Sep 18 13:41:56 2019 New Revision: 1867118 URL: http://svn.apache.org/viewvc?rev=1867118&view=rev Log: Remove the old source code page, redirect to the new one Added: tika/site/publish/.htaccess Removed: tika/site/publish/source-repository.html Added: tika/site/publish

svn commit: r1867117 - in /tika/site/src/site/apt: 0.10/ 0.5/ 0.6/ 0.7/ 0.8/ 0.9/ 1.0/ 1.1/ 1.10/ 1.11/ 1.12/ 1.13/ 1.14/ 1.15/ 1.16/ 1.17/ 1.18/ 1.19.1/ 1.19/ 1.2/ 1.20/ 1.21/ 1.22/ 1.3/ 1.4/ 1.5/ 1.

2019-09-18 Thread nick
Author: nick Date: Wed Sep 18 13:38:59 2019 New Revision: 1867117 URL: http://svn.apache.org/viewvc?rev=1867117&view=rev Log: TIKA-2947 Fix source code documentation link Modified: tika/site/src/site/apt/0.10/gettingstarted.apt tika/site/src/site/apt/0.5/gettingstarted.apt tika/site/src

[tika] 03/03: Use the new RSS 2.0 file in tests too, alongside the current 0.91 one

2018-10-17 Thread nick
This is an automated email from the ASF dual-hosted git repository. nick pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/tika.git commit a0546b6cb98c949bb747b2e0e8d5675f651f6a16 Author: Nick Burch AuthorDate: Wed Oct 17 17:43:12 2018 +0100 Use the new RSS

[tika] 01/03: RSS test file is RSS v0.91, so name appropriately

2018-10-17 Thread nick
This is an automated email from the ASF dual-hosted git repository. nick pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/tika.git commit 429b22b2ac9ff96cfca714895d65dce311522616 Author: Nick Burch AuthorDate: Wed Oct 17 17:15:33 2018 +0100 RSS test file

[tika] branch master updated (5310f17 -> a0546b6)

2018-10-17 Thread nick
This is an automated email from the ASF dual-hosted git repository. nick pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/tika.git. from 5310f17 TIKA-2757 -- add versions plugin new 429b22b RSS test file is RSS v0.91, so name appropriately new

[tika] branch master updated (3d5d4d8 -> 705b79c)

2018-09-06 Thread nick
This is an automated email from the ASF dual-hosted git repository. nick pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/tika.git. from 3d5d4d8 Merge pull request #239 from wowselim/master new b26a0cc Merge branch 'master' of https://github.com

[tika] 01/04: Merge branch 'master' of https://github.com/wowselim/tika

2018-09-06 Thread nick
This is an automated email from the ASF dual-hosted git repository. nick pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/tika.git commit b26a0ccdbd5620b870df0dc434d2f9265b2df082 Merge: e4f0fe5 eb33286 Author: Nick Burch AuthorDate: Wed Sep 5 20:46:56 2018

[tika] 04/04: Changes update

2018-09-06 Thread nick
This is an automated email from the ASF dual-hosted git repository. nick pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/tika.git commit 705b79ccb6c0ad0f92a3a185bf7e66cacf899931 Author: Nick Burch AuthorDate: Thu Sep 6 09:28:24 2018 +0100 Changes update

[tika] 03/04: Mime magic for "MIME Encapsulation of Aggregate HTML Documents" (MHTML), pulled out from rfc822 (may not be fully correct long-term...)

2018-09-06 Thread nick
This is an automated email from the ASF dual-hosted git repository. nick pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/tika.git commit 9a2c7d89e03ca7c0e821b69c394165297edfb9d4 Author: Nick Burch AuthorDate: Thu Sep 6 09:28:14 2018 +0100 Mime magic

[tika] 02/04: Merge branch 'master' of https://github.com/apache/tika

2018-09-06 Thread nick
This is an automated email from the ASF dual-hosted git repository. nick pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/tika.git commit 53c8434f497795885ff129e17440881f059c1624 Merge: b26a0cc 3d5d4d8 Author: Nick Burch AuthorDate: Wed Sep 5 20:58:20 2018

[tika] branch master updated (e4f0fe5 -> 3d5d4d8)

2018-09-05 Thread nick
This is an automated email from the ASF dual-hosted git repository. nick pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/tika.git. from e4f0fe5 Use DateUtils to format dates to strings, rather than relying on explicit/implicit toString calls add

[tika] 01/01: Merge pull request #239 from wowselim/master

2018-09-05 Thread nick
This is an automated email from the ASF dual-hosted git repository. nick pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/tika.git commit 3d5d4d8b9667a31e3cb30a9d02543347feefbcc7 Merge: e4f0fe5 eb33286 Author: Gagravarr AuthorDate: Wed Sep 5 20:58:07 2018 +0100

[tika] branch master updated: Use DateUtils to format dates to strings, rather than relying on explicit/implicit toString calls

2018-09-05 Thread nick
This is an automated email from the ASF dual-hosted git repository. nick pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/tika.git The following commit(s) were added to refs/heads/master by this push: new e4f0fe5 Use DateUtils to format dates to strings

[tika] 01/07: TIKA-2479 Option to request missing rows where possible in Excel-like formats

2018-05-18 Thread nick
This is an automated email from the ASF dual-hosted git repository. nick pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/tika.git commit a1e42a0659ba33e90cb1bba0a0a10eeb97d4fac7 Author: Nick Burch <n...@gagravarr.org> AuthorDate: Thu May 17 22:15:34 2018

[tika] 07/07: Add the other jackcess jar to the bundle

2018-05-18 Thread nick
This is an automated email from the ASF dual-hosted git repository. nick pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/tika.git commit 12693ea18f1a05894272aa3a9293d41215f63c06 Author: Nick Burch <n...@gagravarr.org> AuthorDate: Fri May 18 15:35:06 2018

[tika] 03/07: Updated Columnar output from SAS with better formats

2018-05-18 Thread nick
This is an automated email from the ASF dual-hosted git repository. nick pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/tika.git commit b01b059331f198d3829b111002cf03cbcaf1bab3 Author: Nick Burch <n...@gagravarr.org> AuthorDate: Fri May 18 11:43:47 2018

[tika] 05/07: TIKA-2479 Update XLS missing cell/row handling to match XLSX and XLSB, add unit test for missing rows, and enable the Columnar tests for the Excel formats

2018-05-18 Thread nick
This is an automated email from the ASF dual-hosted git repository. nick pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/tika.git commit 348b87e7f41b79ff115e17d9c91d2dad63a57c15 Author: Nick Burch <n...@gagravarr.org> AuthorDate: Fri May 18 15:15:32 2018

[tika] 06/07: Changelog update

2018-05-18 Thread nick
This is an automated email from the ASF dual-hosted git repository. nick pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/tika.git commit 9673fbdbba8feebb72fee569074e94b0868a89df Author: Nick Burch <n...@gagravarr.org> AuthorDate: Fri May 18 15:17:56 2018

[tika] 04/07: Formatted columns in the columnar test Excel files

2018-05-18 Thread nick
This is an automated email from the ASF dual-hosted git repository. nick pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/tika.git commit 6fa1105e0669ffeec5c3cf0d1db247a8c16f3bc5 Author: Nick Burch <n...@gagravarr.org> AuthorDate: Fri May 18 15:13:43 2018

[tika] 02/07: TIKA-2479 Output missing left/mid cells in XLSX and XLSB, and optionally also missing rows

2018-05-18 Thread nick
This is an automated email from the ASF dual-hosted git repository. nick pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/tika.git commit b1b035e6bbcff0db24e133b682ac79916f92f599 Author: Nick Burch <n...@gagravarr.org> AuthorDate: Thu May 17 23:07:04 2018

[tika] branch master updated (5f05b51 -> 12693ea)

2018-05-18 Thread nick
This is an automated email from the ASF dual-hosted git repository. nick pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/tika.git. from 5f05b51 TIKA-2644 - refactor recursiveparserwrapper api new a1e42a0 TIKA-2479 Option to request missing rows

[tika] branch master updated: Mime magic for DPX and ACES, thanks to Andreas Meier (TIKA-2628 and TIKA-2629)

2018-05-10 Thread nick
This is an automated email from the ASF dual-hosted git repository. nick pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/tika.git The following commit(s) were added to refs/heads/master by this push: new ca3207c Mime magic for DPX and ACES, thanks

[tika] 04/04: Add disabled, currently failing ODS test

2018-05-10 Thread nick
This is an automated email from the ASF dual-hosted git repository. nick pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/tika.git commit 49833d88cb323928c3de7bd7a86ab38444530418 Author: Nick Burch <n...@gagravarr.org> AuthorDate: Thu May 10 17:13:24 2018

[tika] branch master updated (cfd6256 -> 49833d8)

2018-05-10 Thread nick
This is an automated email from the ASF dual-hosted git repository. nick pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/tika.git. from cfd6256 Remaining values to check new 6cff602 Ensure that empty cells are still output new d0fb697 Not all

[tika] 03/04: Use patterns to handle the date format variations

2018-05-10 Thread nick
This is an automated email from the ASF dual-hosted git repository. nick pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/tika.git commit 72994c8ac8f0c749f26f4f19b7992b8224fc2a12 Author: Nick Burch <n...@gagravarr.org> AuthorDate: Thu May 10 16:59:09 2018

[tika] 02/04: Not all formats know about %s, dates not completely consistent either...

2018-05-10 Thread nick
This is an automated email from the ASF dual-hosted git repository. nick pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/tika.git commit d0fb69715e83a42db2ee5c2750eaa9d3b4f4d86c Author: Nick Burch <n...@gagravarr.org> AuthorDate: Thu May 10 16:33:45 2018

[tika] 01/04: Ensure that empty cells are still output

2018-05-10 Thread nick
This is an automated email from the ASF dual-hosted git repository. nick pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/tika.git commit 6cff6029beb4316e541169d788fe1884b338 Author: Nick Burch <n...@gagravarr.org> AuthorDate: Thu May 10 16:26:22 2018

[tika] 02/05: Add a time column to the test columnar files

2018-05-10 Thread nick
This is an automated email from the ASF dual-hosted git repository. nick pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/tika.git commit ca2f5bc63b7595730e53e95758dc9aaf6b567daa Author: Nick Burch <n...@gagravarr.org> AuthorDate: Thu May 10 11:35:04 2018

[tika] branch master updated (a0ffec1 -> cfd6256)

2018-05-10 Thread nick
This is an automated email from the ASF dual-hosted git repository. nick pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/tika.git. from a0ffec1 Handle .epub files using .htm rather than .html extensions for the embedded contents (TIKA-1288) new

[tika] 01/05: Add a test .sas7bdat file with labels, and generate the columnar/tabular test file in a few more formats

2018-05-10 Thread nick
This is an automated email from the ASF dual-hosted git repository. nick pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/tika.git commit d0324f8e4fa70fce67d56dc70f611f5535fe229b Author: Nick Burch <n...@gagravarr.org> AuthorDate: Wed May 9 18:19:34 2018

[tika] 05/05: Remaining values to check

2018-05-10 Thread nick
This is an automated email from the ASF dual-hosted git repository. nick pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/tika.git commit cfd62569a8f6bf79ba5d15bb3f4063d49347c7fd Author: Nick Burch <n...@gagravarr.org> AuthorDate: Thu May 10 15:41:16 2018

[tika] 04/05: Check header contents, check data rows count, add XLSX test

2018-05-10 Thread nick
This is an automated email from the ASF dual-hosted git repository. nick pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/tika.git commit 7f89db35d066e6c4ae35490c5bad67d376e5365e Author: Nick Burch <n...@gagravarr.org> AuthorDate: Thu May 10 15:13:43 2018

[tika] branch master updated: Handle .epub files using .htm rather than .html extensions for the embedded contents (TIKA-1288)

2018-05-09 Thread nick
This is an automated email from the ASF dual-hosted git repository. nick pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/tika.git The following commit(s) were added to refs/heads/master by this push: new a0ffec1 Handle .epub files using .htm rather than
