This is an automated email from the ASF dual-hosted git repository.
thammegowda pushed a change to branch gsoc17
in repository https://gitbox.apache.org/repos/asf/tika.git.
from 65ef6d8 record change for TIKA-2262
add 9f0144c TIKA-2426 -- fix locale-sensitive test for xlsb
add c23c648 update tag
add bea5b9d WARC and ARC magic from Andy Jackson from
https://github.com/ukwa/tika/
add 068c87c Merge remote-tracking branch 'origin/master'
add 0277fbb TIKA-2042 Add a few more mbox patterns, based on file
supplied by mcaruanagalizia Matthew Caruana Galizia
add 9869851 TIKA-2430 -- add a capability to allow devs to easily run
parsers against randomly corrupted files.
add 55caab7 POIFSContainerDetector ASCII-encoded magic number
add 898946e Merge branch 'patch-1' of https://github.com/onealj/tika
add 08d09a5 Update WordMLParser.java
add 71a80c9 Merge branch 'patch-2' of https://github.com/onealj/tika into
onealj-patch2
add 12cce58 Merge branch 'onealj-patch2'
add 9687f08 SUPPORTED_TYPES is an immutable singleton set
add 13ebacf Fix conflicts. This closes #193
add f3acaed Fix conflicts. This closes #193. Thank you, Javen!
add 3de4c4f TIKA-2430 -- allow devs to fuzz embedded files individually
add 3f86a6b TIKA-2430 -- remove dev ignore emf
add d57a621 Fix a typo in log message, and adjust code indentation
add 8af9c96 Merge pull request #195 from kinow/fix-typo-and-indentation
add f53a2f2 SUPPORTED_TYPES is immutable
add 30b27ab Merge pull request #194 from onealj/patch-5
add 0579efe Two more EML header magics from Matthew Caruana Galizia from
TIKA-2042
add 00221ad TIKA-2042 -- fix typo.
add 1bf3a7e TIKA-2431 -- upgrade to PDFBox 2.0.7
add 268a815 TIKA 2262 : Update links
add 0d5cab9 TIKA 2262 : Minor changes to dockerfile
add ef12f7d Merge remote-tracking branch 'upstream/master'
add c92730c Minor changes
add 63ae47a Merge pull request #196 from ThejanW/master
add f31b7f1 TIKA-2433 All non-pipe modes need configuring, otherwise the
Tika Server fails
add a51add2 Forbidden APIs fix - Use a specified encoding when turning
Strings into Bytes
add 4455a6f TIKA-2436 Add a mime type for EMZ, subclass of gzip, much as
we have for the related WMZ
No new revisions were added by this update.
Summary of changes:
CHANGES.txt | 6 +-
.../src/main/java/org/apache/tika/cli/TikaCLI.java | 3 +-
.../org/apache/tika/mime/tika-mimetypes.xml | 34 ++-
.../tika/parser/recognition/AgeRecogniser.java | 17 +-
tika-parent/pom.xml | 2 +-
tika-parsers/pom.xml | 2 +-
.../captioning/tf/TensorflowRESTCaptioner.java | 4 +-
.../parser/microsoft/POIFSContainerDetector.java | 21 +-
.../tika/parser/microsoft/xml/WordMLParser.java | 38 ++-
.../tika/parser/captioning/tf/Im2txtRestDockerfile | 26 +-
.../tika/parser/captioning/tf/model_info.xml | 4 +-
.../java/org/apache/tika/TestCorruptedFiles.java | 272 +++++++++++++++++++++
.../java/org/apache/tika/mime/TestMimeTypes.java | 7 +-
.../parser/microsoft/ooxml/OOXMLParserTest.java | 15 ++
14 files changed, 389 insertions(+), 62 deletions(-)
create mode 100644
tika-parsers/src/test/java/org/apache/tika/TestCorruptedFiles.java
--
To stop receiving notification emails like this one, please contact
['"[email protected]" <[email protected]>'].