This is an automated email from the ASF dual-hosted git repository.
tballison pushed a change to branch TIKA-4728-js-in-pdf
in repository https://gitbox.apache.org/repos/asf/tika.git
from 155d2f6806 TIKA-4728 - add strict validation as an option
add 32d5f9e01d TIKA-4728 - further tag fixes
No new revisions were added by this update.
Summary of changes:
.../microsoft/ooxml/AbstractOOXMLExtractor.java | 12 ++++
.../microsoft/ooxml/OOXMLTikaBodyPartHandler.java | 38 +++++++++++
.../ooxml/OOXMLWordAndPowerPointTextHandler.java | 7 ++
.../ooxml/SXSLFPowerPointExtractorDecorator.java | 16 ++++-
.../ooxml/SXWPFWordExtractorDecorator.java | 6 ++
.../ooxml/XSSFExcelExtractorDecorator.java | 36 ++++++++++
.../org/apache/tika/parser/epub/EpubParser.java | 55 +++++++++++++---
.../tika/parser/odf/OpenDocumentBodyHandler.java | 77 ++++++++++++++++++++++
.../apache/tika/parser/pdf/AbstractPDF2XHTML.java | 4 +-
9 files changed, 239 insertions(+), 12 deletions(-)