This is an automated email from the ASF dual-hosted git repository.
tallison pushed a change to branch main
in repository https://gitbox.apache.org/repos/asf/tika.git.
from 1766166 TIKA-3322 -- upgrade PDFBox to 2.0.23
new 33a4f42 clean up dependencies
new 29ef4b5 TIKA-3332 -- recursively search embedded file tree for
attachments
The 2 revisions listed above as "new" are entirely new to this
repository and will be described in separate emails. The revisions
listed as "add" were already present in the repository and have only
been added to this reference.
Summary of changes:
CHANGES.txt | 3 ++
tika-parent/pom.xml | 4 +-
.../apache/tika/parser/pdf/AbstractPDF2XHTML.java | 54 +++++++++++++--------
.../org/apache/tika/parser/pdf/PDFParserTest.java | 9 ++++
.../testPDF_deeplyEmbeddedAttachments.pdf | Bin 0 -> 122221 bytes
5 files changed, 50 insertions(+), 20 deletions(-)
create mode 100644
tika-parsers/tika-parsers-classic/tika-parsers-classic-modules/tika-parser-pdf-module/src/test/resources/test-documents/testPDF_deeplyEmbeddedAttachments.pdf