Tim Allison created TIKA-4533:
---------------------------------
Summary: DigestingParser needs to write out embedded containers
for digesting
Key: TIKA-4533
URL: https://issues.apache.org/jira/browse/TIKA-4533
Project: Tika
Issue Type: Improvement
Reporter: Tim Allison
If there's an embedded file in an Office document, we sometimes pass it around
as an openContainer in the TikaInputStream. The digester does not currently
translate that container back to bytes for digesting.
We need to apply a StreamTranslator and digest its output.
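
A rough sketch of the idea, using only the JDK. The names here
(ContainerDigestSketch, ContainerTranslator, TOY_TRANSLATOR, digestHex) are
hypothetical and are not Tika's API; the toy translator simply treats a String
as the "open container". The point is only the control flow: if an open
container is present and translatable, digest the translated bytes instead of
the raw stream.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;

public class ContainerDigestSketch {

    /** Hypothetical stand-in for a translator that turns an open container back into bytes. */
    interface ContainerTranslator {
        boolean shouldTranslate(Object openContainer);

        InputStream translate(Object openContainer) throws IOException;
    }

    /** Toy translator (assumption for illustration): a String container is serialized as UTF-8. */
    static final ContainerTranslator TOY_TRANSLATOR = new ContainerTranslator() {
        @Override
        public boolean shouldTranslate(Object openContainer) {
            return openContainer instanceof String;
        }

        @Override
        public InputStream translate(Object openContainer) {
            byte[] bytes = ((String) openContainer).getBytes(StandardCharsets.UTF_8);
            return new ByteArrayInputStream(bytes);
        }
    };

    /**
     * Digests the translated container bytes when a translatable container is present;
     * otherwise falls back to digesting the raw stream.
     */
    static String digestHex(InputStream raw, Object openContainer,
                            ContainerTranslator translator, String algorithm) {
        try {
            InputStream toDigest = raw;
            if (openContainer != null && translator.shouldTranslate(openContainer)) {
                toDigest = translator.translate(openContainer);
            }
            MessageDigest md = MessageDigest.getInstance(algorithm);
            byte[] buffer = new byte[8192];
            int n;
            while ((n = toDigest.read(buffer)) != -1) {
                md.update(buffer, 0, n);
            }
            StringBuilder hex = new StringBuilder();
            for (byte b : md.digest()) {
                hex.append(String.format("%02x", b));
            }
            return hex.toString();
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        } catch (NoSuchAlgorithmException e) {
            throw new IllegalArgumentException(e);
        }
    }
}
```

In this sketch, an empty raw stream with no container yields the digest of
zero bytes, while the same empty stream accompanied by a container yields the
digest of the container's serialized form, which is the behavior the issue
asks for.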
--
This message was sent by Atlassian Jira
(v8.20.10#820010)