Tim Allison created TIKA-4207:
---------------------------------
Summary: PipesParser should have option to extract raw bytes of embedded files
Key: TIKA-4207
URL: https://issues.apache.org/jira/browse/TIKA-4207
Project: Tika
Issue Type: New Feature
Reporter: Tim Allison
There are many use cases where text and metadata are important, but users
also need the raw bytes from embedded files.
Let's make it possible to extract the usual rmeta content _and_ the raw
bytes. This is a preliminary step that will offer more customization options
than the proposal in TIKA-3703.
This is targeted to 3.x.