+1 to Nick's links and advice.
To use the RecursiveParserWrapper with tika-app, use the -J option; or if
you're using tika-server, use the /rmeta endpoint.
The ecology of embedded docs is rich and understudied (IMHO), let us know what
you find!
Cheers,
Tim
-Original Mes
Thanks for the information!
Much appreciated!
Anthony
-Original Message-
From: Nick Burch [mailto:apa...@gagravarr.org]
Sent: 27 March 2018 15:50
To: user@tika.apache.org
Subject: Re: Subfile Extraction
On Sun, 25 Mar 2018, McGreevy, Anthony wrote:
> I am currently playing with Tika to
On Sun, 25 Mar 2018, McGreevy, Anthony wrote:
I am currently playing with Tika to see how it works with regards to
extraction of subfiles.
Do you mean files or resources embedded within another file?
If so... With the Tika App, you want -z to have these extracted. With the
Tika java classes,