RE: Subfile Extraction

2018-03-27 Thread Allison, Timothy B.
+1 to Nick's links and advice. To use the RecursiveParserWrapper with tika-app, use the -J option; or if you're using tika-server, use the /rmeta endpoint. The ecology of embedded docs is rich and understudied (IMHO), let us know what you find! Cheers, Tim -Original Mes

RE: Subfile Extraction

2018-03-27 Thread McGreevy, Anthony
Thanks for the information! Much appreciated! Anthony -Original Message- From: Nick Burch [mailto:apa...@gagravarr.org] Sent: 27 March 2018 15:50 To: user@tika.apache.org Subject: Re: Subfile Extraction On Sun, 25 Mar 2018, McGreevy, Anthony wrote: > I am currently playing with Tika to

Re: Subfile Extraction

2018-03-27 Thread Nick Burch
On Sun, 25 Mar 2018, McGreevy, Anthony wrote: I am currently playing with Tika to see how it works with regards to extraction of subfiles. Do you mean files or resources embedded within another file? If so... With the Tika App, you want -z to have these extracted. With the Tika java classes,