Hi,

On Thu, Jun 2, 2011 at 5:00 PM, sgraessle <[email protected]> wrote:
> I went ahead and tried to piece together what I needed to do to test Tika
> with the code provided above.
> [...]
> All I really need is to use a ImageParser that will save the embedded images
> to some arbitrary directory in addition to parsing the files... is there
> some other package that I should use to perform this extraction before I
> parse the files with Tika?

It looks like you're down a much more complicated path than you'd need to be.

As Maxim noted, see the TikaCLI class and the
FileEmbeddedDocumentExtractor one inside it for an example of how the
"--extract" option of the CLI works under the hood. That should be
pretty much similar to what you're trying to achieve. No need to
implement your own parser classes, etc.

BR,

Jukka Zitting

Reply via email to