Tika unable to extract PDF Text

Adam Retter Wed, 14 Oct 2015 09:14:23 -0700

I have a PDF which was created using Apache PDF Box 2.0.0-SNAPSHOT.
Unfortunately Tika 1.10 seems unable to extract any text from the PDF,
I don't get any exceptions or errors. The code is as simple as:


new Tika().parseToString(new FileInputStream(f))

Tika is always returning just the empty string.

The PDF is available here - http://static.adamretter.org.uk/adam-1.pdf

Any ideas?

-- 
Adam Retter

skype: adam.retter
tweet: adamretter
http://www.adamretter.org.uk

Tika unable to extract PDF Text

Reply via email to