I have a PDF which was created using Apache PDFBox 2.0.0-SNAPSHOT. Unfortunately, Tika 1.10 seems unable to extract any text from the PDF, and I don't get any exceptions or errors. The code is as simple as:

    new Tika().parseToString(new FileInputStream(f))

Tika is always returning just the empty string. The PDF is available here: http://static.adamretter.org.uk/adam-1.pdf

Any ideas?

-- 
Adam Retter

skype: adam.retter
tweet: adamretter
http://www.adamretter.org.uk
