Hello Tika list,
I try to extract the content from a pdf file. Running the code on my local
system (from inside a java main() method) works fine - i.e the content of the
pdf file is extracted as expected.
But executing the same code at a servlet only extracts an empty string. There's
also no exception or error message logged. But the uploaded file is recognized
as: Content-Type=application/pdf
(Also note that all other file types like epub, txt, html, jar work fine on the
servlet)
Do you have any ideas what could cause tika to extract only an empty string if
executed on a servlet? Or any ideas how to fix it? Maybe there's anything
special that must be considered if tika is used at a servlet?
Thanks in advance for any hints!