On 7/15/22 12:01 PM, Tim Allison wrote:
If you curl the test file (GetStartedWithSmallpdf.pdf) against your
tika-server, what do you see? The test file works for me with 2.4.2-SNAPSHOT
at least. Are the files getting truncated somehow?
If you curl the test file (GetStartedWithSmallpdf.pdf) against your
tika-server, what do you see?
In the journal log, only this:
Jul 15 12:24:47 mx.loc tika[1143]: INFO [qtp1837533591-23]
12:24:47,978 org.apache.tika.server.core.resource.TikaResource /tika
(application/pdf)
And at the console, this:
https://pastebin.com/raw/Nu1RCbat
Are the files getting truncated somehow?
Perhaps? Since curl of the source file against tika works OK, as above, I'd
guess that whatever is feeding tika (namely dovecot's fts plugin) is the
likely candidate.
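
One rough way to test the truncation theory, assuming the bytes that actually reach tika can be captured to a file (how to capture them from dovecot is not covered here), is to check whether the capture still looks like a complete PDF. The sketch below is only a heuristic and the function names are made up for illustration: it looks for the `%PDF-` header near the start and the `%%EOF` trailer within the last 1024 bytes, which is where a complete PDF normally carries them.

```python
from pathlib import Path


def looks_truncated(data: bytes, window: int = 1024) -> bool:
    """Heuristic: True if the bytes do not look like a complete PDF.

    A complete PDF normally has a '%PDF-' header near the start and a
    '%%EOF' marker near the end. A file cut off in transit will usually
    be missing the trailer.
    """
    if b"%PDF-" not in data[:window]:
        return True
    return b"%%EOF" not in data[-window:]


def file_looks_truncated(path: str) -> bool:
    """Convenience wrapper (hypothetical name) that reads a file from disk."""
    return looks_truncated(Path(path).read_bytes())
```

If the original attachment passes this check and the captured copy does not, that would point at whatever sits between the mail store and tika rather than at tika itself.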