What happens when you try to do the same with tika-app from the command
line?
Tilman
Am 22.07.2020 um 17:06 schrieb Shaojun Ni:
Hi There,
We try to use Tika docker image 1.24.1 to extract images from various
documents. After testing several files, we found following issues:
1. We can extract images out of pdf.
The result comes back from Tika is:
status code=204, reason=No Content
1. We can’t extract images out of Microsoft office document when the file
size is over 1Mb.
The result comes back from Tika is:
status code=422, reason=Unprocessable Entity
Do you have any idea what went wrong? Thanks.
Shaojun Ni