[
https://issues.apache.org/jira/browse/TIKA-3683?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
dataminer.accolade updated TIKA-3683:
-------------------------------------
Description:
I created a custom Docker image using the latest Tesseract release. I came
across the tika
[Dockerfile|https://github.com/apache/tika-docker/blob/master/full/Dockerfile]
file which installs the following dependencies:
xfonts-utils
fonts-freefont-ttf
fonts-liberation
ttf-mscorefonts-installer
cabextract
I have not found any documetation yet about those dependencies in
[https://cwiki.apache.org/confluence/display/tika] and
[https://github.com/apache/tika]. I can only guess that those dependencies
might impact PDF content handling.
was:
I created a custom Docker image using the latest Tesseract version. I came
across the tika
[Dockerfile|https://github.com/apache/tika-docker/blob/master/full/Dockerfile]
file which installs the following dependencies:
xfonts-utils
fonts-freefont-ttf
fonts-liberation
ttf-mscorefonts-installer
cabextract
I have not found any documetation yet about those dependencies in
[https://cwiki.apache.org/confluence/display/tika] and
[https://github.com/apache/tika]. I can only guess that those dependencies
might impact PDF content handling.
> Documentation of native dependencies per module
> -----------------------------------------------
>
> Key: TIKA-3683
> URL: https://issues.apache.org/jira/browse/TIKA-3683
> Project: Tika
> Issue Type: Wish
> Components: tika-docker, tika-server
> Reporter: dataminer.accolade
> Priority: Minor
>
> I created a custom Docker image using the latest Tesseract release. I came
> across the tika
> [Dockerfile|https://github.com/apache/tika-docker/blob/master/full/Dockerfile]
> file which installs the following dependencies:
> xfonts-utils
> fonts-freefont-ttf
> fonts-liberation
> ttf-mscorefonts-installer
> cabextract
> I have not found any documetation yet about those dependencies in
> [https://cwiki.apache.org/confluence/display/tika] and
> [https://github.com/apache/tika]. I can only guess that those dependencies
> might impact PDF content handling.
--
This message was sent by Atlassian Jira
(v8.20.1#820001)