[ 
https://issues.apache.org/jira/browse/TIKA-3683?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

dataminer.accolade updated TIKA-3683:
-------------------------------------
    Description: 
I created a custom Docker image using the latest Tesseract release. I came 
across the Tika 
[Dockerfile|https://github.com/apache/tika-docker/blob/master/full/Dockerfile], 
which installs the following dependencies:

xfonts-utils
fonts-freefont-ttf
fonts-liberation
ttf-mscorefonts-installer
cabextract

I have not yet found any documentation about these dependencies in 
[https://cwiki.apache.org/confluence/display/tika] or 
[https://github.com/apache/tika]. I can only guess that they 
might affect PDF content handling.

  was:
I created a custom Docker image using the latest Tesseract version. I came 
across the Tika 
[Dockerfile|https://github.com/apache/tika-docker/blob/master/full/Dockerfile], 
which installs the following dependencies:

xfonts-utils
fonts-freefont-ttf
fonts-liberation
ttf-mscorefonts-installer
cabextract

I have not yet found any documentation about these dependencies in 
[https://cwiki.apache.org/confluence/display/tika] or 
[https://github.com/apache/tika]. I can only guess that they 
might affect PDF content handling.


> Documentation of native dependencies per module
> -----------------------------------------------
>
>                 Key: TIKA-3683
>                 URL: https://issues.apache.org/jira/browse/TIKA-3683
>             Project: Tika
>          Issue Type: Wish
>          Components: tika-docker, tika-server
>            Reporter: dataminer.accolade
>            Priority: Minor
>
> I created a custom Docker image using the latest Tesseract release. I came 
> across the Tika 
> [Dockerfile|https://github.com/apache/tika-docker/blob/master/full/Dockerfile],
> which installs the following dependencies:
> xfonts-utils
> fonts-freefont-ttf
> fonts-liberation
> ttf-mscorefonts-installer
> cabextract
> I have not yet found any documentation about these dependencies in 
> [https://cwiki.apache.org/confluence/display/tika] or 
> [https://github.com/apache/tika]. I can only guess that they 
> might affect PDF content handling.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)
