[
https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17112635#comment-17112635
]
Tim Allison edited comment on TIKA-3103 at 5/20/20, 9:54 PM:
-
>Maybe it's
[
https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17112635#comment-17112635
]
Tim Allison commented on TIKA-3103:
---
>Maybe it's something to do with concurrency in the Tika server
[
https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17112356#comment-17112356
]
Radim Rehurek edited comment on TIKA-3103 at 5/20/20, 3:45 PM:
---
I take it
[
https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17112356#comment-17112356
]
Radim Rehurek edited comment on TIKA-3103 at 5/20/20, 3:44 PM:
---
I take it
[
https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17112356#comment-17112356
]
Radim Rehurek edited comment on TIKA-3103 at 5/20/20, 3:44 PM:
---
I take it
[
https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17112356#comment-17112356
]
Radim Rehurek edited comment on TIKA-3103 at 5/20/20, 3:40 PM:
---
I take it
[
https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17112356#comment-17112356
]
Radim Rehurek edited comment on TIKA-3103 at 5/20/20, 3:37 PM:
---
I take it
[
https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17112356#comment-17112356
]
Radim Rehurek edited comment on TIKA-3103 at 5/20/20, 3:37 PM:
---
I take it
[
https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17112356#comment-17112356
]
Radim Rehurek edited comment on TIKA-3103 at 5/20/20, 3:34 PM:
---
I take it
[
https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17112356#comment-17112356
]
Radim Rehurek edited comment on TIKA-3103 at 5/20/20, 3:33 PM:
---
I take it
[
https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17112356#comment-17112356
]
Radim Rehurek edited comment on TIKA-3103 at 5/20/20, 3:33 PM:
---
I take it
[
https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17112356#comment-17112356
]
Radim Rehurek edited comment on TIKA-3103 at 5/20/20, 3:32 PM:
---
I take it
[
https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17112356#comment-17112356
]
Radim Rehurek edited comment on TIKA-3103 at 5/20/20, 3:32 PM:
---
I take it
[
https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Radim Rehurek updated TIKA-3103:
Attachment: (was: Screen Shot 2020-05-20 at 17.30.04.png)
> Tesseract fails to respect timeouts
[
https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Radim Rehurek updated TIKA-3103:
Attachment: (was: Screen Shot 2020-05-20 at 17.28.45.png)
> Tesseract fails to respect timeouts
[
https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Radim Rehurek updated TIKA-3103:
Attachment: Screen Shot 2020-05-20 at 17.30.04.png
> Tesseract fails to respect timeouts and clean
[
https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17112356#comment-17112356
]
Radim Rehurek commented on TIKA-3103:
-
I take it back. There are still Tesseract processes that have
[
https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Radim Rehurek updated TIKA-3103:
Attachment: Screen Shot 2020-05-20 at 17.28.45.png
> Tesseract fails to respect timeouts and clean
[
https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17112301#comment-17112301
]
Radim Rehurek edited comment on TIKA-3103 at 5/20/20, 2:48 PM:
---
FYI, in case
[
https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17112301#comment-17112301
]
Radim Rehurek edited comment on TIKA-3103 at 5/20/20, 2:47 PM:
---
FYI, in case
[
https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17112301#comment-17112301
]
Radim Rehurek commented on TIKA-3103:
-
FYI, in case anyone hits this in the future: setting the
[
https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17112161#comment-17112161
]
Radim Rehurek edited comment on TIKA-3103 at 5/20/20, 1:11 PM:
---
I confirm
[
https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Radim Rehurek updated TIKA-3103:
Description:
We're using the Tika Server with OCR:
_java -jar /opt/tika/tika-server-1.24.1.jar -p
[
https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Radim Rehurek updated TIKA-3103:
Description:
We're using the Tika Server with OCR:
_java -jar /opt/tika/tika-server-1.24.1.jar -p
[
https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17112161#comment-17112161
]
Radim Rehurek edited comment on TIKA-3103 at 5/20/20, 12:55 PM:
I confirm
[
https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17112157#comment-17112157
]
Radim Rehurek edited comment on TIKA-3103 at 5/20/20, 12:55 PM:
Thanks for
[
https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17112161#comment-17112161
]
Radim Rehurek commented on TIKA-3103:
-
I confirm reducing the `timeout` Tesseract parameter to 30
[
https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17112157#comment-17112157
]
Radim Rehurek commented on TIKA-3103:
-
Thanks for the quick response Tim.
>
[
https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17112071#comment-17112071
]
Tim Allison commented on TIKA-3103:
---
When you see that tesseract is orphaned, did the child process
[
https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17112067#comment-17112067
]
Tim Allison edited comment on TIKA-3103 at 5/20/20, 11:13 AM:
--
In reverse
[
https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17112067#comment-17112067
]
Tim Allison commented on TIKA-3103:
---
In reverse order,
{{apache-tika-12291680472524021463.tmp}} look
[
https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Radim Rehurek updated TIKA-3103:
Description:
We're using the Tika Server with OCR:
_java -jar /opt/tika/tika-server-1.24.1.jar -p
[
https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Radim Rehurek updated TIKA-3103:
Description:
We're using the Tika Server with OCR:
_java -jar /opt/tika/tika-server-1.24.1.jar -p
[
https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Radim Rehurek updated TIKA-3103:
Affects Version/s: (was: 1.22)
> Tesseract fails to respect timeouts and clean up after itself
[
https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Radim Rehurek updated TIKA-3103:
Description:
We're using the Tika Server with OCR:
_java -jar /opt/tika/tika-server-1.24.1.jar -p
[
https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Radim Rehurek updated TIKA-3103:
Affects Version/s: 1.22
> Tesseract fails to respect timeouts and clean up after itself
>
[
https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Radim Rehurek updated TIKA-3103:
Description:
We're using the Tika Server with OCR:
_java -jar /opt/tika/tika-server-1.24.1.jar -p
[
https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Radim Rehurek updated TIKA-3103:
Description:
We're using the Tika Server with OCR:
_java -jar
[
https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Radim Rehurek updated TIKA-3103:
Description:
We're using the Tika Server with OCR:
_java -jar
[
https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Radim Rehurek updated TIKA-3103:
Description:
We're using the Tika Server with OCR:
_java -jar
[
https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Radim Rehurek updated TIKA-3103:
Description:
We're using the Tika Server with OCR:
_java -jar
[
https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Radim Rehurek updated TIKA-3103:
Description:
We're using the Tika Server with OCR:
_java -jar
Radim Rehurek created TIKA-3103:
---
Summary: Tesseract fails to respect timeouts and clean up after
itself
Key: TIKA-3103
URL: https://issues.apache.org/jira/browse/TIKA-3103
Project: Tika
43 matches
Mail list logo