[jira] [Updated] (TIKA-3103) Tesseract fails to respect timeouts and clean up after itself

2020-05-20 Thread Radim Rehurek (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Radim Rehurek updated TIKA-3103: Description: We're using the Tika Server with OCR: _java -jar /opt/tika/tika-server-1.24.1.jar -p

[jira] [Comment Edited] (TIKA-3103) Tesseract fails to respect timeouts and clean up after itself

2020-05-20 Thread Radim Rehurek (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17112356#comment-17112356 ] Radim Rehurek edited comment on TIKA-3103 at 5/20/20, 3:44 PM: --- I take it

[jira] [Comment Edited] (TIKA-3103) Tesseract fails to respect timeouts and clean up after itself

2020-05-20 Thread Radim Rehurek (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17112356#comment-17112356 ] Radim Rehurek edited comment on TIKA-3103 at 5/20/20, 3:37 PM: --- I take it

[jira] [Commented] (TIKA-3103) Tesseract fails to respect timeouts and clean up after itself

2020-05-20 Thread Radim Rehurek (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17112161#comment-17112161 ] Radim Rehurek commented on TIKA-3103: - I confirm reducing the `timeout` Tesseract parameter to 30

[jira] [Commented] (TIKA-3103) Tesseract fails to respect timeouts and clean up after itself

2020-05-20 Thread Radim Rehurek (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17112301#comment-17112301 ] Radim Rehurek commented on TIKA-3103: - FYI, in case anyone hits this in the future: setting the

[jira] [Commented] (TIKA-3103) Tesseract fails to respect timeouts and clean up after itself

2020-05-20 Thread Radim Rehurek (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17112356#comment-17112356 ] Radim Rehurek commented on TIKA-3103: - I take it back. There are still Tesseract processes that have

[jira] [Updated] (TIKA-3103) Tesseract fails to respect timeouts and clean up after itself

2020-05-20 Thread Radim Rehurek (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Radim Rehurek updated TIKA-3103: Attachment: Screen Shot 2020-05-20 at 17.30.04.png > Tesseract fails to respect timeouts and clean

[jira] [Updated] (TIKA-3103) Tesseract fails to respect timeouts and clean up after itself

2020-05-20 Thread Radim Rehurek (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Radim Rehurek updated TIKA-3103: Attachment: Screen Shot 2020-05-20 at 17.28.45.png > Tesseract fails to respect timeouts and clean

[jira] [Commented] (TIKA-3103) Tesseract fails to respect timeouts and clean up after itself

2020-05-20 Thread Radim Rehurek (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17112157#comment-17112157 ] Radim Rehurek commented on TIKA-3103: - Thanks for the quick response Tim. > 

[jira] [Comment Edited] (TIKA-3103) Tesseract fails to respect timeouts and clean up after itself

2020-05-20 Thread Radim Rehurek (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17112356#comment-17112356 ] Radim Rehurek edited comment on TIKA-3103 at 5/20/20, 3:40 PM: --- I take it

[jira] [Comment Edited] (TIKA-3103) Tesseract fails to respect timeouts and clean up after itself

2020-05-20 Thread Radim Rehurek (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17112356#comment-17112356 ] Radim Rehurek edited comment on TIKA-3103 at 5/20/20, 3:45 PM: --- I take it

[jira] [Comment Edited] (TIKA-3103) Tesseract fails to respect timeouts and clean up after itself

2020-05-20 Thread Radim Rehurek (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17112301#comment-17112301 ] Radim Rehurek edited comment on TIKA-3103 at 5/20/20, 2:47 PM: --- FYI, in case

[jira] [Comment Edited] (TIKA-3103) Tesseract fails to respect timeouts and clean up after itself

2020-05-20 Thread Radim Rehurek (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17112301#comment-17112301 ] Radim Rehurek edited comment on TIKA-3103 at 5/20/20, 2:48 PM: --- FYI, in case

[jira] [Comment Edited] (TIKA-3103) Tesseract fails to respect timeouts and clean up after itself

2020-05-20 Thread Radim Rehurek (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17112356#comment-17112356 ] Radim Rehurek edited comment on TIKA-3103 at 5/20/20, 3:34 PM: --- I take it

[jira] [Comment Edited] (TIKA-3103) Tesseract fails to respect timeouts and clean up after itself

2020-05-20 Thread Radim Rehurek (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17112157#comment-17112157 ] Radim Rehurek edited comment on TIKA-3103 at 5/20/20, 12:55 PM: Thanks for

[jira] [Comment Edited] (TIKA-3103) Tesseract fails to respect timeouts and clean up after itself

2020-05-20 Thread Radim Rehurek (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17112161#comment-17112161 ] Radim Rehurek edited comment on TIKA-3103 at 5/20/20, 12:55 PM: I confirm

[jira] [Comment Edited] (TIKA-3103) Tesseract fails to respect timeouts and clean up after itself

2020-05-20 Thread Radim Rehurek (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17112356#comment-17112356 ] Radim Rehurek edited comment on TIKA-3103 at 5/20/20, 3:32 PM: --- I take it

[jira] [Comment Edited] (TIKA-3103) Tesseract fails to respect timeouts and clean up after itself

2020-05-20 Thread Radim Rehurek (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17112356#comment-17112356 ] Radim Rehurek edited comment on TIKA-3103 at 5/20/20, 3:33 PM: --- I take it

[jira] [Updated] (TIKA-3103) Tesseract fails to respect timeouts and clean up after itself

2020-05-20 Thread Radim Rehurek (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Radim Rehurek updated TIKA-3103: Attachment: (was: Screen Shot 2020-05-20 at 17.30.04.png) > Tesseract fails to respect timeouts

[jira] [Updated] (TIKA-3103) Tesseract fails to respect timeouts and clean up after itself

2020-05-20 Thread Radim Rehurek (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Radim Rehurek updated TIKA-3103: Attachment: (was: Screen Shot 2020-05-20 at 17.28.45.png) > Tesseract fails to respect timeouts

[jira] [Updated] (TIKA-3103) Tesseract fails to respect timeouts and clean up after itself

2020-05-20 Thread Radim Rehurek (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Radim Rehurek updated TIKA-3103: Description: We're using the Tika Server with OCR: _java -jar /opt/tika/tika-server-1.24.1.jar -p

[jira] [Comment Edited] (TIKA-3103) Tesseract fails to respect timeouts and clean up after itself

2020-05-20 Thread Radim Rehurek (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17112161#comment-17112161 ] Radim Rehurek edited comment on TIKA-3103 at 5/20/20, 1:11 PM: --- I confirm

[jira] [Comment Edited] (TIKA-3103) Tesseract fails to respect timeouts and clean up after itself

2020-05-20 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17112635#comment-17112635 ] Tim Allison edited comment on TIKA-3103 at 5/20/20, 9:54 PM: - >Maybe it's

[jira] [Commented] (TIKA-3103) Tesseract fails to respect timeouts and clean up after itself

2020-05-20 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17112635#comment-17112635 ] Tim Allison commented on TIKA-3103: --- >Maybe it's something to do with concurrency in the Tika server

[jira] [Created] (TIKA-3103) Tesseract fails to respect timeouts and clean up after itself

2020-05-20 Thread Radim Rehurek (Jira)
Radim Rehurek created TIKA-3103: --- Summary: Tesseract fails to respect timeouts and clean up after itself Key: TIKA-3103 URL: https://issues.apache.org/jira/browse/TIKA-3103 Project: Tika

[jira] [Updated] (TIKA-3103) Tesseract fails to respect timeouts and clean up after itself

2020-05-20 Thread Radim Rehurek (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Radim Rehurek updated TIKA-3103: Description: We're using the Tika Server with OCR: _java -jar

[jira] [Updated] (TIKA-3103) Tesseract fails to respect timeouts and clean up after itself

2020-05-20 Thread Radim Rehurek (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Radim Rehurek updated TIKA-3103: Description: We're using the Tika Server with OCR: _java -jar /opt/tika/tika-server-1.24.1.jar -p

[jira] [Updated] (TIKA-3103) Tesseract fails to respect timeouts and clean up after itself

2020-05-20 Thread Radim Rehurek (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Radim Rehurek updated TIKA-3103: Description: We're using the Tika Server with OCR: _java -jar

[jira] [Updated] (TIKA-3103) Tesseract fails to respect timeouts and clean up after itself

2020-05-20 Thread Radim Rehurek (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Radim Rehurek updated TIKA-3103: Description: We're using the Tika Server with OCR: _java -jar /opt/tika/tika-server-1.24.1.jar -p

[jira] [Updated] (TIKA-3103) Tesseract fails to respect timeouts and clean up after itself

2020-05-20 Thread Radim Rehurek (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Radim Rehurek updated TIKA-3103: Affects Version/s: 1.22 > Tesseract fails to respect timeouts and clean up after itself >

[jira] [Updated] (TIKA-3103) Tesseract fails to respect timeouts and clean up after itself

2020-05-20 Thread Radim Rehurek (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Radim Rehurek updated TIKA-3103: Affects Version/s: (was: 1.22) > Tesseract fails to respect timeouts and clean up after itself

[jira] [Updated] (TIKA-3103) Tesseract fails to respect timeouts and clean up after itself

2020-05-20 Thread Radim Rehurek (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Radim Rehurek updated TIKA-3103: Description: We're using the Tika Server with OCR: _java -jar /opt/tika/tika-server-1.24.1.jar -p

[jira] [Commented] (TIKA-3103) Tesseract fails to respect timeouts and clean up after itself

2020-05-20 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17112067#comment-17112067 ] Tim Allison commented on TIKA-3103: --- In reverse order, {{apache-tika-12291680472524021463.tmp}} look

[jira] [Comment Edited] (TIKA-3103) Tesseract fails to respect timeouts and clean up after itself

2020-05-20 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17112067#comment-17112067 ] Tim Allison edited comment on TIKA-3103 at 5/20/20, 11:13 AM: -- In reverse

[jira] [Commented] (TIKA-3103) Tesseract fails to respect timeouts and clean up after itself

2020-05-20 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17112071#comment-17112071 ] Tim Allison commented on TIKA-3103: --- When you see that tesseract is orphaned, did the child process