[jira] [Comment Edited] (TIKA-1020) Excel 2010 parser missing cell values are not reported resulting in missing columns values

2018-03-07 Thread Radim Rehurek (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1020?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16389884#comment-16389884 ] Radim Rehurek edited comment on TIKA-1020 at 3/7/18 5:57 PM: - We just hit this

[jira] [Comment Edited] (TIKA-1020) Excel 2010 parser missing cell values are not reported resulting in missing columns values

2018-03-07 Thread Radim Rehurek (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1020?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16389884#comment-16389884 ] Radim Rehurek edited comment on TIKA-1020 at 3/7/18 5:57 PM: - We just hit this

[jira] [Comment Edited] (TIKA-1020) Excel 2010 parser missing cell values are not reported resulting in missing columns values

2018-03-07 Thread Radim Rehurek (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1020?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16389884#comment-16389884 ] Radim Rehurek edited comment on TIKA-1020 at 3/7/18 5:56 PM: - We just hit this

[jira] [Commented] (TIKA-1020) Excel 2010 parser missing cell values are not reported resulting in missing columns values

2018-03-07 Thread Radim Rehurek (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1020?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16389884#comment-16389884 ] Radim Rehurek commented on TIKA-1020: - We just hit this bug too. I say "bug" because Excel

[jira] [Updated] (TIKA-3103) Tesseract fails to respect timeouts and clean up after itself

2020-05-20 Thread Radim Rehurek (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Radim Rehurek updated TIKA-3103: Description: We're using the Tika Server with OCR: _java -jar /opt/tika/tika-server-1.24.1.jar -p

[jira] [Comment Edited] (TIKA-3103) Tesseract fails to respect timeouts and clean up after itself

2020-05-20 Thread Radim Rehurek (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17112356#comment-17112356 ] Radim Rehurek edited comment on TIKA-3103 at 5/20/20, 3:44 PM: --- I take it

[jira] [Comment Edited] (TIKA-3103) Tesseract fails to respect timeouts and clean up after itself

2020-05-20 Thread Radim Rehurek (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17112356#comment-17112356 ] Radim Rehurek edited comment on TIKA-3103 at 5/20/20, 3:44 PM: --- I take it

[jira] [Comment Edited] (TIKA-3103) Tesseract fails to respect timeouts and clean up after itself

2020-05-20 Thread Radim Rehurek (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17112356#comment-17112356 ] Radim Rehurek edited comment on TIKA-3103 at 5/20/20, 3:37 PM: --- I take it

[jira] [Comment Edited] (TIKA-3103) Tesseract fails to respect timeouts and clean up after itself

2020-05-20 Thread Radim Rehurek (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17112356#comment-17112356 ] Radim Rehurek edited comment on TIKA-3103 at 5/20/20, 3:37 PM: --- I take it

[jira] [Commented] (TIKA-3103) Tesseract fails to respect timeouts and clean up after itself

2020-05-20 Thread Radim Rehurek (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17112161#comment-17112161 ] Radim Rehurek commented on TIKA-3103: - I confirm reducing the `timeout` Tesseract parameter to 30

[jira] [Commented] (TIKA-3103) Tesseract fails to respect timeouts and clean up after itself

2020-05-20 Thread Radim Rehurek (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17112301#comment-17112301 ] Radim Rehurek commented on TIKA-3103: - FYI, in case anyone hits this in the future: setting the

[jira] [Commented] (TIKA-3103) Tesseract fails to respect timeouts and clean up after itself

2020-05-20 Thread Radim Rehurek (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17112356#comment-17112356 ] Radim Rehurek commented on TIKA-3103: - I take it back. There are still Tesseract processes that have

[jira] [Updated] (TIKA-3103) Tesseract fails to respect timeouts and clean up after itself

2020-05-20 Thread Radim Rehurek (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Radim Rehurek updated TIKA-3103: Attachment: Screen Shot 2020-05-20 at 17.30.04.png > Tesseract fails to respect timeouts and clean

[jira] [Updated] (TIKA-3103) Tesseract fails to respect timeouts and clean up after itself

2020-05-20 Thread Radim Rehurek (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Radim Rehurek updated TIKA-3103: Attachment: Screen Shot 2020-05-20 at 17.28.45.png > Tesseract fails to respect timeouts and clean

[jira] [Commented] (TIKA-3103) Tesseract fails to respect timeouts and clean up after itself

2020-05-20 Thread Radim Rehurek (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17112157#comment-17112157 ] Radim Rehurek commented on TIKA-3103: - Thanks for the quick response Tim. > 

[jira] [Comment Edited] (TIKA-3103) Tesseract fails to respect timeouts and clean up after itself

2020-05-20 Thread Radim Rehurek (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17112356#comment-17112356 ] Radim Rehurek edited comment on TIKA-3103 at 5/20/20, 3:40 PM: --- I take it

[jira] [Comment Edited] (TIKA-3103) Tesseract fails to respect timeouts and clean up after itself

2020-05-20 Thread Radim Rehurek (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17112356#comment-17112356 ] Radim Rehurek edited comment on TIKA-3103 at 5/20/20, 3:45 PM: --- I take it

[jira] [Comment Edited] (TIKA-3103) Tesseract fails to respect timeouts and clean up after itself

2020-05-20 Thread Radim Rehurek (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17112301#comment-17112301 ] Radim Rehurek edited comment on TIKA-3103 at 5/20/20, 2:47 PM: --- FYI, in case

[jira] [Comment Edited] (TIKA-3103) Tesseract fails to respect timeouts and clean up after itself

2020-05-20 Thread Radim Rehurek (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17112301#comment-17112301 ] Radim Rehurek edited comment on TIKA-3103 at 5/20/20, 2:48 PM: --- FYI, in case

[jira] [Comment Edited] (TIKA-3103) Tesseract fails to respect timeouts and clean up after itself

2020-05-20 Thread Radim Rehurek (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17112356#comment-17112356 ] Radim Rehurek edited comment on TIKA-3103 at 5/20/20, 3:34 PM: --- I take it

[jira] [Comment Edited] (TIKA-3103) Tesseract fails to respect timeouts and clean up after itself

2020-05-20 Thread Radim Rehurek (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17112157#comment-17112157 ] Radim Rehurek edited comment on TIKA-3103 at 5/20/20, 12:55 PM: Thanks for

[jira] [Comment Edited] (TIKA-3103) Tesseract fails to respect timeouts and clean up after itself

2020-05-20 Thread Radim Rehurek (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17112161#comment-17112161 ] Radim Rehurek edited comment on TIKA-3103 at 5/20/20, 12:55 PM: I confirm

[jira] [Comment Edited] (TIKA-3103) Tesseract fails to respect timeouts and clean up after itself

2020-05-20 Thread Radim Rehurek (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17112356#comment-17112356 ] Radim Rehurek edited comment on TIKA-3103 at 5/20/20, 3:32 PM: --- I take it

[jira] [Comment Edited] (TIKA-3103) Tesseract fails to respect timeouts and clean up after itself

2020-05-20 Thread Radim Rehurek (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17112356#comment-17112356 ] Radim Rehurek edited comment on TIKA-3103 at 5/20/20, 3:32 PM: --- I take it

[jira] [Comment Edited] (TIKA-3103) Tesseract fails to respect timeouts and clean up after itself

2020-05-20 Thread Radim Rehurek (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17112356#comment-17112356 ] Radim Rehurek edited comment on TIKA-3103 at 5/20/20, 3:33 PM: --- I take it

[jira] [Comment Edited] (TIKA-3103) Tesseract fails to respect timeouts and clean up after itself

2020-05-20 Thread Radim Rehurek (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17112356#comment-17112356 ] Radim Rehurek edited comment on TIKA-3103 at 5/20/20, 3:33 PM: --- I take it

[jira] [Updated] (TIKA-3103) Tesseract fails to respect timeouts and clean up after itself

2020-05-20 Thread Radim Rehurek (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Radim Rehurek updated TIKA-3103: Attachment: (was: Screen Shot 2020-05-20 at 17.30.04.png) > Tesseract fails to respect timeouts

[jira] [Updated] (TIKA-3103) Tesseract fails to respect timeouts and clean up after itself

2020-05-20 Thread Radim Rehurek (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Radim Rehurek updated TIKA-3103: Attachment: (was: Screen Shot 2020-05-20 at 17.28.45.png) > Tesseract fails to respect timeouts

[jira] [Updated] (TIKA-3103) Tesseract fails to respect timeouts and clean up after itself

2020-05-20 Thread Radim Rehurek (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Radim Rehurek updated TIKA-3103: Description: We're using the Tika Server with OCR: _java -jar /opt/tika/tika-server-1.24.1.jar -p

[jira] [Comment Edited] (TIKA-3103) Tesseract fails to respect timeouts and clean up after itself

2020-05-20 Thread Radim Rehurek (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17112161#comment-17112161 ] Radim Rehurek edited comment on TIKA-3103 at 5/20/20, 1:11 PM: --- I confirm

[jira] [Comment Edited] (TIKA-3103) Tesseract fails to respect timeouts and clean up after itself

2020-05-28 Thread Radim Rehurek (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17118916#comment-17118916 ] Radim Rehurek edited comment on TIKA-3103 at 5/28/20, 5:21 PM: --- We do use

[jira] [Commented] (TIKA-3103) Tesseract fails to respect timeouts and clean up after itself

2020-05-28 Thread Radim Rehurek (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17118916#comment-17118916 ] Radim Rehurek commented on TIKA-3103: - We do use Linux, so your option 1) would work. We ended up

[jira] [Comment Edited] (TIKA-3103) Tesseract fails to respect timeouts and clean up after itself

2020-05-28 Thread Radim Rehurek (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17118916#comment-17118916 ] Radim Rehurek edited comment on TIKA-3103 at 5/28/20, 5:21 PM: --- We do use

[jira] [Created] (TIKA-3103) Tesseract fails to respect timeouts and clean up after itself

2020-05-20 Thread Radim Rehurek (Jira)
Radim Rehurek created TIKA-3103: --- Summary: Tesseract fails to respect timeouts and clean up after itself Key: TIKA-3103 URL: https://issues.apache.org/jira/browse/TIKA-3103 Project: Tika

[jira] [Updated] (TIKA-3103) Tesseract fails to respect timeouts and clean up after itself

2020-05-20 Thread Radim Rehurek (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Radim Rehurek updated TIKA-3103: Description: We're using the Tika Server with OCR: _java -jar

[jira] [Updated] (TIKA-3103) Tesseract fails to respect timeouts and clean up after itself

2020-05-20 Thread Radim Rehurek (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Radim Rehurek updated TIKA-3103: Description: We're using the Tika Server with OCR: _java -jar

[jira] [Updated] (TIKA-3103) Tesseract fails to respect timeouts and clean up after itself

2020-05-20 Thread Radim Rehurek (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Radim Rehurek updated TIKA-3103: Description: We're using the Tika Server with OCR: _java -jar

[jira] [Updated] (TIKA-3103) Tesseract fails to respect timeouts and clean up after itself

2020-05-20 Thread Radim Rehurek (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Radim Rehurek updated TIKA-3103: Description: We're using the Tika Server with OCR: _java -jar /opt/tika/tika-server-1.24.1.jar -p

[jira] [Updated] (TIKA-3103) Tesseract fails to respect timeouts and clean up after itself

2020-05-20 Thread Radim Rehurek (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Radim Rehurek updated TIKA-3103: Description: We're using the Tika Server with OCR: _java -jar /opt/tika/tika-server-1.24.1.jar -p

[jira] [Updated] (TIKA-3103) Tesseract fails to respect timeouts and clean up after itself

2020-05-20 Thread Radim Rehurek (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Radim Rehurek updated TIKA-3103: Description: We're using the Tika Server with OCR: _java -jar

[jira] [Updated] (TIKA-3103) Tesseract fails to respect timeouts and clean up after itself

2020-05-20 Thread Radim Rehurek (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Radim Rehurek updated TIKA-3103: Description: We're using the Tika Server with OCR: _java -jar

[jira] [Updated] (TIKA-3103) Tesseract fails to respect timeouts and clean up after itself

2020-05-20 Thread Radim Rehurek (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Radim Rehurek updated TIKA-3103: Description: We're using the Tika Server with OCR: _java -jar /opt/tika/tika-server-1.24.1.jar -p

[jira] [Updated] (TIKA-3103) Tesseract fails to respect timeouts and clean up after itself

2020-05-20 Thread Radim Rehurek (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Radim Rehurek updated TIKA-3103: Affects Version/s: 1.22 > Tesseract fails to respect timeouts and clean up after itself >

[jira] [Updated] (TIKA-3103) Tesseract fails to respect timeouts and clean up after itself

2020-05-20 Thread Radim Rehurek (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Radim Rehurek updated TIKA-3103: Affects Version/s: (was: 1.22) > Tesseract fails to respect timeouts and clean up after itself

[jira] [Updated] (TIKA-3103) Tesseract fails to respect timeouts and clean up after itself

2020-05-20 Thread Radim Rehurek (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Radim Rehurek updated TIKA-3103: Description: We're using the Tika Server with OCR: _java -jar /opt/tika/tika-server-1.24.1.jar -p