[jira] [Created] (TIKA-2832) Very slow large PDF text extraction

2019-02-26 Thread Slava G (JIRA)
Slava G created TIKA-2832: - Summary: Very slow large PDF text extraction Key: TIKA-2832 URL: https://issues.apache.org/jira/browse/TIKA-2832 Project: Tika Issue Type: Bug Components:

[jira] [Commented] (TIKA-2833) Add a CSV/TSV detector

2019-02-26 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2833?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16777934#comment-16777934 ] Tim Allison commented on TIKA-2833: --- Initial question is where to place this detector. It should only be

[jira] [Comment Edited] (TIKA-2832) Very slow large PDF text extraction

2019-02-26 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2832?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16777970#comment-16777970 ] Tim Allison edited comment on TIKA-2832 at 2/26/19 2:28 PM: Please ask this

[jira] [Created] (TIKA-2833) Add a CSV/TSV detector

2019-02-26 Thread Tim Allison (JIRA)
Tim Allison created TIKA-2833: - Summary: Add a CSV/TSV detector Key: TIKA-2833 URL: https://issues.apache.org/jira/browse/TIKA-2833 Project: Tika Issue Type: Task Reporter: Tim

[jira] [Commented] (TIKA-2832) Very slow large PDF text extraction

2019-02-26 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2832?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16777970#comment-16777970 ] Tim Allison commented on TIKA-2832: --- Do you have tesseract installed? > Very slow large PDF text

[jira] [Commented] (TIKA-2832) Very slow large PDF text extraction

2019-02-26 Thread Slava G (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2832?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16777982#comment-16777982 ] Slava G commented on TIKA-2832: --- Thanks, I'll, but such a slow parsing, isn't a bug ? For me it's looks like