Slava G created TIKA-2832:
-
Summary: Very slow large PDF text extraction
Key: TIKA-2832
URL: https://issues.apache.org/jira/browse/TIKA-2832
Project: Tika
Issue Type: Bug
Components:
[
https://issues.apache.org/jira/browse/TIKA-2833?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16777934#comment-16777934
]
Tim Allison commented on TIKA-2833:
---
Initial question is where to place this detector. It should only be
[
https://issues.apache.org/jira/browse/TIKA-2832?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16777970#comment-16777970
]
Tim Allison edited comment on TIKA-2832 at 2/26/19 2:28 PM:
Please ask this
Tim Allison created TIKA-2833:
-
Summary: Add a CSV/TSV detector
Key: TIKA-2833
URL: https://issues.apache.org/jira/browse/TIKA-2833
Project: Tika
Issue Type: Task
Reporter: Tim
[
https://issues.apache.org/jira/browse/TIKA-2832?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16777970#comment-16777970
]
Tim Allison commented on TIKA-2832:
---
Do you have tesseract installed?
> Very slow large PDF text
[
https://issues.apache.org/jira/browse/TIKA-2832?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16777982#comment-16777982
]
Slava G commented on TIKA-2832:
---
Thanks, I'll, but such a slow parsing, isn't a bug ? For me it's looks like