[jira] [Commented] (TIKA-2646) Tika parse["content"] returns jumbled text across cells of a table in a pdf

2018-05-22 Thread Luis Filipe Nassif (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2646?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16486646#comment-16486646 ] Luis Filipe Nassif commented on TIKA-2646: -- It does not maintain table structures, but have you

[jira] [Commented] (TIKA-2645) Reuse SAXParsers where possible

2018-05-22 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2645?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16486556#comment-16486556 ] Hudson commented on TIKA-2645: -- SUCCESS: Integrated in Jenkins build Tika-trunk #1490 (See

[jira] [Commented] (TIKA-2645) Reuse SAXParsers where possible

2018-05-22 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2645?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16486555#comment-16486555 ] Hudson commented on TIKA-2645: -- SUCCESS: Integrated in Jenkins build tika-branch-1x #29 (See

[jira] [Commented] (TIKA-2645) Reuse SAXParsers where possible

2018-05-22 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2645?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16486525#comment-16486525 ] Hudson commented on TIKA-2645: -- UNSTABLE: Integrated in Jenkins build tika-2.x-windows #256 (See

[jira] [Resolved] (TIKA-2645) Reuse SAXParsers where possible

2018-05-22 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2645?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison resolved TIKA-2645. --- Resolution: Fixed > Reuse SAXParsers where possible

[jira] [Reopened] (TIKA-2645) Reuse SAXParsers where possible

2018-05-22 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2645?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison reopened TIKA-2645: --- Clean up code to encapsulate acquire/release from parsers. We don’t want them to be responsible. Also, do

[jira] [Commented] (TIKA-2646) Tika parse["content"] returns jumbled text across cells of a table in a pdf

2018-05-22 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2646?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16484465#comment-16484465 ] Tim Allison commented on TIKA-2646: --- Y, it might be painful to try to coordinate tabula and our PDFBox

[jira] [Commented] (TIKA-2646) Tika parse["content"] returns jumbled text across cells of a table in a pdf

2018-05-22 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2646?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16484338#comment-16484338 ] Chris A. Mattmann commented on TIKA-2646: - Tim thanks - this is for a project at JPL and I asked

[jira] [Commented] (TIKA-2645) Reuse SAXParsers where possible

2018-05-22 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2645?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16484159#comment-16484159 ] Sebastian Nagel commented on TIKA-2645: --- Thanks, [~talli...@apache.org]! Great to hear that it

[jira] [Updated] (TIKA-2648) mime detection based on resource name detects resources as "text/x-php" instead of "text/html"

2018-05-22 Thread Gerard Bouchar (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2648?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gerard Bouchar updated TIKA-2648: - Summary: mime detection based on resource name detects resources as "text/x-php" instead of

[jira] [Created] (TIKA-2648) resource name based mime detection detects elements as "text/x-php" instead of "text/html"

2018-05-22 Thread Gerard Bouchar (JIRA)
Gerard Bouchar created TIKA-2648: Summary: resource name based mime detection detects elements as "text/x-php" instead of "text/html" Key: TIKA-2648 URL: https://issues.apache.org/jira/browse/TIKA-2648

Re: [jira] [Created] (TIKA-2647) Create a "security" page on our website

2018-05-22 Thread Oleg Tikhonov
Hi Tim, that would definitely be helpful! +1 Thanks, Oleg On Tue, May 22, 2018 at 3:38 PM, Tim Allison (JIRA) wrote: > Tim Allison created TIKA-2647: > - > > Summary: Create a "security" page on our website > Key:

[jira] [Created] (TIKA-2647) Create a "security" page on our website

2018-05-22 Thread Tim Allison (JIRA)
Tim Allison created TIKA-2647: - Summary: Create a "security" page on our website Key: TIKA-2647 URL: https://issues.apache.org/jira/browse/TIKA-2647 Project: Tika Issue Type: New Feature

[jira] [Resolved] (TIKA-2646) Tika parse["content"] returns jumbled text across cells of a table in a pdf

2018-05-22 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2646?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison resolved TIKA-2646. --- Resolution: Won't Fix [~adidier] thank you for opening this issue and sharing this with us. PDFs