[jira] [Closed] (TIKA-1753) Improper word concatenation when extracting pdf

2015-10-16 Thread Ben McCann (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1753?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ben McCann closed TIKA-1753. Resolution: Later

[jira] [Commented] (TIKA-1753) Improper word concatenation when extracting pdf

2015-10-16 Thread Ben McCann (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1753?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14960254#comment-14960254 ] Ben McCann commented on TIKA-1753: Closing this since it's being tracked at PDFBOX now.

[GitHub] tika pull request: fix for TIKA-1772 contributed by wiedsche

2015-10-16 Thread wiedsche
GitHub user wiedsche opened a pull request: https://github.com/apache/tika/pull/59 fix for TIKA-1772 contributed by wiedsche You can merge this pull request into a Git repository by running: $ git pull https://github.com/wiedsche/tika TIKA-1772 Alternatively you can review

[jira] [Commented] (TIKA-1772) Mimetype of VTT files

2015-10-16 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1772?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14960290#comment-14960290 ] ASF GitHub Bot commented on TIKA-1772: GitHub user wiedsche opened a pull request:

[jira] [Created] (TIKA-1772) Mimetype of VTT files

2015-10-16 Thread Alexander Widera (JIRA)
Alexander Widera created TIKA-1772: Summary: Mimetype of VTT files Key: TIKA-1772 URL: https://issues.apache.org/jira/browse/TIKA-1772 Project: Tika Issue Type: Improvement

[jira] [Updated] (TIKA-1772) Mimetype of VTT files

2015-10-16 Thread Alexander Widera (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1772?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexander Widera updated TIKA-1772: Attachment: upc-video-subtitles-en.vtt Added example vtt file as attachment. Thanks for

[jira] [Commented] (TIKA-1772) Mimetype of VTT files

2015-10-16 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1772?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14960504#comment-14960504 ] Hudson commented on TIKA-1772: SUCCESS: Integrated in tika-trunk-jdk1.7 #869 (See

[jira] [Commented] (TIKA-1773) No XML Metadata output for JP2 files

2015-10-16 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1773?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14960503#comment-14960503 ] Hudson commented on TIKA-1773: SUCCESS: Integrated in tika-trunk-jdk1.7 #869 (See

[jira] [Created] (TIKA-1773) No XML Metadata output for JP2 files

2015-10-16 Thread Andreas Hirtzel (JIRA)
Andreas Hirtzel created TIKA-1773: Summary: No XML Metadata output for JP2 files Key: TIKA-1773 URL: https://issues.apache.org/jira/browse/TIKA-1773 Project: Tika Issue Type: Bug

[jira] [Updated] (TIKA-1773) No XML Metadata output for JP2 files

2015-10-16 Thread Andreas Hirtzel (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1773?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andreas Hirtzel updated TIKA-1773: Attachment: testJPEG.jp2 converted testfile (using Photoshop)

[jira] [Commented] (TIKA-1772) Mimetype of VTT files

2015-10-16 Thread Nick Burch (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1772?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14960454#comment-14960454 ] Nick Burch commented on TIKA-1772: Thanks for the patch! Couple of minor points - we normally sort the

[jira] [Commented] (TIKA-1773) No XML Metadata output for JP2 files

2015-10-16 Thread Nick Burch (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1773?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14960444#comment-14960444 ] Nick Burch commented on TIKA-1773: Are you able to convert an existing Tika test image (eg

[jira] [Commented] (TIKA-1773) No XML Metadata output for JP2 files

2015-10-16 Thread Nick Burch (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1773?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14960599#comment-14960599 ] Nick Burch commented on TIKA-1773: Ah, I think I've found the issue. Based on

[jira] [Comment Edited] (TIKA-1772) Mimetype of VTT files

2015-10-16 Thread Nick Burch (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1772?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14960722#comment-14960722 ] Nick Burch edited comment on TIKA-1772 at 10/16/15 1:46 PM: Thanks for that.

[jira] [Resolved] (TIKA-1772) Mimetype of VTT files

2015-10-16 Thread Nick Burch (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1772?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nick Burch resolved TIKA-1772. Resolution: Fixed Fix Version/s: 1.11 Thanks for that. Looks like we can also do mime magic
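
Editorial note on the "mime magic" mentioned above: the message is cut off, so the exact rule added to Tika is not shown here. As a rough illustration only, the WebVTT format itself defines a file signature (an optional UTF-8 byte order mark, then the string WEBVTT, followed by a space, tab, line break, or end of file), and content sniffing for VTT files can be sketched along those lines. The function name `looks_like_webvtt` is hypothetical and this is not Tika's implementation.

```python
import codecs


def looks_like_webvtt(data: bytes) -> bool:
    """Return True if the leading bytes match the WebVTT file signature.

    Hypothetical helper for illustration; not part of Tika.
    """
    # Skip an optional UTF-8 byte order mark.
    if data.startswith(codecs.BOM_UTF8):
        data = data[len(codecs.BOM_UTF8):]

    magic = b"WEBVTT"
    if not data.startswith(magic):
        return False

    # The signature must be followed by end of input, a space, a tab,
    # or a line break; anything else (e.g. "WEBVTTX") is not a match.
    rest = data[len(magic):]
    return rest == b"" or rest[:1] in (b" ", b"\t", b"\n", b"\r")
```

A detector would typically read only the first few bytes of the stream and pass them to such a check, falling back to the file extension when the signature is absent.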

[jira] [Commented] (TIKA-1772) Mimetype of VTT files

2015-10-16 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1772?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14960821#comment-14960821 ] Hudson commented on TIKA-1772: SUCCESS: Integrated in tika-trunk-jdk1.7 #871 (See

[jira] [Commented] (TIKA-1358) Add support for newer iWork file formats

2015-10-16 Thread Ben Summers (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1358?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14960963#comment-14960963 ] Ben Summers commented on TIKA-1358: Evernote have kindly open sourced some code to extract text from