Re: Tika 1.15.1? -> 1.16

2017-07-05 Thread Luís Filipe Nassif
Hi Tim, Taking a fast look at Nick's fix on TIKA-2419 seems conservative to me, restricted to corrupted xml, so I think there is no need to rerun the regression tests. So +1 from me, ++1 with age detection :) 2017-07-05 22:35 GMT-03:00 Allison, Timothy B. : > All, > I'm

[jira] [Commented] (TIKA-2298) To improve object recognition parser so that it may work without external RESTful service setup

2017-07-05 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2298?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16075859#comment-16075859 ] Chris A. Mattmann commented on TIKA-2298: - fixed, was a simple typo - you forgot to set the config

[jira] [Commented] (TIKA-2298) To improve object recognition parser so that it may work without external RESTful service setup

2017-07-05 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2298?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16075855#comment-16075855 ] Chris A. Mattmann commented on TIKA-2298: - docs added in:

[jira] [Commented] (TIKA-2298) To improve object recognition parser so that it may work without external RESTful service setup

2017-07-05 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2298?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16075849#comment-16075849 ] Chris A. Mattmann commented on TIKA-2298: - [~talli...@apache.org] your latest update causes Jenkins

[jira] [Commented] (TIKA-2420) Jackcess toSQLString throws UnsupportedOperationException for unknown query type

2017-07-05 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2420?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16075788#comment-16075788 ] Hudson commented on TIKA-2420: -- FAILURE: Integrated in Jenkins build Tika-trunk #1310 (See

[jira] [Commented] (TIKA-2298) To improve object recognition parser so that it may work without external RESTful service setup

2017-07-05 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2298?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16075786#comment-16075786 ] Hudson commented on TIKA-2298: -- FAILURE: Integrated in Jenkins build Tika-trunk #1310 (See

[jira] [Commented] (TIKA-2089) Macros not extracted from ppt files

2017-07-05 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2089?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16075787#comment-16075787 ] Hudson commented on TIKA-2089: -- FAILURE: Integrated in Jenkins build Tika-trunk #1310 (See

Re: Tika 1.15.1? -> 1.16

2017-07-05 Thread Chris Mattmann
Tim I really think I can get AgeDetection in. Let me try now. Then I’m +1. I’m +1 either way ( On 7/5/17, 6:35 PM, "Allison, Timothy B." wrote: All, I'm waiting to get some resolution on TIKA-2399. The regression tests came back with nothing surprising. I

RE: Tika 1.15.1? -> 1.16

2017-07-05 Thread Allison, Timothy B.
All, I'm waiting to get some resolution on TIKA-2399. The regression tests came back with nothing surprising. I fixed the npe that they uncovered in the new ppt macro extraction code. Will I need to rerun with the updates to mime detection that Nick just made? Or are we good enough to go

[jira] [Updated] (TIKA-2298) To improve object recognition parser so that it may work without external RESTful service setup

2017-07-05 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2298?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann updated TIKA-2298: Labels: ObjectRecognitionParser gsoc memex (was: ObjectRecognitionParser memex) > To

[jira] [Updated] (TIKA-2298) To improve object recognition parser so that it may work without external RESTful service setup

2017-07-05 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2298?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann updated TIKA-2298: Labels: ObjectRecognitionParser memex (was: ObjectRecognitionParser) > To improve object

[jira] [Commented] (TIKA-2298) To improve object recognition parser so that it may work without external RESTful service setup

2017-07-05 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2298?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16075755#comment-16075755 ] Chris A. Mattmann commented on TIKA-2298: - YES sounds perfect thanks [~talli...@apache.org] > To

[jira] [Commented] (TIKA-2298) To improve object recognition parser so that it may work without external RESTful service setup

2017-07-05 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2298?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16075743#comment-16075743 ] Tim Allison commented on TIKA-2298: --- I'm having the usual proxy problems in my environment with the

[jira] [Commented] (TIKA-2298) To improve object recognition parser so that it may work without external RESTful service setup

2017-07-05 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2298?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16075740#comment-16075740 ] ASF GitHub Bot commented on TIKA-2298: -- chrismattmann commented on issue #182: Creation of TIKA-2298

[jira] [Resolved] (TIKA-2298) To improve object recognition parser so that it may work without external RESTful service setup

2017-07-05 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2298?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann resolved TIKA-2298. - Resolution: Fixed Assignee: Chris A. Mattmann Thanks to [~asmehra95] and

[jira] [Commented] (TIKA-2298) To improve object recognition parser so that it may work without external RESTful service setup

2017-07-05 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2298?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16075732#comment-16075732 ] ASF GitHub Bot commented on TIKA-2298: -- chrismattmann commented on a change in pull request #182:

[jira] [Commented] (TIKA-2298) To improve object recognition parser so that it may work without external RESTful service setup

2017-07-05 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2298?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16075730#comment-16075730 ] ASF GitHub Bot commented on TIKA-2298: -- chrismattmann closed pull request #182: Creation of TIKA-2298

[jira] [Commented] (TIKA-2298) To improve object recognition parser so that it may work without external RESTful service setup

2017-07-05 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2298?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16075729#comment-16075729 ] ASF GitHub Bot commented on TIKA-2298: -- chrismattmann commented on a change in pull request #182:

[jira] [Commented] (TIKA-2298) To improve object recognition parser so that it may work without external RESTful service setup

2017-07-05 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2298?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16075727#comment-16075727 ] ASF GitHub Bot commented on TIKA-2298: -- tballison commented on a change in pull request #182: Creation

[jira] [Commented] (TIKA-2298) To improve object recognition parser so that it may work without external RESTful service setup

2017-07-05 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2298?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16075728#comment-16075728 ] ASF GitHub Bot commented on TIKA-2298: -- chrismattmann commented on issue #182: Creation of TIKA-2298

[jira] [Commented] (TIKA-2298) To improve object recognition parser so that it may work without external RESTful service setup

2017-07-05 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2298?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16075725#comment-16075725 ] ASF GitHub Bot commented on TIKA-2298: -- thammegowda commented on issue #182: Creation of TIKA-2298

[jira] [Commented] (TIKA-2298) To improve object recognition parser so that it may work without external RESTful service setup

2017-07-05 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2298?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16075718#comment-16075718 ] ASF GitHub Bot commented on TIKA-2298: -- chrismattmann commented on issue #182: Creation of TIKA-2298

[jira] [Commented] (TIKA-2089) Macros not extracted from ppt files

2017-07-05 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2089?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16075710#comment-16075710 ] Hudson commented on TIKA-2089: -- FAILURE: Integrated in Jenkins build Tika-trunk #1309 (See

[jira] [Commented] (TIKA-2298) To improve object recognition parser so that it may work without external RESTful service setup

2017-07-05 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2298?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16075711#comment-16075711 ] ASF GitHub Bot commented on TIKA-2298: -- chrismattmann commented on issue #182: Creation of TIKA-2298

[jira] [Comment Edited] (TIKA-2421) HTML Encoding Detector should ignore UTF-16 and UTF-32

2017-07-05 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2421?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16075700#comment-16075700 ] Tim Allison edited comment on TIKA-2421 at 7/6/17 12:40 AM: Y, I was wondering

[jira] [Commented] (TIKA-2421) HTML Encoding Detector should ignore UTF-16 and UTF-32

2017-07-05 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2421?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16075700#comment-16075700 ] Tim Allison commented on TIKA-2421: --- Y, I was wondering about this case. > HTML Encoding Detector should

[jira] [Commented] (TIKA-2420) Jackcess toSQLString throws UnsupportedOperationException for unknown query type

2017-07-05 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2420?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16075699#comment-16075699 ] Tim Allison commented on TIKA-2420: --- Do we know which types of query will throw this? Might this change

[jira] [Commented] (TIKA-2298) To improve object recognition parser so that it may work without external RESTful service setup

2017-07-05 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2298?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16075678#comment-16075678 ] ASF GitHub Bot commented on TIKA-2298: -- thammegowda commented on issue #182: Creation of TIKA-2298

[jira] [Commented] (TIKA-2298) To improve object recognition parser so that it may work without external RESTful service setup

2017-07-05 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2298?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16075677#comment-16075677 ] ASF GitHub Bot commented on TIKA-2298: -- thammegowda commented on issue #182: Creation of TIKA-2298

[jira] [Commented] (TIKA-1367) Tika documentation should list tika-parsers parser dependencies

2017-07-05 Thread Gus Heck (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1367?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16075614#comment-16075614 ] Gus Heck commented on TIKA-1367: Maven Shade, Gradle Shadow and OneJar plugins for both Maven and Gradle

[jira] [Commented] (TIKA-2298) To improve object recognition parser so that it may work without external RESTful service setup

2017-07-05 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2298?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16075605#comment-16075605 ] ASF GitHub Bot commented on TIKA-2298: -- chrismattmann commented on issue #182: Creation of TIKA-2298

[jira] [Commented] (TIKA-2298) To improve object recognition parser so that it may work without external RESTful service setup

2017-07-05 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2298?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16075602#comment-16075602 ] ASF GitHub Bot commented on TIKA-2298: -- chrismattmann commented on issue #182: Creation of TIKA-2298

[jira] [Created] (TIKA-2421) HTML Encoding Detector should ignore UTF-16 and UTF-32

2017-07-05 Thread Nick C (JIRA)
Nick C created TIKA-2421: Summary: HTML Encoding Detector should ignore UTF-16 and UTF-32 Key: TIKA-2421 URL: https://issues.apache.org/jira/browse/TIKA-2421 Project: Tika Issue Type: Bug

[jira] [Commented] (TIKA-2420) Jackcess toSQLString throws UnsupportedOperationException for unknown query type

2017-07-05 Thread Nick C (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2420?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16075572#comment-16075572 ] Nick C commented on TIKA-2420: -- I'm currently unable to share the document that causes the issue. > Jackcess

[jira] [Created] (TIKA-2420) Jackcess toSQLString throws UnsupportedOperationException for unknown query type

2017-07-05 Thread Nick C (JIRA)
Nick C created TIKA-2420: Summary: Jackcess toSQLString throws UnsupportedOperationException for unknown query type Key: TIKA-2420 URL: https://issues.apache.org/jira/browse/TIKA-2420 Project: Tika

error in tika-bundle: tika-seraialization was removed?

2017-07-05 Thread Chris Mattmann
Anyone else seeing build errors in tika-bundle since tika-serialization was removed? I had to implement the following patch to fix it: LMC-053601:tika-bundle mattmann$ git diff ab4ea4724e52fb5718a9d8ea86af96425fb87c7b diff --git a/tika-bundle/pom.xml b/tika-bundle/pom.xml index

[jira] [Commented] (TIKA-2298) To improve object recognition parser so that it may work without external RESTful service setup

2017-07-05 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2298?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16075221#comment-16075221 ] ASF GitHub Bot commented on TIKA-2298: -- asmehra95 commented on issue #182: Creation of TIKA-2298

[jira] [Comment Edited] (TIKA-2419) Try HTML mime magic on broken XML files

2017-07-05 Thread Luis Filipe Nassif (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2419?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16075006#comment-16075006 ] Luis Filipe Nassif edited comment on TIKA-2419 at 7/5/17 4:03 PM: -- Hi

[jira] [Commented] (TIKA-2419) Try HTML mime magic on broken XML files

2017-07-05 Thread Luis Filipe Nassif (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2419?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16075006#comment-16075006 ] Luis Filipe Nassif commented on TIKA-2419: -- Hi Nick, The original issue of eml(x) being detected

[jira] [Commented] (TIKA-2399) Version conflict with non-ASL jai-imageio-jpeg2000 and edu.ucar jj2000

2017-07-05 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2399?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16074923#comment-16074923 ] Tim Allison commented on TIKA-2399: --- Great. Thank you! > Version conflict with non-ASL

[jira] [Commented] (TIKA-2399) Version conflict with non-ASL jai-imageio-jpeg2000 and edu.ucar jj2000

2017-07-05 Thread Matthew Caruana Galizia (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2399?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16074918#comment-16074918 ] Matthew Caruana Galizia commented on TIKA-2399: --- I've emailed Unidata to ask about publishing

[jira] [Commented] (TIKA-2298) To improve object recognition parser so that it may work without external RESTful service setup

2017-07-05 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2298?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16074889#comment-16074889 ] ASF GitHub Bot commented on TIKA-2298: -- thammegowda commented on issue #182: Creation of TIKA-2298

[jira] [Commented] (TIKA-2399) Version conflict with non-ASL jai-imageio-jpeg2000 and edu.ucar jj2000

2017-07-05 Thread Matthew Caruana Galizia (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2399?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16074881#comment-16074881 ] Matthew Caruana Galizia commented on TIKA-2399: --- Tim, see

[jira] [Commented] (TIKA-2419) Try HTML mime magic on broken XML files

2017-07-05 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2419?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16074872#comment-16074872 ] Hudson commented on TIKA-2419: -- FAILURE: Integrated in Jenkins build Tika-trunk #1308 (See

[jira] [Comment Edited] (TIKA-2399) Version conflict with non-ASL jai-imageio-jpeg2000 and edu.ucar jj2000

2017-07-05 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2399?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16074843#comment-16074843 ] Tim Allison edited comment on TIKA-2399 at 7/5/17 2:30 PM: --- [~mcaruanagalizia],

[jira] [Commented] (TIKA-2399) Version conflict with non-ASL jai-imageio-jpeg2000 and edu.ucar jj2000

2017-07-05 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2399?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16074851#comment-16074851 ] Tim Allison commented on TIKA-2399: --- https://github.com/Unidata/jj2000/issues/6 > Version conflict with

[jira] [Comment Edited] (TIKA-2399) Version conflict with non-ASL jai-imageio-jpeg2000 and edu.ucar jj2000

2017-07-05 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2399?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16074843#comment-16074843 ] Tim Allison edited comment on TIKA-2399 at 7/5/17 2:23 PM: --- [~mcaruanagalizia],

[jira] [Commented] (TIKA-2399) Version conflict with non-ASL jai-imageio-jpeg2000 and edu.ucar jj2000

2017-07-05 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2399?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16074843#comment-16074843 ] Tim Allison commented on TIKA-2399: --- [~mcaruanagalizia], would you be able to ask Unidata if they'd mind

[jira] [Commented] (TIKA-2399) Version conflict with non-ASL jai-imageio-jpeg2000 and edu.ucar jj2000

2017-07-05 Thread Nick Burch (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2399?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16074825#comment-16074825 ] Nick Burch commented on TIKA-2399: -- We can properly fix this in 2.x when we sort out how to have multiple

[jira] [Commented] (TIKA-2419) Try HTML mime magic on broken XML files

2017-07-05 Thread Nick Burch (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2419?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16074806#comment-16074806 ] Nick Burch commented on TIKA-2419: -- One fix might be to drop the priority of the XML magic to 40 to match

[jira] [Created] (TIKA-2419) Try HTML mime magic on broken XML files

2017-07-05 Thread Nick Burch (JIRA)
Nick Burch created TIKA-2419: Summary: Try HTML mime magic on broken XML files Key: TIKA-2419 URL: https://issues.apache.org/jira/browse/TIKA-2419 Project: Tika Issue Type: Bug

RE: FW: Tika content detection and crawled "remote" content

2017-07-05 Thread Allison, Timothy B.
> The initial intention is, of course, to help to improve the MIME detection in > Tika core. Absolutely agree. > Yes, you'll get few 10,000 more (MS)Office documents thanks to Tika: Tika-1.15 HTTP-Content-Type 12520application/x-tika-msoffice

[jira] [Commented] (TIKA-2418) English ASCII text classified as video/quicktime

2017-07-05 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2418?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16074703#comment-16074703 ] Hudson commented on TIKA-2418: -- FAILURE: Integrated in Jenkins build Tika-trunk #1307 (See

[jira] [Commented] (TIKA-2399) Version conflict with non-ASL jai-imageio-jpeg2000 and edu.ucar jj2000

2017-07-05 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2399?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16074656#comment-16074656 ] Tim Allison commented on TIKA-2399: --- [~gagravarr], any opinion on this one for 1.16? > Version conflict

[jira] [Commented] (TIKA-2418) English ASCII text classified as video/quicktime

2017-07-05 Thread Christopher Creutzig (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2418?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16074654#comment-16074654 ] Christopher Creutzig commented on TIKA-2418: That was quick, thanks! > English ASCII text

[jira] [Commented] (TIKA-2418) English ASCII text classified as video/quicktime

2017-07-05 Thread Nick Burch (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2418?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16074649#comment-16074649 ] Nick Burch commented on TIKA-2418: -- Hopefully fixed in 0815b2144cf013e1a0803cee72d8076e8c544716 - I've

[jira] [Resolved] (TIKA-2418) English ASCII text classified as video/quicktime

2017-07-05 Thread Nick Burch (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2418?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nick Burch resolved TIKA-2418. -- Resolution: Fixed Fix Version/s: 1.16 > English ASCII text classified as video/quicktime >

[jira] [Commented] (TIKA-1367) Tika documentation should list tika-parsers parser dependencies

2017-07-05 Thread Nick Burch (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1367?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16074648#comment-16074648 ] Nick Burch commented on TIKA-1367: -- [~talli...@mitre.org] I'm not sure there is - we've fixed it in Tika

[jira] [Commented] (TIKA-1367) Tika documentation should list tika-parsers parser dependencies

2017-07-05 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1367?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16074615#comment-16074615 ] Tim Allison commented on TIKA-1367: --- Fellow devs, is this something we need to fix for 1.16? > Tika

[jira] [Commented] (TIKA-2399) Version conflict with non-ASL jai-imageio-jpeg2000 and edu.ucar jj2000

2017-07-05 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2399?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16074601#comment-16074601 ] Tim Allison commented on TIKA-2399: --- Y, probably. I worry that many of our users won't want to build

[jira] [Commented] (TIKA-2403) Elasticsearch 5.2.2 - Ingest Node - PDF - Parsing Issue

2017-07-05 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2403?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16074584#comment-16074584 ] Tim Allison commented on TIKA-2403: --- Some users want everything. Some want only the visible parts. Let

[jira] [Commented] (TIKA-2262) Supporting Image-to-Text (Image Captioning) in Tika for Image MIME Types

2017-07-05 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2262?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16074582#comment-16074582 ] ASF GitHub Bot commented on TIKA-2262: -- tballison commented on issue #189: Fix for TIKA-2262:

[jira] [Created] (TIKA-2418) English ASCII text classified as video/quicktime

2017-07-05 Thread Christopher Creutzig (JIRA)
Christopher Creutzig created TIKA-2418: -- Summary: English ASCII text classified as video/quicktime Key: TIKA-2418 URL: https://issues.apache.org/jira/browse/TIKA-2418 Project: Tika

[jira] [Commented] (TIKA-2403) Elasticsearch 5.2.2 - Ingest Node - PDF - Parsing Issue

2017-07-05 Thread Boopathi (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2403?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16074303#comment-16074303 ] Boopathi commented on TIKA-2403: Thanks you so much for the help. Just curious to know why it has been