Hi Tim,
From a quick look, Nick's fix on TIKA-2419 seems conservative to me: it is
restricted to corrupted XML, so I think there is no need to rerun the
regression tests.
So +1 from me, ++1 with age detection :)
2017-07-05 22:35 GMT-03:00 Allison, Timothy B. :
> All,
> I'm
[
https://issues.apache.org/jira/browse/TIKA-2298?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16075859#comment-16075859
]
Chris A. Mattmann commented on TIKA-2298:
-
fixed, was a simple typo - you forgot to set the config
[
https://issues.apache.org/jira/browse/TIKA-2298?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16075855#comment-16075855
]
Chris A. Mattmann commented on TIKA-2298:
-
docs added in:
[
https://issues.apache.org/jira/browse/TIKA-2298?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16075849#comment-16075849
]
Chris A. Mattmann commented on TIKA-2298:
-
[~talli...@apache.org] your latest update causes Jenkins
[
https://issues.apache.org/jira/browse/TIKA-2420?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16075788#comment-16075788
]
Hudson commented on TIKA-2420:
--
FAILURE: Integrated in Jenkins build Tika-trunk #1310 (See
[
https://issues.apache.org/jira/browse/TIKA-2298?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16075786#comment-16075786
]
Hudson commented on TIKA-2298:
--
FAILURE: Integrated in Jenkins build Tika-trunk #1310 (See
[
https://issues.apache.org/jira/browse/TIKA-2089?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16075787#comment-16075787
]
Hudson commented on TIKA-2089:
--
FAILURE: Integrated in Jenkins build Tika-trunk #1310 (See
Tim, I really think I can get AgeDetection in. Let me try now.
Then I’m +1. I’m +1 either way (
On 7/5/17, 6:35 PM, "Allison, Timothy B." wrote:
All,
I'm waiting to get some resolution on TIKA-2399. The regression tests
came back with nothing surprising. I
All,
I'm waiting to get some resolution on TIKA-2399. The regression tests came
back with nothing surprising. I fixed the NPE that they uncovered in the new
PPT macro extraction code.
Will I need to rerun with the updates to mime detection that Nick just made?
Or are we good enough to go
[
https://issues.apache.org/jira/browse/TIKA-2298?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chris A. Mattmann updated TIKA-2298:
Labels: ObjectRecognitionParser gsoc memex (was: ObjectRecognitionParser
memex)
> To
[
https://issues.apache.org/jira/browse/TIKA-2298?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chris A. Mattmann updated TIKA-2298:
Labels: ObjectRecognitionParser memex (was: ObjectRecognitionParser)
> To improve object
[
https://issues.apache.org/jira/browse/TIKA-2298?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16075755#comment-16075755
]
Chris A. Mattmann commented on TIKA-2298:
-
YES sounds perfect thanks [~talli...@apache.org]
> To
[
https://issues.apache.org/jira/browse/TIKA-2298?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16075743#comment-16075743
]
Tim Allison commented on TIKA-2298:
---
I'm having the usual proxy problems in my environment with the
[
https://issues.apache.org/jira/browse/TIKA-2298?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16075740#comment-16075740
]
ASF GitHub Bot commented on TIKA-2298:
--
chrismattmann commented on issue #182: Creation of TIKA-2298
[
https://issues.apache.org/jira/browse/TIKA-2298?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chris A. Mattmann resolved TIKA-2298.
-
Resolution: Fixed
Assignee: Chris A. Mattmann
Thanks to [~asmehra95] and
[
https://issues.apache.org/jira/browse/TIKA-2298?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16075732#comment-16075732
]
ASF GitHub Bot commented on TIKA-2298:
--
chrismattmann commented on a change in pull request #182:
[
https://issues.apache.org/jira/browse/TIKA-2298?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16075730#comment-16075730
]
ASF GitHub Bot commented on TIKA-2298:
--
chrismattmann closed pull request #182: Creation of TIKA-2298
[
https://issues.apache.org/jira/browse/TIKA-2298?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16075729#comment-16075729
]
ASF GitHub Bot commented on TIKA-2298:
--
chrismattmann commented on a change in pull request #182:
[
https://issues.apache.org/jira/browse/TIKA-2298?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16075727#comment-16075727
]
ASF GitHub Bot commented on TIKA-2298:
--
tballison commented on a change in pull request #182: Creation
[
https://issues.apache.org/jira/browse/TIKA-2298?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16075728#comment-16075728
]
ASF GitHub Bot commented on TIKA-2298:
--
chrismattmann commented on issue #182: Creation of TIKA-2298
[
https://issues.apache.org/jira/browse/TIKA-2298?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16075725#comment-16075725
]
ASF GitHub Bot commented on TIKA-2298:
--
thammegowda commented on issue #182: Creation of TIKA-2298
[
https://issues.apache.org/jira/browse/TIKA-2298?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16075718#comment-16075718
]
ASF GitHub Bot commented on TIKA-2298:
--
chrismattmann commented on issue #182: Creation of TIKA-2298
[
https://issues.apache.org/jira/browse/TIKA-2089?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16075710#comment-16075710
]
Hudson commented on TIKA-2089:
--
FAILURE: Integrated in Jenkins build Tika-trunk #1309 (See
[
https://issues.apache.org/jira/browse/TIKA-2298?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16075711#comment-16075711
]
ASF GitHub Bot commented on TIKA-2298:
--
chrismattmann commented on issue #182: Creation of TIKA-2298
[
https://issues.apache.org/jira/browse/TIKA-2421?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16075700#comment-16075700
]
Tim Allison edited comment on TIKA-2421 at 7/6/17 12:40 AM:
Y, I was wondering
[
https://issues.apache.org/jira/browse/TIKA-2421?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16075700#comment-16075700
]
Tim Allison commented on TIKA-2421:
---
Y, I was wondering about this case.
> HTML Encoding Detector should
[
https://issues.apache.org/jira/browse/TIKA-2420?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16075699#comment-16075699
]
Tim Allison commented on TIKA-2420:
---
Do we know which types of query will throw this? Might this change
[
https://issues.apache.org/jira/browse/TIKA-2298?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16075678#comment-16075678
]
ASF GitHub Bot commented on TIKA-2298:
--
thammegowda commented on issue #182: Creation of TIKA-2298
[
https://issues.apache.org/jira/browse/TIKA-2298?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16075677#comment-16075677
]
ASF GitHub Bot commented on TIKA-2298:
--
thammegowda commented on issue #182: Creation of TIKA-2298
[
https://issues.apache.org/jira/browse/TIKA-1367?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16075614#comment-16075614
]
Gus Heck commented on TIKA-1367:
Maven Shade, Gradle Shadow and OneJar plugins for both Maven and Gradle
[
https://issues.apache.org/jira/browse/TIKA-2298?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16075605#comment-16075605
]
ASF GitHub Bot commented on TIKA-2298:
--
chrismattmann commented on issue #182: Creation of TIKA-2298
[
https://issues.apache.org/jira/browse/TIKA-2298?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16075602#comment-16075602
]
ASF GitHub Bot commented on TIKA-2298:
--
chrismattmann commented on issue #182: Creation of TIKA-2298
Nick C created TIKA-2421:
Summary: HTML Encoding Detector should ignore UTF-16 and UTF-32
Key: TIKA-2421
URL: https://issues.apache.org/jira/browse/TIKA-2421
Project: Tika
Issue Type: Bug
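A minimal sketch of the idea in the TIKA-2421 summary, under assumptions: the issue body is not shown here, so the reasoning is a guess. If a charset declaration was found by scanning the bytes as ASCII-compatible text, the document cannot actually be UTF-16 or UTF-32, so such a declaration is not credible. The class and method names below are hypothetical and are not Tika's API.

```java
import java.util.Locale;

public class MetaCharsetFilter {

    // Hypothetical helper (not part of Tika): returns the normalized charset
    // name declared in a <meta> tag, or null if the declaration should be ignored.
    static String filterDeclaredCharset(String declared) {
        if (declared == null) {
            return null;
        }
        String name = declared.trim().toUpperCase(Locale.ROOT);
        if (name.isEmpty()) {
            return null;
        }
        // Assumption: the tag was located by reading single-byte, ASCII-compatible
        // data, which would be impossible in a real UTF-16 or UTF-32 stream.
        if (name.startsWith("UTF-16") || name.startsWith("UTF-32")) {
            return null;
        }
        return name;
    }

    public static void main(String[] args) {
        System.out.println(filterDeclaredCharset("utf-16le"));
        System.out.println(filterDeclaredCharset("ISO-8859-1"));
    }
}
```

A caller would fall back to its other detectors whenever the helper returns null.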
[
https://issues.apache.org/jira/browse/TIKA-2420?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16075572#comment-16075572
]
Nick C commented on TIKA-2420:
--
I'm currently unable to share the document that causes the issue.
> Jackcess
Nick C created TIKA-2420:
Summary: Jackcess toSQLString throws UnsupportedOperationException
for unknown query type
Key: TIKA-2420
URL: https://issues.apache.org/jira/browse/TIKA-2420
Project: Tika
Anyone else seeing build errors in tika-bundle since tika-serialization was
removed?
I had to implement the following patch to fix it:
LMC-053601:tika-bundle mattmann$ git diff
ab4ea4724e52fb5718a9d8ea86af96425fb87c7b
diff --git a/tika-bundle/pom.xml b/tika-bundle/pom.xml
index
[
https://issues.apache.org/jira/browse/TIKA-2298?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16075221#comment-16075221
]
ASF GitHub Bot commented on TIKA-2298:
--
asmehra95 commented on issue #182: Creation of TIKA-2298
[
https://issues.apache.org/jira/browse/TIKA-2419?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16075006#comment-16075006
]
Luis Filipe Nassif edited comment on TIKA-2419 at 7/5/17 4:03 PM:
--
Hi
[
https://issues.apache.org/jira/browse/TIKA-2419?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16075006#comment-16075006
]
Luis Filipe Nassif commented on TIKA-2419:
--
Hi Nick,
The original issue of eml(x) being detected
[
https://issues.apache.org/jira/browse/TIKA-2399?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16074923#comment-16074923
]
Tim Allison commented on TIKA-2399:
---
Great. Thank you!
> Version conflict with non-ASL
[
https://issues.apache.org/jira/browse/TIKA-2399?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16074918#comment-16074918
]
Matthew Caruana Galizia commented on TIKA-2399:
---
I've emailed Unidata to ask about publishing
[
https://issues.apache.org/jira/browse/TIKA-2298?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16074889#comment-16074889
]
ASF GitHub Bot commented on TIKA-2298:
--
thammegowda commented on issue #182: Creation of TIKA-2298
[
https://issues.apache.org/jira/browse/TIKA-2399?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16074881#comment-16074881
]
Matthew Caruana Galizia commented on TIKA-2399:
---
Tim, see
[
https://issues.apache.org/jira/browse/TIKA-2419?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16074872#comment-16074872
]
Hudson commented on TIKA-2419:
--
FAILURE: Integrated in Jenkins build Tika-trunk #1308 (See
[
https://issues.apache.org/jira/browse/TIKA-2399?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16074843#comment-16074843
]
Tim Allison edited comment on TIKA-2399 at 7/5/17 2:30 PM:
---
[~mcaruanagalizia],
[
https://issues.apache.org/jira/browse/TIKA-2399?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16074851#comment-16074851
]
Tim Allison commented on TIKA-2399:
---
https://github.com/Unidata/jj2000/issues/6
> Version conflict with
[
https://issues.apache.org/jira/browse/TIKA-2399?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16074843#comment-16074843
]
Tim Allison edited comment on TIKA-2399 at 7/5/17 2:23 PM:
---
[~mcaruanagalizia],
[
https://issues.apache.org/jira/browse/TIKA-2399?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16074843#comment-16074843
]
Tim Allison commented on TIKA-2399:
---
[~mcaruanagalizia], would you be able to ask Unidata if they'd mind
[
https://issues.apache.org/jira/browse/TIKA-2399?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16074825#comment-16074825
]
Nick Burch commented on TIKA-2399:
--
We can properly fix this in 2.x when we sort out how to have multiple
[
https://issues.apache.org/jira/browse/TIKA-2419?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16074806#comment-16074806
]
Nick Burch commented on TIKA-2419:
--
One fix might be to drop the priority of the XML magic to 40 to match
Nick Burch created TIKA-2419:
Summary: Try HTML mime magic on broken XML files
Key: TIKA-2419
URL: https://issues.apache.org/jira/browse/TIKA-2419
Project: Tika
Issue Type: Bug
> The initial intention is, of course, to help to improve the MIME detection in
> Tika core.
Absolutely agree.
> Yes, you'll get a few 10,000 more (MS)Office documents thanks to Tika:
Tika-1.15 HTTP-Content-Type
12520 application/x-tika-msoffice
[
https://issues.apache.org/jira/browse/TIKA-2418?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16074703#comment-16074703
]
Hudson commented on TIKA-2418:
--
FAILURE: Integrated in Jenkins build Tika-trunk #1307 (See
[
https://issues.apache.org/jira/browse/TIKA-2399?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16074656#comment-16074656
]
Tim Allison commented on TIKA-2399:
---
[~gagravarr], any opinion on this one for 1.16?
> Version conflict
[
https://issues.apache.org/jira/browse/TIKA-2418?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16074654#comment-16074654
]
Christopher Creutzig commented on TIKA-2418:
That was quick, thanks!
> English ASCII text
[
https://issues.apache.org/jira/browse/TIKA-2418?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16074649#comment-16074649
]
Nick Burch commented on TIKA-2418:
--
Hopefully fixed in 0815b2144cf013e1a0803cee72d8076e8c544716 - I've
[
https://issues.apache.org/jira/browse/TIKA-2418?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Nick Burch resolved TIKA-2418.
--
Resolution: Fixed
Fix Version/s: 1.16
> English ASCII text classified as video/quicktime
>
[
https://issues.apache.org/jira/browse/TIKA-1367?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16074648#comment-16074648
]
Nick Burch commented on TIKA-1367:
--
[~talli...@mitre.org] I'm not sure there is - we've fixed it in Tika
[
https://issues.apache.org/jira/browse/TIKA-1367?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16074615#comment-16074615
]
Tim Allison commented on TIKA-1367:
---
Fellow devs, is this something we need to fix for 1.16?
> Tika
[
https://issues.apache.org/jira/browse/TIKA-2399?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16074601#comment-16074601
]
Tim Allison commented on TIKA-2399:
---
Y, probably. I worry that many of our users won't want to build
[
https://issues.apache.org/jira/browse/TIKA-2403?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16074584#comment-16074584
]
Tim Allison commented on TIKA-2403:
---
Some users want everything. Some want only the visible parts.
Let
[
https://issues.apache.org/jira/browse/TIKA-2262?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16074582#comment-16074582
]
ASF GitHub Bot commented on TIKA-2262:
--
tballison commented on issue #189: Fix for TIKA-2262:
Christopher Creutzig created TIKA-2418:
--
Summary: English ASCII text classified as video/quicktime
Key: TIKA-2418
URL: https://issues.apache.org/jira/browse/TIKA-2418
Project: Tika
[
https://issues.apache.org/jira/browse/TIKA-2403?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16074303#comment-16074303
]
Boopathi commented on TIKA-2403:
Thank you so much for the help. Just curious to know why it has been