[
https://issues.apache.org/jira/browse/TIKA-3466?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17378151#comment-17378151
]
Packiaraj Sakkanan commented on TIKA-3466:
--
[~nick],
The mentioned scenrio is pretty much
[
https://issues.apache.org/jira/browse/TIKA-3466?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17377953#comment-17377953
]
Nick Burch commented on TIKA-3466:
--
[~psakkanan] You really need to be doing some xml parsing /
[
https://issues.apache.org/jira/browse/TIKA-3466?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17377563#comment-17377563
]
Packiaraj Sakkanan commented on TIKA-3466:
--
Hi [~tallison]
Here is the stripped-down version of
[
https://issues.apache.org/jira/browse/TIKA-3466?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17377548#comment-17377548
]
Tim Allison commented on TIKA-3466:
---
And, for the record, the {{file}} command (file-5.37) identifies
[
https://issues.apache.org/jira/browse/TIKA-3466?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17377542#comment-17377542
]
Tim Allison commented on TIKA-3466:
---
We need to do as much as we can on Tika to get file detection
[
https://issues.apache.org/jira/browse/TIKA-3466?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17377508#comment-17377508
]
Packiaraj Sakkanan commented on TIKA-3466:
--
HiĀ [~nick],
We are having problem with allow-list. We
[
https://issues.apache.org/jira/browse/TIKA-3466?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17377317#comment-17377317
]
Nick Burch commented on TIKA-3466:
--
I'm happy to add the xmlns version as a match, that seems pretty
[
https://issues.apache.org/jira/browse/TIKA-3466?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17376851#comment-17376851
]
Kenneth William Krugler commented on TIKA-3466:
---
Hi [~psakkanan] - that namespace is inside
[
https://issues.apache.org/jira/browse/TIKA-3466?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17376828#comment-17376828
]
Tim Allison commented on TIKA-3466:
---
At a high level, Tika does a pretty good job on files in the wild,
[
https://issues.apache.org/jira/browse/TIKA-3466?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17376798#comment-17376798
]
Packiaraj Sakkanan commented on TIKA-3466:
--
Wouldn't be sufficent to check namepsace alone
[
https://issues.apache.org/jira/browse/TIKA-3466?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17376771#comment-17376771
]
Kenneth William Krugler commented on TIKA-3466:
---
Browsers do all kinds of helicopter stunts
[
https://issues.apache.org/jira/browse/TIKA-3466?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17376760#comment-17376760
]
Packiaraj Sakkanan commented on TIKA-3466:
--
The problem here is that this sample file is rendered
[
https://issues.apache.org/jira/browse/TIKA-3466?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17376712#comment-17376712
]
Kenneth William Krugler commented on TIKA-3466:
---
This looks like broken HTML. Which we would
[
https://issues.apache.org/jira/browse/TIKA-3466?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17376689#comment-17376689
]
Nick Burch commented on TIKA-3466:
--
I've never seen a file that like before, but I'm sure Tim will pop
14 matches
Mail list logo