[
https://issues.apache.org/jira/browse/TIKA-1728?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14738502#comment-14738502
]
mungeol heo commented on TIKA-1728:
---
Yes, I know. It is the reason why I used "file header" at the first
[
https://issues.apache.org/jira/browse/TIKA-1728?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14738454#comment-14738454
]
Nick Burch commented on TIKA-1728:
--
That's the header of one of the OLE2 streams, not of the overall file
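The distinction drawn here (a marker at the start of the raw file versus a marker inside an OLE2 stream) can be sketched roughly as follows. This is only an illustrative heuristic, not Tika's detector: the function name `sniff_hwp` and its return labels are hypothetical, the OLE2 signature is the standard compound-file magic, and it is assumed that the v3 marker is the same "HWP Document File" string quoted for v5 elsewhere in this thread.

```python
# Standard OLE2 compound-file signature (first 8 bytes of the raw file).
OLE2_MAGIC = bytes.fromhex("D0CF11E0A1B11AE1")
# Marker string quoted in the thread; assumed here to also appear in v3 files.
HWP_MARKER = b"HWP Document File"


def sniff_hwp(head: bytes) -> str:
    """Classify a file from its leading bytes (hypothetical helper)."""
    if head.startswith(OLE2_MAGIC):
        # For v5, the HWP marker sits inside one of the OLE2 streams, so the
        # raw file header only shows that this is some OLE2 container.
        return "ole2-container"
    if HWP_MARKER in head[:30]:
        # A marker within the first 30 raw bytes suggests a v3-style file.
        return "hwp-v3-candidate"
    return "unknown"
```

A raw-byte check like this cannot tell an HWP v5 file from any other OLE2 document, which is why the container has to be opened and its streams inspected.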
[
https://issues.apache.org/jira/browse/TIKA-1728?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14736533#comment-14736533
]
Nick Burch commented on TIKA-1728:
--
{quote}And the v5 file stores "HWP Document File" in the first 32
[
https://issues.apache.org/jira/browse/TIKA-1728?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14737843#comment-14737843
]
mungeol heo commented on TIKA-1728:
---
Maybe I misunderstand some contents of
[
https://issues.apache.org/jira/browse/TIKA-1728?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14734289#comment-14734289
]
mungeol heo commented on TIKA-1728:
---
I have created and tested two parsers for v3 and v5 HWP files
[
https://issues.apache.org/jira/browse/TIKA-1728?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14734531#comment-14734531
]
Nick Burch commented on TIKA-1728:
--
Detection of the v5 file is handled by the OLE2 container-aware
[
https://issues.apache.org/jira/browse/TIKA-1728?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14734808#comment-14734808
]
Tim Allison commented on TIKA-1728:
---
Opened separate ticket for potential integration: TIKA-1731.
[
https://issues.apache.org/jira/browse/TIKA-1728?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14735875#comment-14735875
]
mungeol heo commented on TIKA-1728:
---
This is what I know:
The v3 file's first 30 bytes include "HWP
[
https://issues.apache.org/jira/browse/TIKA-1728?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14734273#comment-14734273
]
mungeol heo commented on TIKA-1728:
---
I have a question about the new MIME type for HWP 5.0, which is addressed
[
https://issues.apache.org/jira/browse/TIKA-1728?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14734081#comment-14734081
]
mungeol heo commented on TIKA-1728:
---
I have tried r1701201, and it works properly.
As far as I know,
[
https://issues.apache.org/jira/browse/TIKA-1728?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14732013#comment-14732013
]
Tim Allison commented on TIKA-1728:
---
Should we look into seeing if the author of
[
https://issues.apache.org/jira/browse/TIKA-1728?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14730605#comment-14730605
]
Nick Burch commented on TIKA-1728:
--
Whoops, I'd set the wrong parent. Can you try with r1701201 or later?
[
https://issues.apache.org/jira/browse/TIKA-1728?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14730639#comment-14730639
]
Hudson commented on TIKA-1728:
--
SUCCESS: Integrated in tika-trunk-jdk1.7 #849 (See
[
https://issues.apache.org/jira/browse/TIKA-1728?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14730134#comment-14730134
]
mungeol heo commented on TIKA-1728:
---
I have tested r1700986.
It is working.
Thank you.
> Detection is
[
https://issues.apache.org/jira/browse/TIKA-1728?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14730149#comment-14730149
]
mungeol heo commented on TIKA-1728:
---
Detection is working.
> java -jar tika-app-1.10.jar -d