[
https://issues.apache.org/jira/browse/TIKA-1727?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15047063#comment-15047063
]
Tim Allison commented on TIKA-1727:
-----------------------------------
I think that it'll be tightly tied to Tika, writing structured info to the
XHTMLContenthandler.
As a first step, I'll borrow {{test_text_extraction.vsdx}} so that we at least
have multiple shapes.
Are there other vsdx features that we should test (this is probably better in
POI)...titles, footers, footnotes, comments, sdts, embedded files, inline
images, recursive shapes (?)...
I'm not familiar enough with Visio to know the odds and ends that we'll want to
test. Thank you, again, for contributing this chunk of code to POI!
> Add Tika wrapper to handle Visio .vsdx files once parser is available in POI
> ----------------------------------------------------------------------------
>
> Key: TIKA-1727
> URL: https://issues.apache.org/jira/browse/TIKA-1727
> Project: Tika
> Issue Type: New Feature
> Reporter: Tim Allison
> Priority: Minor
>
> This [issue|https://bz.apache.org/bugzilla/show_bug.cgi?id=58087] in POI is
> tracking the contribution by [~virtuald] of a parser for Visio .vsdx files.
> Once that is rolled into POI, let's add handling for it in Tika.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)