senthil created TIKA-1658:
-----------------------------
Summary: unable to parse microsoft visio files with tika
Key: TIKA-1658
URL: https://issues.apache.org/jira/browse/TIKA-1658
Project: Tika
Issue Type: Bug
Components: metadata
Affects Versions: 1.8, 1.5, 1.4, 1.3
Environment: ubuntu 14.04 and windows 7
Reporter: senthil
hi
With parsing an microsoft visio it throws an exception.
org.apache.tika.exception.TikaException: Unexpected RuntimeException from
org.apache.tika.parser.microsoft.OfficeParser@13d28e3
at
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:244)
at
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:242)
at
org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:120)
Caused by: java.lang.RuntimeException: TODO
at
org.apache.poi.hdgf.pointers.PointerFactory.createPointer(PointerFactory.java:45)
at org.apache.poi.hdgf.HDGFDiagram.<init>(HDGFDiagram.java:99)
application/vnd.visio
at
org.apache.poi.hdgf.extractor.VisioTextExtractor.<init>(VisioTextExtractor.java:55)
at
org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:200)
at
org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:161)
at
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:242)
... 4 more
Please help with a resolution
regards
sentil
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)