[ 
https://issues.apache.org/jira/browse/TIKA-418?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12883511#action_12883511
 ] 

Nick Burch commented on TIKA-418:
---------------------------------

ppsx, ppsm, pptm and pptx are all supported, and I've added a unit test that 
confirms this

.thmx isn't currently supported, but will be after the next POI upgrade. 
However, the .thmx file format doesn't seem to have any textual content in it! 
Certainly none of your sample words show up in the example file

.xps isn't currently supported by POI. Quite a lot of work will be needed for 
it, as it's not really like any of the other currently supported ooxml file 
formats. Please join the POI dev list and start sending patches if you're 
interested in adding support!

> RuntimeException while getting content for ppsx, ppsm, pptm, thmx and xps 
> file types
> ------------------------------------------------------------------------------------
>
>                 Key: TIKA-418
>                 URL: https://issues.apache.org/jira/browse/TIKA-418
>             Project: Tika
>          Issue Type: Bug
>          Components: parser
>    Affects Versions: 0.7
>         Environment: Windows
>            Reporter: Rajiv Kumar
>         Attachments: Guitarras de luna - tierra mestiza.mp3, MSPPT2007.ppsm, 
> MSPPT2007.ppsx, MSPPT2007.pptm, MSPPT2007.pptx, MSPPT2007.thmx, MSPPT2007.xps
>
>
> I am getting the following error
> Unexpected RuntimeException from 
> org.apache.tika.parser.microsoft.ooxml.ooxmlpar...@269b15
> for the following file types
> .PPSM
> .PPSX
> .PPTM
> .THMX
> .XPS
> .XLSB

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to