Tim Allison created TIKA-2523:
---------------------------------
Summary: Regression in ppt parsing
Key: TIKA-2523
URL: https://issues.apache.org/jira/browse/TIKA-2523
Project: Tika
Issue Type: Bug
Reporter: Tim Allison
Attachments: 802350.ppt
We noticed a regression in ppt parsing in POI 3.17 while running the large
scale regression tests in prep for the release of Tika 1.17. There are about
200 new exceptions, but it looks like there is only one cause.
Stacktrace:
{noformat}
org.apache.poi.hslf.exceptions.HSLFException: Couldn't instantiate the class
for type with id 1000 on class class org.apache.poi.hslf.record.Document :
java.lang.reflect.InvocationTargetException
Cause was : org.apache.poi.hslf.exceptions.HSLFException: Couldn't instantiate
the class for type with id 1010 on class class
org.apache.poi.hslf.record.Environment :
java.lang.reflect.InvocationTargetException
Cause was : org.apache.poi.hslf.exceptions.HSLFException: Couldn't instantiate
the class for type with id 2005 on class class
org.apache.poi.hslf.record.FontCollection :
java.lang.reflect.InvocationTargetException
Cause was : java.lang.IllegalArgumentException: typeface can't be null nor empty
at
org.apache.poi.hslf.record.Record.createRecordForType(Record.java:186)
at
org.apache.poi.hslf.record.Record.buildRecordAtOffset(Record.java:104)
at
org.apache.poi.hslf.usermodel.HSLFSlideShowImpl.read(HSLFSlideShowImpl.java:279)
at
org.apache.poi.hslf.usermodel.HSLFSlideShowImpl.buildRecords(HSLFSlideShowImpl.java:260)
at
org.apache.poi.hslf.usermodel.HSLFSlideShowImpl.<init>(HSLFSlideShowImpl.java:166)
at
org.apache.poi.hslf.usermodel.HSLFSlideShow.<init>(HSLFSlideShow.java:181)
at
org.apache.tika.parser.microsoft.HSLFExtractor.parse(HSLFExtractor.java:78)
{noformat}
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)