Tim Allison created PDFBOX-1883:
-----------------------------------
Summary: Avoid StringIndexOutOfBoundsException in DateConverter
Key: PDFBOX-1883
URL: https://issues.apache.org/jira/browse/PDFBOX-1883
Project: PDFBox
Issue Type: Bug
Components: PDModel
Affects Versions: 1.8.4
Reporter: Tim Allison
Priority: Trivial
Fix For: 1.8.5
Passing an empty string to parseDate can result in an
ArrayIndexOutOfBoundsException.
{noformat}
Caused by: java.lang.StringIndexOutOfBoundsException: String index out of
range: 0
at java.lang.String.charAt(Unknown Source)
at
org.apache.pdfbox.util.DateConverter.parseDate(DateConverter.java:680)
at
org.apache.pdfbox.util.DateConverter.toCalendar(DateConverter.java:808)
at
org.apache.pdfbox.util.DateConverter.toCalendar(DateConverter.java:780)
at
org.apache.pdfbox.util.DateConverter.toCalendar(DateConverter.java:754)
at org.apache.pdfbox.cos.COSDictionary.getDate(COSDictionary.java:797)
at
org.apache.pdfbox.pdmodel.PDDocumentInformation.getCreationDate(PDDocumentInformation.java:210)
at
org.apache.tika.parser.pdf.PDFParser.extractMetadata(PDFParser.java:170)
at org.apache.tika.parser.pdf.PDFParser.parse(PDFParser.java:142)
at
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:242)
{noformat}
I can't share the triggering document, but I'll submit patch with test case
shortly.
--
This message was sent by Atlassian JIRA
(v6.1.5#6160)