Tim Allison created PDFBOX-1883:
-----------------------------------

             Summary: Avoid StringIndexOutOfBoundsException in DateConverter
                 Key: PDFBOX-1883
                 URL: https://issues.apache.org/jira/browse/PDFBOX-1883
             Project: PDFBox
          Issue Type: Bug
          Components: PDModel
    Affects Versions: 1.8.4
            Reporter: Tim Allison
            Priority: Trivial
             Fix For: 1.8.5


Passing an empty string to parseDate can result in a 
StringIndexOutOfBoundsException.

{noformat}
Caused by: java.lang.StringIndexOutOfBoundsException: String index out of range: 0
        at java.lang.String.charAt(Unknown Source)
        at org.apache.pdfbox.util.DateConverter.parseDate(DateConverter.java:680)
        at org.apache.pdfbox.util.DateConverter.toCalendar(DateConverter.java:808)
        at org.apache.pdfbox.util.DateConverter.toCalendar(DateConverter.java:780)
        at org.apache.pdfbox.util.DateConverter.toCalendar(DateConverter.java:754)
        at org.apache.pdfbox.cos.COSDictionary.getDate(COSDictionary.java:797)
        at org.apache.pdfbox.pdmodel.PDDocumentInformation.getCreationDate(PDDocumentInformation.java:210)
        at org.apache.tika.parser.pdf.PDFParser.extractMetadata(PDFParser.java:170)
        at org.apache.tika.parser.pdf.PDFParser.parse(PDFParser.java:142)
        at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:242)
{noformat}
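
For illustration only, here is a minimal sketch of the failure mode in the 
trace (charAt(0) on an empty string) and of the kind of guard that would 
avoid it. The class EmptyDateGuard and the method firstCharOrNull are 
hypothetical names made up for this example; this is not the DateConverter 
source and not necessarily what the eventual patch does.

```java
class EmptyDateGuard {
    // Hypothetical helper, NOT PDFBox code: it stands in for a parser that
    // looks at the first character of its input, as the trace suggests
    // parseDate does. It returns null instead of throwing when there is
    // nothing to inspect.
    static Character firstCharOrNull(String text) {
        if (text == null) {
            return null;
        }
        String trimmed = text.trim();
        if (trimmed.isEmpty()) {
            // Without this check, charAt(0) below would throw
            // StringIndexOutOfBoundsException for "" or all-whitespace input.
            return null;
        }
        return trimmed.charAt(0);
    }

    public static void main(String[] args) {
        // Unguarded access: the same exception type as in the stack trace.
        try {
            "".charAt(0);
        } catch (StringIndexOutOfBoundsException e) {
            System.out.println("unguarded: " + e.getClass().getSimpleName());
        }
        // Guarded access: empty input is reported as "no value".
        System.out.println("guarded: " + firstCharOrNull(""));
    }
}
```

Whether an empty date string should map to null or be reported some other 
way is a design choice for the actual fix; the sketch assumes null.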

I can't share the triggering document, but I'll submit a patch with a test 
case shortly.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)
