[ 
https://issues.apache.org/jira/browse/PDFBOX-1907?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14179772#comment-14179772
 ] 

Maruan Sahyoun commented on PDFBOX-1907:
----------------------------------------

[~mgoddard] I’ve created a separate issue for that as - although the end result 
might be the same - the reason is probably not. I’ve also tested with the 
latests snapshots and confirm that this is still an issue.

Could you take the relevant information and put it into PDFBOX-2445?

> Out of memory - COSDocument (RandomAccessBuffer)
> ------------------------------------------------
>
>                 Key: PDFBOX-1907
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-1907
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Parsing
>    Affects Versions: 1.8.4
>         Environment: windows xp 64
> jdk 8 32 bit
>            Reporter: Jim Kay
>            Assignee: Andreas Lehmkühler
>              Labels: regression
>         Attachments: 8283.zip.001, 8283.zip.002, 8283.zip.003
>
>
> Possibly related to PDFBOX-1777.
> Exception in thread "main" java.lang.OutOfMemoryError: Java heap space
>       at java.util.AbstractCollection.toArray(AbstractCollection.java:136)
>       at java.util.ArrayList.<init>(ArrayList.java:168)
>       at org.apache.pdfbox.cos.COSDocument.getObjects(COSDocument.java:518)
>       at org.apache.pdfbox.cos.COSDocument.getObjects(COSDocument.java:518)
>       at org.apache.pdfbox.cos.COSDocument.close(COSDocument.java:575)
>       at org.apache.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:254)
>       at techref.Testpdfbox.main(Testpdfbox.java:36)
> The heap space is set to -Xmx1640m
> The pdf docoument is parsed OK with version 1.8.3 but fails with 1.8.4
> The large pdf document has the following attributes.
> pdDoc.getCurrentAccessPermission.canExtractContent = true
> pdDoc.getCurrentAccessPermission.canExtractForAccessibility = true
> pdDoc.getNumberOfPages = 228
> pdDoc.getDocumentCatalog.getLanguage = null
> pdDoc.getDocumentCatalog.getPageLayout = SinglePage
> pdDoc.getDocumentCatalog.getPageMode = UseNone
> pdDoc.getDocumentCatalog.getVersion = null
> Page Count=228
> Title=Microsoft Word - FEA.doc
> Author=null
> Subject=null
> Keywords=null
> Creator=Windows NT 4.0
> Producer=Acrobat Distiller 4.05 for Windows
> Creation Date=Fri Jun 29 15:29:59 BST 2001
> Modification Date=Mon Jul 02 15:41:18 BST 2001
> Trapped=null
> Dictionary=COSDictionary{(COSName{CreationDate}:COSString{D:20010629142959}) 
> (COSName{Producer}:COSString{Acrobat Distiller 4.05 for Windows}) 
> (COSName{Creator}:COSString{Windows NT 4.0}) 
> (COSName{Title}:COSString{Microsoft Word - FEA.doc}) 
> (COSName{ModDate}:COSString{D:20010702164118+02'00'}) }



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to