[ https://issues.apache.org/jira/browse/TIKA-3642?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Tika User updated TIKA-3642:
----------------------------
    Description:
When parsing large PDF files (1.65 GB), we get an out-of-memory error. The version we are using is 2.0.25 (PDFBox).

java.lang.OutOfMemoryError: Java heap space
	at org.apache.pdfbox.pdfparser.COSParser.isString

  was:
When parsing large PDF files (1.65 GB), we get an out-of-memory error. The version we are using is 2.2.1.

java.lang.OutOfMemoryError: Java heap space
	at org.apache.pdfbox.pdfparser.COSParser.isString

> Getting java.lang.OutOfMemoryError: Java heap space when parsing PDF file
> -------------------------------------------------------------------------
>
>                 Key: TIKA-3642
>                 URL: https://issues.apache.org/jira/browse/TIKA-3642
>             Project: Tika
>          Issue Type: Bug
>            Reporter: Tika User
>            Priority: Major
>
> When parsing large PDF files (1.65 GB), we get an out-of-memory error. The
> version we are using is 2.0.25 (PDFBox).
>
> java.lang.OutOfMemoryError: Java heap space
> 	at org.apache.pdfbox.pdfparser.COSParser.isString

--
This message was sent by Atlassian Jira
(v8.20.1#820001)