Hi Rui, Which version of DSpace are you running? If you are running 1.5 or later, you can set the following options in dspace.cfg:
# If true, larger PDFs are written to a temp file as they are indexed...this # is slower, but helps ensure that PDFBox software DSpace uses doesn't eat up # all your memory #pdffilter.largepdfs = true # If true, PDFs which still result in an Out of Memory error from PDFBox # are skipped over...these problematic PDFs will never be indexed until # memory usage can be decreased in the PDFBox software #pdffilter.skiponmemoryexception = true Thanks, Stuart Lewis Digital Services Programmer Te Tumu Herenga The University of Auckland Library Auckland Mail Centre, Private Bag 92019, Auckland 1142, New Zealand Ph: 64 9 373-7599 x81928 http://www.library.auckland.ac.nz/ -----Original Message----- From: Rui Ramos [mailto:[email protected]] Sent: Tuesday, 23 June 2009 11:29 p.m. To: Dspace Tech Subject: [Dspace-tech] filter-media allways get java.lang.OutOfMemoryError:Java heap space Hi *, I'm getting this error when running filter-media ... ERROR filtering, skipping bitstream: Item Handle: 10216/10063 Bundle Name: ORIGINAL File Size: 122589469 Checksum: 7f9aa1bde30b1c3f17b6e8589bcf36f6 (MD5) Asset Store: 0 org.pdfbox.exceptions.WrappedIOException: Java heap space org.pdfbox.exceptions.WrappedIOException: Java heap space at org.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:234) at org.pdfbox.pdmodel.PDDocument.load(PDDocument.java:707) at org.pdfbox.pdmodel.PDDocument.load(PDDocument.java:691) at org.dspace.app.mediafilter.PDFFilter.getDestinationStream(PDFFilter.java :140) at org.dspace.app.mediafilter.MediaFilterManager.processBitstream(MediaFilt erManager.java:652) at org.dspace.app.mediafilter.MediaFilterManager.filterBitstream(MediaFilte rManager.java:554) at org.dspace.app.mediafilter.MediaFilterManager.filterItem(MediaFilterMana ger.java:504) at org.dspace.app.mediafilter.MediaFilterManager.applyFiltersItem(MediaFilt erManager.java:472) at org.dspace.app.mediafilter.MediaFilterManager.applyFiltersAllItems(Media FilterManager.java:425) at org.dspace.app.mediafilter.MediaFilterManager.main(MediaFilterManager.ja va:359) java.lang.OutOfMemoryError: Java heap space Exception in thread "main" java.lang.OutOfMemoryError: Java heap space I don't know if this is because of a big pdf file or another thing that's causing this. Is it possible to prevent the filter command to open pdf files bigger then XX mb ? Or skip some of them ? Another thoughts on how to solve this would be apreciated. Best regards, Rui ------------------------------------------------------------------------------ _______________________________________________ DSpace-tech mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/dspace-tech

