Hi Rui,

Which version of DSpace are you running? If you are running 1.5 or
later, you can set the following options in dspace.cfg (they are shown
commented out here; remove the leading # from an option line to enable it):

# If true, larger PDFs are written to a temp file as they are indexed...this
# is slower, but helps ensure that PDFBox software DSpace uses doesn't eat up
# all your memory
#pdffilter.largepdfs = true

# If true, PDFs which still result in an Out of Memory error from PDFBox
# are skipped over...these problematic PDFs will never be indexed until 
# memory usage can be decreased in the PDFBox software
#pdffilter.skiponmemoryexception = true
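
For reference, here is a minimal sketch of how the two lines might look once
enabled, assuming you want both behaviours. The option names are exactly the
ones quoted above; turning on both at once is only a suggestion, not a
requirement.

```properties
# Write larger PDFs to a temp file while indexing (slower, lower memory use)
pdffilter.largepdfs = true

# Skip PDFs that still trigger an Out of Memory error in PDFBox
pdffilter.skiponmemoryexception = true
```

After saving dspace.cfg, run filter-media again. Any PDF that is skipped
because of the second option will not be indexed.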

Thanks,


Stuart Lewis
Digital Services Programmer
Te Tumu Herenga The University of Auckland Library
Auckland Mail Centre, Private Bag 92019, Auckland 1142, New Zealand
Ph: 64 9 373-7599 x81928
http://www.library.auckland.ac.nz/




-----Original Message-----
From: Rui Ramos [mailto:[email protected]] 
Sent: Tuesday, 23 June 2009 11:29 p.m.
To: Dspace Tech
Subject: [Dspace-tech] filter-media allways get java.lang.OutOfMemoryError:Java heap space

Hi *,

 I'm getting this error when running filter-media

...
ERROR filtering, skipping bitstream:

        Item Handle: 10216/10063
        Bundle Name: ORIGINAL
        File Size: 122589469
        Checksum: 7f9aa1bde30b1c3f17b6e8589bcf36f6 (MD5)
        Asset Store: 0
org.pdfbox.exceptions.WrappedIOException: Java heap space
org.pdfbox.exceptions.WrappedIOException: Java heap space
        at org.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:234)
        at org.pdfbox.pdmodel.PDDocument.load(PDDocument.java:707)
        at org.pdfbox.pdmodel.PDDocument.load(PDDocument.java:691)
        at org.dspace.app.mediafilter.PDFFilter.getDestinationStream(PDFFilter.java:140)
        at org.dspace.app.mediafilter.MediaFilterManager.processBitstream(MediaFilterManager.java:652)
        at org.dspace.app.mediafilter.MediaFilterManager.filterBitstream(MediaFilterManager.java:554)
        at org.dspace.app.mediafilter.MediaFilterManager.filterItem(MediaFilterManager.java:504)
        at org.dspace.app.mediafilter.MediaFilterManager.applyFiltersItem(MediaFilterManager.java:472)
        at org.dspace.app.mediafilter.MediaFilterManager.applyFiltersAllItems(MediaFilterManager.java:425)
        at org.dspace.app.mediafilter.MediaFilterManager.main(MediaFilterManager.java:359)
java.lang.OutOfMemoryError: Java heap space
Exception in thread "main" java.lang.OutOfMemoryError: Java heap space


I don't know whether this is caused by a large PDF file or by
something else.

Is it possible to prevent the filter command from opening PDF files
bigger than XX MB?

Or to skip some of them?

Any other thoughts on how to solve this would be appreciated.

 Best regards, Rui


