Thanks on the info Stuart,
It's version 1.5.1 in the moment. I made the changes you mention and it
processed some more. I still have a memory problem do
java.lang.Throwable: Warning: You did not close the PDF Document
at org.pdfbox.cos.COSDocument.finalize(COSDocument.java:418)
at java.lang.ref.Finalizer.invokeFinalizeMethod(Native Method)
at java.lang.ref.Finalizer.runFinalizer(Finalizer.java:83)
at java.lang.ref.Finalizer.access$100(Finalizer.java:14)
at java.lang.ref.Finalizer
$FinalizerThread.run(Finalizer.java:160)
Exception in thread "main" java.lang.OutOfMemoryError: Java heap space
Don't know if it's a bad pdf that's causing this. Anyway the process
should continue with the rest, right ?
Any thoughts on how to solve this issue ?
Cheers, Rui
On Wed, 2009-06-24 at 09:16 +1200, Stuart Lewis wrote:
> Hi Rui,
>
> Which version of DSpace are you running? If you are running 1.5 or
> later, you can set the following options in dspace.cfg:
>
> # If true, larger PDFs are written to a temp file as they are
> indexed...this
> # is slower, but helps ensure that PDFBox software DSpace uses doesn't
> eat up
> # all your memory
> #pdffilter.largepdfs = true
>
> # If true, PDFs which still result in an Out of Memory error from PDFBox
> # are skipped over...these problematic PDFs will never be indexed until
> # memory usage can be decreased in the PDFBox software
> #pdffilter.skiponmemoryexception = true
>
> Thanks,
>
>
> Stuart Lewis
> Digital Services Programmer
> Te Tumu Herenga The University of Auckland Library
> Auckland Mail Centre, Private Bag 92019, Auckland 1142, New Zealand
> Ph: 64 9 373-7599 x81928
> http://www.library.auckland.ac.nz/
>
>
>
>
> -----Original Message-----
> From: Rui Ramos [mailto:[email protected]]
> Sent: Tuesday, 23 June 2009 11:29 p.m.
> To: Dspace Tech
> Subject: [Dspace-tech] filter-media allways get
> java.lang.OutOfMemoryError:Java heap space
>
> Hi *,
>
> I'm getting this error when running filter-media
>
> ...
> ERROR filtering, skipping bitstream:
>
> Item Handle: 10216/10063
> Bundle Name: ORIGINAL
> File Size: 122589469
> Checksum: 7f9aa1bde30b1c3f17b6e8589bcf36f6 (MD5)
> Asset Store: 0
> org.pdfbox.exceptions.WrappedIOException: Java heap space
> org.pdfbox.exceptions.WrappedIOException: Java heap space
> at org.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:234)
> at org.pdfbox.pdmodel.PDDocument.load(PDDocument.java:707)
> at org.pdfbox.pdmodel.PDDocument.load(PDDocument.java:691)
> at
> org.dspace.app.mediafilter.PDFFilter.getDestinationStream(PDFFilter.java
> :140)
> at
> org.dspace.app.mediafilter.MediaFilterManager.processBitstream(MediaFilt
> erManager.java:652)
> at
> org.dspace.app.mediafilter.MediaFilterManager.filterBitstream(MediaFilte
> rManager.java:554)
> at
> org.dspace.app.mediafilter.MediaFilterManager.filterItem(MediaFilterMana
> ger.java:504)
> at
> org.dspace.app.mediafilter.MediaFilterManager.applyFiltersItem(MediaFilt
> erManager.java:472)
> at
> org.dspace.app.mediafilter.MediaFilterManager.applyFiltersAllItems(Media
> FilterManager.java:425)
> at
> org.dspace.app.mediafilter.MediaFilterManager.main(MediaFilterManager.ja
> va:359)
> java.lang.OutOfMemoryError: Java heap space Exception in thread "main"
> java.lang.OutOfMemoryError: Java heap space
>
>
> I don't know if this is because of a big pdf file or another thing
> that's causing this.
>
> Is it possible to prevent the filter command to open pdf files bigger
> then XX mb ?
>
> Or skip some of them ?
>
> Another thoughts on how to solve this would be apreciated.
>
> Best regards, Rui
>
>
>
>
------------------------------------------------------------------------------
_______________________________________________
DSpace-tech mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dspace-tech