Scott (and other Developers), I'd actually recommend that we *backport* a fix for this issue to 1.4.2. I know these new options are available in 1.5, but from what I've heard (both on and off list), this OutOfMemory problem with the PDF filter is becoming more and more prevalent in 1.4.2 installations of DSpace.
It seems the "PDFBox" software (which we use with the PDF filter) still hasn't fixed this issue, and that's the software that is eating up all the server memory. I logged the bug below with PDFBox back in October, but haven't received any fix/resolution from their end:

http://sourceforge.net/tracker/index.php?func=detail&aid=1805929&group_id=78314&atid=552832

I created more of a "hack" way to get around this issue in our 1.4.2 instance, but there's definitely a much more elegant way of doing so in the upcoming 1.5 release (with the 'pdffilter.skiponmemoryexception' option).

So, we might want to consider helping folks out and backporting a patch for DSpace 1.4.2 which can be installed to enable those options in the 1.4.2 dspace.cfg. I believe Graham was the one to add those new options (correct me if I'm wrong, Graham)? If he doesn't already have a 1.4.2 patch and/or cannot find time to create one, I should be able to create one later this week (i.e. hopefully by Fri) for all those struggling with this problem.

- Tim

Thornton, Susan M. (LARC-B702)[NCI INFORMATION SYSTEMS] wrote:
> Oh well...we are running 1.4.2. :-( Any other ideas?
> Thanks,
> Sue
>
> -----Original Message-----
> From: Scott Phillips [mailto:[EMAIL PROTECTED]
> Sent: Wednesday, March 05, 2008 12:50 AM
> To: Thornton, Susan M. (LARC-B702)[NCI INFORMATION SYSTEMS]
> Cc: [email protected]
> Subject: Re: [Dspace-tech] Error in filter-media
>
>
> Ah,
>
> Yes, these parameters only apply to DSpace 1.5 and are not available
> in previous versions.
>
> Scott--
>
>
> On Mar 4, 2008, at 8:51 PM, Thornton, Susan M. (LARC-B702)[NCI
> INFORMATION SYSTEMS] wrote:
>
>> Hi again Scott,
>> I added both of those to dspace.cfg and it didn't seem to have any
>> effect - I got the same error again:
>>
>> Exception in thread "main" java.lang.OutOfMemoryError: Java heap space
>>
>> Are those parameters valid for DSpace 1.4.2?
>>
>> Thanks again,
>> Sue
>>
>>
>> -----Original Message-----
>> From: Scott Phillips [mailto:[EMAIL PROTECTED]
>> Sent: Tuesday, March 04, 2008 7:28 PM
>> To: Thornton, Susan M. (LARC-B702)[NCI INFORMATION SYSTEMS]
>> Cc: [email protected]
>> Subject: Re: [Dspace-tech] Error in filter-media
>>
>>
>> Oh, Sorry.
>>
>> The parameters go into your dspace.cfg. They are both boolean
>> parameters so just add:
>>
>> pdffilter.largepdfs = true;
>>
>> or
>>
>> pdffilter.skiponmemoryexception = true;
>>
>> Then re-run your dsrun command and they will take effect.
>>
>> Scott--
>>
>> On Mar 4, 2008, at 6:13 PM, Thornton, Susan M. (LARC-B702)[NCI
>> INFORMATION SYSTEMS] wrote:
>>
>>> Hi Scott,
>>> Thanks for the info, but can you tell me exactly where to put/
>>> how to format the parameters? Do they go on the dsrun command?
>>> Thanks again,
>>> Sue
>>>
>>> From: Scott Phillips [mailto:[EMAIL PROTECTED]
>>> Sent: Tuesday, March 04, 2008 6:51 PM
>>> To: Thornton, Susan M. (LARC-B702)[NCI INFORMATION SYSTEMS]
>>> Cc: [email protected]
>>> Subject: Re: [Dspace-tech] Error in filter-media
>>>
>>>
>>> Have you tried using the "pdffilter.largepdfs" and
>>> "pdffilter.skiponmemoryexception" parameters?
>>>
>>> Turning on largepdfs will tell pdfbox to use a temporary file while
>>> processing the pdf instead of keeping it all in memory. It will use
>>> the standard java.io.tmpdir directory to store them in. In addition
>>> using skip memory exception will allow the process to just continue
>>> instead of blocking everything else.
>>>
>>> Scott--
>>>
>>>
>>> On Mar 4, 2008, at 3:12 PM, Thornton, Susan M.
>>> (LARC-B702)[NCI INFORMATION SYSTEMS] wrote:
>>>
>>>
>>> We have recently been getting the following error in the
>>> filter-media cron:
>>>
>>> Exception in thread "main" java.lang.OutOfMemoryError: Java heap space
>>>
>>>
>>> I changed -Xmx256m in the java command in dsrun to -Xmx512m
>>> (documentation says that's the max you can set it to) and we're
>>> still getting the error. How can we get rid of this error?
>>>
>>> Here is the full command in our dsrun script:
>>>
>>> java -Xmx512m -classpath $FULLPATH "$@"
>>>
>>> We are running a production instance of DSpace 1.4.2 with postgreSQL
>>> 7.3.4 on a Solaris 10 platform.
>>>
>>> Thanks in advance.
>>> Sue
>>>
>>>
>>> Sue Walker-Thornton
>>> ConITS Contract
>>> NASA Langley Research Center
>>> Integrated Library Systems Application Administrator/DBA
>>> 130 Research Drive
>>> Hampton, VA 23666
>>> Office: (757) 224-4074
>>> Fax: (757) 224-4001
>>> Pager: (757) 988-2547
>>> Email: [EMAIL PROTECTED]
>>>
>>> -------------------------------------------------------------------------
>>> This SF.net email is sponsored by: Microsoft
>>> Defy all challenges. Microsoft(R) Visual Studio 2008.
>>> http://clk.atdmt.com/MRT/go/vse0120000070mrt/direct/01/
>>> _______________________________________________
>>> DSpace-tech mailing list
>>> [email protected]
>>> https://lists.sourceforge.net/lists/listinfo/dspace-tech
>>>

--
========================================
Tim Donohue
Research Programmer, Illinois Digital Environment for Access to Learning and Scholarship (IDEALS)
135 Grainger Engineering Library
University of Illinois at Urbana-Champaign
email: [EMAIL PROTECTED]
web: http://www.ideals.uiuc.edu
phone: (217) 333-4648
fax: (217) 244-7764
========================================
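
For anyone trying to picture what a "skip on memory exception" option might do, here is a rough Java sketch. It is not the DSpace 1.5 code and not the patch discussed above: the class name, the Extractor interface, and the readFlag and filter helpers are all made up for illustration, and the out-of-memory condition is simulated rather than produced by PDFBox. It only assumes the option is read from dspace.cfg-style key/value text with java.util.Properties.

```java
import java.io.IOException;
import java.io.StringReader;
import java.io.UncheckedIOException;
import java.util.Properties;

public class SkipOnMemoryExceptionSketch {

    /** Hypothetical stand-in for the PDF text-extraction step (PDFBox in DSpace). */
    public interface Extractor {
        String extract(String bitstreamName);
    }

    /**
     * Reads a boolean option from dspace.cfg-style text using java.util.Properties.
     * Only the exact value "true" (ignoring case and surrounding spaces) counts as on.
     */
    public static boolean readFlag(String cfgText, String key) {
        Properties props = new Properties();
        try {
            props.load(new StringReader(cfgText));
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        String value = props.getProperty(key);
        return value != null && value.trim().equalsIgnoreCase("true");
    }

    /**
     * Runs the extractor for one file. If it runs out of heap and the skip flag
     * is on, logs the failure and returns null so the caller can move on to the
     * next file; if the flag is off, the error is rethrown and the run aborts.
     */
    public static String filter(String name, Extractor extractor, boolean skipOnMemoryException) {
        try {
            return extractor.extract(name);
        } catch (OutOfMemoryError e) {
            if (!skipOnMemoryException) {
                throw e;
            }
            System.err.println("Skipping " + name + ": " + e.getMessage());
            return null;
        }
    }

    public static void main(String[] args) {
        boolean skip = readFlag("pdffilter.skiponmemoryexception = true\n",
                "pdffilter.skiponmemoryexception");

        // Simulated extractor: pretends that any file named "huge..." exhausts the heap.
        Extractor simulated = name -> {
            if (name.startsWith("huge")) {
                throw new OutOfMemoryError("Java heap space");
            }
            return "text of " + name;
        };

        for (String name : new String[] {"small.pdf", "huge.pdf", "other.pdf"}) {
            String text = filter(name, simulated, skip);
            System.out.println(name + " -> " + (text == null ? "skipped" : text));
        }
    }
}
```

Two caveats. Catching OutOfMemoryError is a last resort: it keeps the batch moving, but the JVM may already be in poor shape, so a larger heap or the temporary-file approach is still the better fix where available. Also, with plain java.util.Properties a trailing ";" becomes part of the value, so this sketch treats "true;" as off; whether DSpace's own configuration reader tolerates the semicolon is not something this thread confirms.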

