Scott (and other Developers),

I'd actually recommend that we should *backport* a fix for this issue to 
1.4.2.   I know these new options are available in 1.5.  But, from what 
I've heard (both on & off list), this OutOfMemory problem with the PDF 
filter is becoming more and more prevalent in 1.4.2 installations of DSpace.

It seems the "PDFBox" software (which we use w/the PDF filter) still 
hasn't fixed this issue themselves, and that's the software that is 
eating up all the server memory.  I logged the below bug with PDFBox 
back in October, but haven't received any fix/resolution from their end:

http://sourceforge.net/tracker/index.php?func=detail&aid=1805929&group_id=78314&atid=552832

I created more of a "hack" way to get around this issue in our 1.4.2 
instance, but there's definitely a much more elegant way of doing so in 
the upcoming 1.5 release (with the 'pdffilter.skiponmemoryexception' option)

So, we might want to considering helping folks out and backporting a 
patch for DSpace 1.4.2 which can be installed to enable those options in 
the 1.4.2 dspace.cfg.

I believe Graham was the one to add in those new options (correct me if 
I'm wrong, Graham)??  If he doesn't already have a 1.4.2 patch and/or 
cannot find time to create one, I should be able to create one later 
this week (i.e. hopefully by Fri) for all those struggling with this 
problem.

- Tim


Thornton, Susan M. (LARC-B702)[NCI INFORMATION SYSTEMS] wrote:
> Oh well...we are running 1.4.2.  :-(   Any other ideas?
> Thanks,
> Sue
> 
> -----Original Message-----
> From: Scott Phillips [mailto:[EMAIL PROTECTED] 
> Sent: Wednesday, March 05, 2008 12:50 AM
> To: Thornton, Susan M. (LARC-B702)[NCI INFORMATION SYSTEMS]
> Cc: [email protected]
> Subject: Re: [Dspace-tech] Error in filter-media
> 
> 
> Ah,
> 
> Yes, these parameters only apply to DSpace 1.5 and are not available  
> in previous versions.
> 
> Scott--
> 
> 
> On Mar 4, 2008, at 8:51 PM, Thornton, Susan M. (LARC-B702)[NCI  
> INFORMATION SYSTEMS] wrote:
> 
>> Hi again Scott,
>>     I added both of those to dspace.cfg and it didn't seem to have any
>> effect - I got the same error again:
>>
>> Exception in thread "main" java.lang.OutOfMemoryError: Java heap space
>>
>> Are those parameters valid for DSpace 1.4.2?
>>
>> Thanks again,
>> Sue
>>
>>
>>
>> -----Original Message-----
>> From: Scott Phillips [mailto:[EMAIL PROTECTED]
>> Sent: Tuesday, March 04, 2008 7:28 PM
>> To: Thornton, Susan M. (LARC-B702)[NCI INFORMATION SYSTEMS]
>> Cc: [email protected]
>> Subject: Re: [Dspace-tech] Error in filter-media
>>
>>
>> Oh, Sorry.
>>
>> The parameters go into your dspace.cfg. They are both boolean
>> parameters so just add:
>>
>> pdffilter.largepdfs = true;
>>
>> or
>>
>> pdffilter.skiponmemoryexception = true;
>>
>> Then re-run your dsrun command and they will take effect.
>>
>> Scott--
>>
>> On Mar 4, 2008, at 6:13 PM, Thornton, Susan M. (LARC-B702)[NCI
>> INFORMATION SYSTEMS] wrote:
>>
>>> Hi Scott,
>>>     Thanks for the info, but can you tell me exactly where to put/
>>> how to format the parameters?  Do they go on the dsrun command?
>>> Thanks again,
>>> Sue
>>>
>>> From: Scott Phillips [mailto:[EMAIL PROTECTED]
>>> Sent: Tuesday, March 04, 2008 6:51 PM
>>> To: Thornton, Susan M. (LARC-B702)[NCI INFORMATION SYSTEMS]
>>> Cc: [email protected]
>>> Subject: Re: [Dspace-tech] Error in filter-media
>>>
>>>
>>> Have you tried using the "pdffilter.largepdfs" and
>>> "pdffilter.skiponmemoryexception" parameters?
>>>
>>> Turning on largepdfs will tell pdfbox to use a temporary file while
>>> processing the pdf instead of keeping it all in memory. It will use
>>> the standard java.io.tmpdir directory to store them in. In addition
>>> using skip memory exception will allow the process to just continue
>>> instead of blocking everything else.
>>>
>>> Scott--
>>>
>>>
>>> On Mar 4, 2008, at 3:12 PM, Thornton, Susan M. (LARC-B702)[NCI
>>> INFORMATION SYSTEMS] wrote:
>>>
>>>
>>> We have recently been getting the following error in the filter-
>>> media cron:
>>>
>>> Exception in thread "main" java.lang.OutOfMemoryError: Java heap  
>>> space
>>>
>>>
>>> I changed -Xmx256m in the java command in dsrun to -Xmx512m
>>> (documentation says that's the max you can set it to) and we're
>>> still getting the error.  How can we get rid of this error?
>>>
>>> Here is the full command in our dsrun script:
>>>
>>> java -Xmx512m -classpath $FULLPATH "$@"
>>>
>>> We are running a production instance of DSpace 1.4.2 with postgreSQL
>>> 7.3.4 on a Solaris 10 platform.
>>>
>>> Thanks in advance.
>>> Sue
>>>
>>>
>>> Sue Walker-Thornton
>>> ConITS Contract
>>> NASA Langley Research Center
>>> Integrated Library Systems Application Administrator/DBA
>>> 130 Research Drive
>>> Hampton, VA  23666
>>> Office: (757) 224-4074
>>> Fax:    (757) 224-4001
>>> Pager: (757) 988-2547
>>> Email:  [EMAIL PROTECTED]
>>>
>>>
> ------------------------------------------------------------------------
>> -
>>> This SF.net email is sponsored by: Microsoft
>>> Defy all challenges. Microsoft(R) Visual Studio 2008.
>>>
> http://clk.atdmt.com/MRT/go/vse0120000070mrt/direct/01/_________________
>> ______________________________
>>> DSpace-tech mailing list
>>> [email protected]
>>> https://lists.sourceforge.net/lists/listinfo/dspace-tech
>>>
> 
> 
> -------------------------------------------------------------------------
> This SF.net email is sponsored by: Microsoft
> Defy all challenges. Microsoft(R) Visual Studio 2008.
> http://clk.atdmt.com/MRT/go/vse0120000070mrt/direct/01/
> _______________________________________________
> DSpace-tech mailing list
> [email protected]
> https://lists.sourceforge.net/lists/listinfo/dspace-tech
> 

-- 

========================================
Tim Donohue
Research Programmer, Illinois Digital Environment for
Access to Learning and Scholarship (IDEALS)
135 Grainger Engineering Library
University of Illinois at Urbana-Champaign

email: [EMAIL PROTECTED]
web:   http://www.ideals.uiuc.edu
phone: (217) 333-4648
fax:   (217) 244-7764
========================================

-------------------------------------------------------------------------
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2008.
http://clk.atdmt.com/MRT/go/vse0120000070mrt/direct/01/
_______________________________________________
DSpace-tech mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dspace-tech
  • [Dspace-tech] Erro... Thornton, Susan M. (LARC-B702)[NCI INFORMATION SYSTEMS]
    • Re: [Dspace-t... Scott Phillips
      • Re: [Dspa... Thornton, Susan M. (LARC-B702)[NCI INFORMATION SYSTEMS]
        • Re: [... Scott Phillips
          • R... Thornton, Susan M. (LARC-B702)[NCI INFORMATION SYSTEMS]
            • ... Scott Phillips
              • ... Thornton, Susan M. (LARC-B702)[NCI INFORMATION SYSTEMS]
                • ... Tim Donohue
                • ... Thornton, Susan M. (LARC-B702)[NCI INFORMATION SYSTEMS]
                • ... Rowan Brownlee
                • ... Jayan Chirayath Kurian

Reply via email to