George,
You should be able to switch to xpdf2text with little effort. It saved us
loads of time trying to figure out why filter-media was taking so long to run
(ours sometimes ran for days at a time) and it now successfully filters 100% of
our PDF documents except for those that are truly corrupt (so it also helps us
identify corrupt docs in our repository). You can find installation
documentation at
https://jira.duraspace.org/secure/attachment/10527/xpdf-filters.html
Good luck,
Sue
Sue Walker-Thornton
Software Developer/Database Administrator
NASA Langley Research Center|LITES Contract
(757) 224-4074
-----Original Message-----
From: George Stanley Kozak [mailto:[email protected]]
Sent: Monday, December 13, 2010 9:52 AM
To: Thornton, Susan M. (LARC-B702)[LITES]; Sean Carte
Cc: [email protected]
Subject: RE: [Dspace-tech] Question about filter-media hanging
Sue and Sean:
Thanks very much. I will look into xpdf2text.
George Kozak
Digital Library Specialist
Cornell University Library Information Technologies (CUL-IT)
501 Olin Library
Cornell University
Ithaca, NY 14853
607-255-8924
-----Original Message-----
From: Thornton, Susan M. (LARC-B702)[LITES] [mailto:[email protected]]
Sent: Monday, December 13, 2010 8:54 AM
To: Sean Carte; George Stanley Kozak
Cc: [email protected]
Subject: RE: [Dspace-tech] Question about filter-media hanging
We had lots of problems with filter-media until we changed pdfbox to xpdf2text.
Sue Walker-Thornton
Software Developer/Database Administrator
NASA Langley Research Center|LITES Contract
(757) 224-4074
-----Original Message-----
From: Sean Carte [mailto:[email protected]]
Sent: Monday, December 13, 2010 4:27 AM
To: George Stanley Kozak
Cc: [email protected]
Subject: Re: [Dspace-tech] Question about filter-media hanging
On 10 December 2010 21:22, George Stanley Kozak <[email protected]> wrote:
> Hi.
>
> I am running DSpace 1.6.2. Last week we batch loaded about 1500 PDF files
> and this week I loaded about 2300 images (mostly Jpegs). I noticed today
> that the thumbnails hadn't been generated by the filter-media program (which
> runs nightly). When I went to look, I discovered several filter-media
> programs running. It looks like the jobs were hanging up and then the next
> night, a new one started up and that one got hung up, etc.
>
> I have tried running filter-media in verbose mode, but I am not seeing
> anything in particular that is causing the hang-up. No Java errors; it just
> seems to hang.
>
> Does anyone have any suggestions as to what I should do next?
>
> George Kozak
Have a look at the suggestions in this thread:
http://old.nabble.com/-Dspace-tech--filter-media-hanging-td29158622.html#a29158622
Updating pdfbox worked for me.
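
Separately from fixing the hang itself, the pile-up described above (a new
nightly run starting while the previous one is still stuck) can be contained
with a small wrapper around the cron job. What follows is only a minimal
sketch, not anything DSpace ships: the function name run_exclusive, the lock
file path and the six-hour limit are all assumptions made for illustration,
and it relies on fcntl, so it is Unix-only. It skips the run if another one
holds the lock, and kills the whole process group if the run exceeds the limit.

```python
import fcntl
import os
import signal
import subprocess
import sys

LOCK_PATH = "/tmp/filter-media.lock"  # hypothetical lock file location
TIMEOUT_SECONDS = 6 * 60 * 60         # assumed limit; tune for your repository


def run_exclusive(command, lock_path, timeout):
    """Run command unless another run holds the lock; kill it after timeout.

    Returns "skipped", "timed out", or "exited N" (N is the exit code).
    """
    with open(lock_path, "w") as lock_file:
        try:
            # Non-blocking exclusive lock: fails at once if a run is active.
            fcntl.flock(lock_file, fcntl.LOCK_EX | fcntl.LOCK_NB)
        except BlockingIOError:
            return "skipped"
        # New session, so a launcher script and the JVM it starts
        # share one process group that can be killed together.
        proc = subprocess.Popen(command, start_new_session=True)
        try:
            return "exited %d" % proc.wait(timeout=timeout)
        except subprocess.TimeoutExpired:
            os.killpg(proc.pid, signal.SIGKILL)
            proc.wait()
            return "timed out"


if __name__ == "__main__" and len(sys.argv) > 1:
    # The command to guard is passed on the command line.
    print(run_exclusive(sys.argv[1:], LOCK_PATH, TIMEOUT_SECONDS))
```

Saved as, say, run_once.py (a made-up name), the nightly cron entry would then
call something like "python3 run_once.py /dspace/bin/dspace filter-media",
where /dspace stands in for your own install directory. This only stops the
jobs from stacking up; it does not address why a given PDF makes the filter hang.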
Sean
--
Sean Carte
esAL Library Systems Manager
+27 72 898 8775
+27 31 373 2490
fax: 0866741254
http://esal.dut.ac.za/
_______________________________________________
DSpace-tech mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dspace-tech