Patches item #2745393, was opened at 2009-04-08 17:09
Message generated for change (Tracker Item Submitted) made by lcs8
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=319984&aid=2745393&group_id=19984

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: None
Group: None
Status: Open
Resolution: None
Priority: 5
Private: No
Submitted By: Larry Stone (lcs8)
Assigned to: Nobody/Anonymous (nobody)
Summary: PDF media filters based on XPDF tools

Initial Comment:
Here are a pair of mediafilters to process PDF files with the
XPDF suite (see http://www.foolabs.com/xpdf/ ) replacing the
one based on PDFBox.  They invoke an external command, which
must be configured.  It has been tested on Unix and the concept
ought to work on Windows (and certainly on MacOS X). 

XPDF2Text is a replacement for the existing PDF media filter, it
creates extracted text using the pdftotext program.  I've observed it
is about 3 times as fast, and much more reliable, than PDFBox.

XPDF2Thumbnail creates a thumbnail image for the first page of
the PDF.  This is especially effective for 3D PDF renderings of
engineering models, but works fine for any document.

See the instructions in xpdf-filters.html to install it.
The thumbnail filter needs an additional image library, but
the text extractor doesn't need anything else.

This code has been tested with DSpace 1.5.1

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=319984&aid=2745393&group_id=19984

------------------------------------------------------------------------------
This SF.net email is sponsored by:
High Quality Requirements in a Collaborative Environment.
Download a free trial of Rational Requirements Composer Now!
http://p.sf.net/sfu/www-ibm-com
_______________________________________________
Dspace-devel mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dspace-devel

Reply via email to