Hi,
     I have successfully implemented xpdf pdftotext (replacing PDFBox) in 
DSpace 1.5.1.  It works GREAT and so far has filtered 100% of our documents, 
even the ones PDFBox found to be "unfilterable".  Now I'm trying to get 
pdftoppm to work and I'm getting this error:

Applying Media Filters
The following MediaFilters are enabled:
Full Filter Name: org.dspace.app.mediafilter.HTMLFilter
org.dspace.app.mediafilter.HTMLFilter
Full Filter Name: org.dspace.app.mediafilter.WordFilter
org.dspace.app.mediafilter.WordFilter
Full Filter Name: org.dspace.app.mediafilter.JPEGFilter
org.dspace.app.mediafilter.JPEGFilter
Full Filter Name: org.dspace.app.mediafilter.XPDF2Text
org.dspace.app.mediafilter.XPDF2Text
Full Filter Name: org.dspace.app.mediafilter.XPDF2Thumbnail
org.dspace.app.mediafilter.XPDF2Thumbnail
FILTERED: bitstream 443 and created 'CA029045.pdf.txt'
ERROR filtering, skipping bitstream:

        Item Handle: 2121/169228
        Bundle Name: ORIGINAL
        File Size: 2064225
        Checksum: 4216969d76a86e6c9c169bbe0a3cff7d (MD5)
        Asset Store: 0
javax.imageio.IIOException: Can't read input file!
javax.imageio.IIOException: Can't read input file!
        at javax.imageio.ImageIO.read(ImageIO.java:1275)
        at 
org.dspace.app.mediafilter.XPDF2Thumbnail.getDestinationStream(XPDF2Thumbnail.java:229)
        at 
org.dspace.app.mediafilter.MediaFilterManager.processBitstream(MediaFilterManager.java:668)
        at 
org.dspace.app.mediafilter.MediaFilterManager.filterBitstream(MediaFilterManager.java:570)
        at 
org.dspace.app.mediafilter.MediaFilterManager.filterItem(MediaFilterManager.java:520)
        at 
org.dspace.app.mediafilter.MediaFilterManager.applyFiltersItem(MediaFilterManager.java:488)
        at 
org.dspace.app.mediafilter.MediaFilterManager.main(MediaFilterManager.java:379)
 Wrote Item: 2121/169228 to Index at Thu Jul 23 12:16:16 EDT 2009

 I think it is complaining because xpdf2Thumbnail needs to know where the input 
file is and where to put the output file(s)....??  Can anyone help with this?

Thanks,
Sue

Sue Walker-Thornton
ConITS Contract
NASA Langley Research Center
Integrated Library Systems Application & Database Administrator
130 Research Drive
Hampton, VA  23666
Office: (757) 224-4074
Fax:    (757) 224-4001
Pager: (757) 988-2547
Email:  [email protected]<mailto:[email protected]>

------------------------------------------------------------------------------
_______________________________________________
DSpace-tech mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dspace-tech
  • [Dspac... Thornton, Susan M. (LARC-B702)[RAYTHEON TECHNICAL SERVICES COMPANY]
    • [... Fabio N. Kepler
      • ... Thornton, Susan M. (LARC-B702)[RAYTHEON TECHNICAL SERVICES COMPANY]
        • ... Fabio N. Kepler
          • ... Thornton, Susan M. (LARC-B702)[RAYTHEON TECHNICAL SERVICES COMPANY]
            • ... Fabio N. Kepler
              • ... Thornton, Susan M. (LARC-B702)[RAYTHEON TECHNICAL SERVICES COMPANY]
                • ... Fabio N. Kepler

Reply via email to