Hello Jose,

you got several types of filter media errors, for the jpeg one best 
check the bitstream to be filtered, might be corrupt.
The "org.pdfbox.pdmodel.PDPageNode.getAllKids(PDPageNode.java:194) 
at..." one, update to pdfbox 1.1.0 (including jempbox and fontbox).

Hope that helps

Claudia


Am 13.07.2010 20:00, schrieb Blanco, Jose:
> Clauida,
>
> I noticed the  the setting were like this:
>
> thumbnail.maxwidth  80
> thumbnail.maxheight 80
>
> And I changed it to this:
>
> thumbnail.maxwidth  = 80
> thumbnail.maxheight = 80
>
> I still see the same sorts of errors, is there a suggestion for how to set 
> these?
>
> -----Original Message-----
> From: Claudia Jürgen [mailto:[email protected]]
> Sent: Tuesday, July 13, 2010 11:46 AM
> To: [email protected]
> Subject: Re: [Dspace-tech] filter-media errors.
>
> Hi Jose,
>
> check the configuration setting in dspace.cfg:
> thumbnail.maxheight
>
> Furthermore you can run filter media with the -v (verbose) option to get
> more information.
>
> Hope that helps
>
> Claudia Jürgen
>
>
> Am 13.07.2010 17:29, schrieb Blanco, Jose:
>> As suggested by dspace, filter-media is set to run nightly.  As our
>> instance has grown, so have the number of files that filter media  is
>> not able to index.  We have 60,738 items in our repository, and as of
>> today, filter-media is not able to index 892.  I'm trying to
>> determine if there is anything that can be done so that as many of
>> these 892 items are able to be indexed.  I have copied portion of the
>> output of filter-media below.  Could someone that better understands
>> filter-media let me know if there is something that can be done.
>> Many thanks!  Jose
>>
>>
>>
>> Applying Media Filters ERROR filtering, skipping bitstream:
>>
>> Item Handle: 2027.42/62012 Bundle Name: ORIGINAL File Size: 58
>> Checksum: a500810f390e82e2aead21d5220e7325 (MD5) Asset Store: 1
>> java.lang.IllegalArgumentException: Width (80) and height (0) cannot
>> be<= 0 java.lang.IllegalArgumentException: Width (80) and height (0)
>> cannot be<= 0 at
>> java.awt.image.DirectColorModel.createCompatibleWritableRaster(DirectColorModel.java:999)
>>
>>
> at java.awt.image.BufferedImage.<init>(BufferedImage.java:312)
>> at
>> org.dspace.app.mediafilter.JPEGFilter.getDestinationStream(JPEGFilter.java:161)
>>
>>
> at
> org.dspace.app.mediafilter.MediaFilterManager.processBitstream(MediaFilterManager.java:674)
>> at
>> org.dspace.app.mediafilter.MediaFilterManager.filterBitstream(MediaFilterManager.java:575)
>>
>>
> at
> org.dspace.app.mediafilter.MediaFilterManager.filterItem(MediaFilterManager.java:525)
>> at
>> org.dspace.app.mediafilter.MediaFilterManager.applyFiltersItem(MediaFilterManager.java:493)
>>
>>
> at
> org.dspace.app.mediafilter.MediaFilterManager.applyFiltersAllItems(MediaFilterManager.java:432)
>> at
>> org.dspace.app.mediafilter.MediaFilterManager.main(MediaFilterManager.java:363)
>>
>>
> ERROR filtering, skipping bitstream:
>>
>> Item Handle: 2027.42/69214 Bundle Name: ORIGINAL File Size: 268039
>> Checksum: 4e64d97f5a151819da52b095b1fef5d3 (MD5) Asset Store: 1
>> java.lang.NullPointerException java.lang.NullPointerException at
>> org.pdfbox.pdmodel.PDPageNode.getAllKids(PDPageNode.java:194) at
>> org.pdfbox.pdmodel.PDPageNode.getAllKids(PDPageNode.java:182) at
>> org.pdfbox.pdmodel.PDDocumentCatalog.getAllPages(PDDocumentCatalog.java:226)
>>
>>
> at org.pdfbox.util.PDFTextStripper.writeText(PDFTextStripper.java:216)
>> at
>> org.dspace.app.mediafilter.PDFFilter.getDestinationStream(PDFFilter.java:139)
>>
>>
> at
> org.dspace.app.mediafilter.MediaFilterManager.processBitstream(MediaFilterManager.java:674)
>> at
>> org.dspace.app.mediafilter.MediaFilterManager.filterBitstream(MediaFilterManager.java:575)
>>
>>
> at
> org.dspace.app.mediafilter.MediaFilterManager.filterItem(MediaFilterManager.java:525)
>> at
>> org.dspace.app.mediafilter.MediaFilterManager.applyFiltersItem(MediaFilterManager.java:493)
>>
>>
> at
> org.dspace.app.mediafilter.MediaFilterManager.applyFiltersAllItems(MediaFilterManager.java:432)
>> at
>> org.dspace.app.mediafilter.MediaFilterManager.main(MediaFilterManager.java:363)
>>
>>
> ERROR filtering, skipping bitstream:
>>
>> Item Handle: 2027.42/55391 Bundle Name: ORIGINAL File Size: 473660
>> Checksum: 3686c4d66884a89d81ddfe420a1b661b (MD5) Asset Store: 1
>> java.io.IOException: Unknown encoding for 'Identity-V'
>> java.io.IOException: Unknown encoding for 'Identity-V' at
>> org.pdfbox.encoding.EncodingManager.getEncoding(EncodingManager.java:82)
>>
>>
> at org.pdfbox.pdmodel.font.PDFont.getEncoding(PDFont.java:612)
>> at org.pdfbox.pdmodel.font.PDFont.encode(PDFont.java:466) at
>> org.pdfbox.util.PDFStreamEngine.showString(PDFStreamEngine.java:325)
>> at org.pdfbox.util.operator.ShowText.process(ShowText.java:64) at
>> org.pdfbox.util.PDFStreamEngine.processOperator(PDFStreamEngine.java:452)
>>
>>
> at
> org.pdfbox.util.PDFStreamEngine.processSubStream(PDFStreamEngine.java:215)
>> at
>> org.pdfbox.util.PDFStreamEngine.processStream(PDFStreamEngine.java:174)
>>
>>
> at org.pdfbox.util.PDFTextStripper.processPage(PDFTextStripper.java:336)
>> at
>> org.pdfbox.util.PDFTextStripper.processPages(PDFTextStripper.java:259)
>>
>>
> at org.pdfbox.util.PDFTextStripper.writeText(PDFTextStripper.java:216)
>> at
>> org.dspace.app.mediafilter.PDFFilter.getDestinationStream(PDFFilter.java:139)
>>
>>
> at
> org.dspace.app.mediafilter.MediaFilterManager.processBitstream(MediaFilterManager.java:674)
>> at
>> org.dspace.app.mediafilter.MediaFilterManager.filterBitstream(MediaFilterManager.java:575)
>>
>>
> at
> org.dspace.app.mediafilter.MediaFilterManager.filterItem(MediaFilterManager.java:525)
>> at
>> org.dspace.app.mediafilter.MediaFilterManager.applyFiltersItem(MediaFilterManager.java:493)
>>
>>
> at
> org.dspace.app.mediafilter.MediaFilterManager.applyFiltersAllItems(MediaFilterManager.java:432)
>> at
>> org.dspace.app.mediafilter.MediaFilterManager.main(MediaFilterManager.java:363)
>>
>>
> ERROR filtering, skipping bitstream:
>>
>> Item Handle: 2027.42/61991 Bundle Name: ORIGINAL File Size: 58
>> Checksum: a500810f390e82e2aead21d5220e7325 (MD5) Asset Store: 1
>> java.lang.IllegalArgumentException: Width (80) and height (0) cannot
>> be<= 0 java.lang.IllegalArgumentException: Width (80) and height (0)
>> cannot be<= 0 at
>> java.awt.image.DirectColorModel.createCompatibleWritableRaster(DirectColorModel.java:999)
>>
>>
> at java.awt.image.BufferedImage.<init>(BufferedImage.java:312)
>> at
>> org.dspace.app.mediafilter.JPEGFilter.getDestinationStream(JPEGFilter.java:161)
>>
>>
> at
> org.dspace.app.mediafilter.MediaFilterManager.processBitstream(MediaFilterManager.java:674)
>> at
>> org.dspace.app.mediafilter.MediaFilterManager.filterBitstream(MediaFilterManager.java:575)
>>
>>
> at
> org.dspace.app.mediafilter.MediaFilterManager.filterItem(MediaFilterManager.java:525)
>> at
>> org.dspace.app.mediafilter.MediaFilterManager.applyFiltersItem(MediaFilterManager.java:493)
>>
>>
> at
> org.dspace.app.mediafilter.MediaFilterManager.applyFiltersAllItems(MediaFilterManager.java:432)
>> at
>> org.dspace.app.mediafilter.MediaFilterManager.main(MediaFilterManager.java:363)
>>
>>
> ERROR filtering, skipping bitstream:
>>
>>
>> Item Handle: 2027.42/50480 Bundle Name: ORIGINAL File Size: 177152
>> Checksum: af66e3bb52ebe7f1b4c9cc06fa9a6257 (MD5) Asset Store: 1
>> java.util.NoSuchElementException java.util.NoSuchElementException at
>> java.util.AbstractList$Itr.next(AbstractList.java:350) at
>> org.textmining.text.extraction.WordExtractor.extractText(WordExtractor.java:150)
>>
>>
> at
> org.dspace.app.mediafilter.WordFilter.getDestinationStream(WordFilter.java:95)
>> at
>> org.dspace.app.mediafilter.MediaFilterManager.processBitstream(MediaFilterManager.java:674)
>>
>>
> at
> org.dspace.app.mediafilter.MediaFilterManager.filterBitstream(MediaFilterManager.java:575)
>> at
>> org.dspace.app.mediafilter.MediaFilterManager.filterItem(MediaFilterManager.java:525)
>>
>>
> at
> org.dspace.app.mediafilter.MediaFilterManager.applyFiltersItem(MediaFilterManager.java:493)
>> at
>> org.dspace.app.mediafilter.MediaFilterManager.applyFiltersAllItems(MediaFilterManager.java:432)
>>
>>
> at
> org.dspace.app.mediafilter.MediaFilterManager.main(MediaFilterManager.java:363)
>> ERROR filtering, skipping bitstream:
>>
>> Item Handle: 2027.42/40280 Bundle Name: ORIGINAL File Size: 90205
>> Checksum: 2437377db51a8e3b9347c784b61906f9 (MD5) Asset Store: 1
>> java.io.IOException: expected='endobj' firstReadAttempt='endobj154'
>> secondReadAttempt='0' org.pdfbox.io.pushbackinputstr...@122c9df
>> java.io.IOException: expected='endobj' firstReadAttempt='endobj154'
>> secondReadAttempt='0' org.pdfbox.io.pushbackinputstr...@122c9df at
>> org.pdfbox.pdfparser.PDFParser.parseObject(PDFParser.java:502) at
>> org.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:176) at
>> org.pdfbox.pdmodel.PDDocument.load(PDDocument.java:707) at
>> org.pdfbox.pdmodel.PDDocument.load(PDDocument.java:691) at
>> org.dspace.app.mediafilter.PDFFilter.getDestinationStream(PDFFilter.java:138)
>>
>>
> at
> org.dspace.app.mediafilter.MediaFilterManager.processBitstream(MediaFilterManager.java:674)
>> at
>> org.dspace.app.mediafilter.MediaFilterManager.filterBitstream(MediaFilterManager.java:575)
>>
>>
> at
> org.dspace.app.mediafilter.MediaFilterManager.filterItem(MediaFilterManager.java:525)
>> at
>> org.dspace.app.mediafilter.MediaFilterManager.applyFiltersItem(MediaFilterManager.java:493)
>>
>>
> at
> org.dspace.app.mediafilter.MediaFilterManager.applyFiltersAllItems(MediaFilterManager.java:432)
>> at
>> org.dspace.app.mediafilter.MediaFilterManager.main(MediaFilterManager.java:363)
>>
>>
> ERROR filtering, skipping bitstream:
>>
>>
>>
>>
>> ------------------------------------------------------------------------------
>>
>>
> This SF.net email is sponsored by Sprint
>> What will you do first with EVO, the first 4G phone? Visit
>> sprint.com/first -- http://p.sf.net/sfu/sprint-com-first
>> _______________________________________________ DSpace-tech mailing
>> list [email protected]
>> https://lists.sourceforge.net/lists/listinfo/dspace-tech
>

-- 
Mit freundlichen Gruessen

Claudia Juergen
Universitaetsbibliothek Dortmund
Eldorado
0231/755-4043
https://eldorado.tu-dortmund.de/

------------------------------------------------------------------------------
This SF.net email is sponsored by Sprint
What will you do first with EVO, the first 4G phone?
Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first
_______________________________________________
DSpace-tech mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dspace-tech

Reply via email to