Hello,

I filed a Jira issue for DSpace 5.x here:
https://jira.duraspace.org/browse/DS-3516

Would be nice if someone could test this on DSpace 6.x and see if it is
still an issue.
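
For anyone testing on 6.x, a rough way to spot the problem in filter-media
output is sketched below in Python. It is not part of DSpace. It assumes the
"File:" lines look like the ones in the logs quoted below and that a thumbnail
name is the source file name plus ".jpg". The function name
unexpected_thumbnails and the file-extension check are hypothetical stand-ins
for the real bitstream format check.

```python
import re

# Matches lines such as "File: strengtheningPig.jpg.jpg" and captures the
# source file name, i.e. the thumbnail name minus its trailing ".jpg".
FILE_LINE = re.compile(r"^File:\s*(?P<name>.+)\.jpg$")


def unexpected_thumbnails(log_text, allowed_extensions=(".pdf",)):
    """Return source file names that were thumbnailed but do not look like PDFs.

    Assumption: the file extension is used as a proxy for the bitstream
    format, which is only an approximation of what DSpace checks.
    """
    unexpected = []
    for line in log_text.splitlines():
        match = FILE_LINE.match(line.strip())
        if not match:
            continue
        source_name = match.group("name")
        if not source_name.lower().endswith(tuple(allowed_extensions)):
            unexpected.append(source_name)
    return unexpected


# Sample taken from the filter-media output quoted in this thread.
sample = """\
File: EnvtNaturalRes.jpg.jpg
FILTERED: bitstream 80165 (item: 10568/76133) and created
'EnvtNaturalRes.jpg.jpg'
File: zemadim_2016.pdf.jpg
"""

print(unexpected_thumbnails(sample))
```

An empty list would suggest the plugin only touched PDFs in that run; any
names returned are candidates for the behavior described in DS-3516.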

Cheers,

On Thu, Mar 2, 2017 at 6:10 PM Alan Orth <[email protected]> wrote:

> Hello,
>
> I did a bit of testing and it seems that this problem arises when an
> item has a JPG in the ORIGINAL bundle. The "ImageMagick PDF Thumbnail"
> plugin will process both PDFs and JPGs if they are present in the ORIGINAL
> bundle, despite being configured to only process "Adobe PDF" input formats.
> In our case we have some JPGs in the ORIGINAL bundle because editors had
> manually created thumbnails and uploaded them during item submission, but
> this is beside the point.
>
> - Item with a PDF in the ORIGINAL bundle:
>
> $ [dspace]/bin/dspace filter-media -f -i 10568/16881 -p "ImageMagick PDF
> Thumbnail" -v
> The following MediaFilters are enabled:
> Full Filter Name: org.dspace.app.mediafilter.ImageMagickPdfThumbnailFilter
> org.dspace.app.mediafilter.ImageMagickPdfThumbnailFilter
> IM Thumbnail earlywinproposal_esa_postharvest.pdf is replacable.
> File: earlywinproposal_esa_postharvest.pdf.jpg
> IM Image Param: /tmp/impdfthumb6654586450122351471.pdf[0] -flatten
> /tmp/impdfthumb6654586450122351471.pdf.jpg
> IM Thumbnail Param: /tmp/impdfthumb6654586450122351471.pdf.jpg -thumbnail
> 300x300 /tmp/impdfthumb6654586450122351471.pdf.jpg.jpg
> FILTERED: bitstream 13787 (item: 10568/16881) and created
> 'earlywinproposal_esa_postharvest.pdf.jpg'
>
> - Item with a JPG in the ORIGINAL bundle:
>
> $ [dspace]/bin/dspace filter-media -f -i 10568/33941  -p "ImageMagick PDF
> Thumbnail" -v
> The following MediaFilters are enabled:
> Full Filter Name: org.dspace.app.mediafilter.ImageMagickPdfThumbnailFilter
> org.dspace.app.mediafilter.ImageMagickPdfThumbnailFilter
> Generated Thumbnail strengtheningPig.jpg matches pattern and is replacable.
> File: strengtheningPig.jpg.jpg
> IM Image Param: /tmp/impdfthumb5315798183586620841.pdf[0] -flatten
> /tmp/impdfthumb5315798183586620841.pdf.jpg
> IM Thumbnail Param: /tmp/impdfthumb5315798183586620841.pdf.jpg -thumbnail
> 300x300 /tmp/impdfthumb5315798183586620841.pdf.jpg.jpg
> FILTERED: bitstream 23121 (item: 10568/33941) and created
> 'strengtheningPig.jpg.jpg'
>
> - Item with a JPG in the THUMBNAIL bundle (manually uploaded after item
> submission):
>
> $ [dspace]/bin/dspace filter-media -f -i 10568/24655 -p "ImageMagick PDF
> Thumbnail" -v
> The following MediaFilters are enabled:
> Full Filter Name: org.dspace.app.mediafilter.ImageMagickPdfThumbnailFilter
> org.dspace.app.mediafilter.ImageMagickPdfThumbnailFilter
>
> The expected behavior is that the "ImageMagick PDF Thumbnail" plugin
> should not process JPGs, but it does process them when they are in the
> ORIGINAL bundle, despite its configuration in dspace.cfg. We are using
> DSpace 5.5. I will file an issue on Jira.
>
> Regards,
>
> On Tue, Feb 21, 2017 at 11:08 AM Alan Orth <[email protected]> wrote:
>
> I think I found a bug in filter-media. I'm trying to force the
> re-generation of all PDF thumbnails in a collection by limiting the
> filter-media command to the ImageMagick PDF Thumbnail plugin, but I see it
> still processing JPGs:
>
> ---
> $ [dspace]/bin/dspace filter-media -f -i 10568/16856 -p "ImageMagick PDF
> Thumbnail"
> ...
> File: EnvtNaturalRes.jpg.jpg
> FILTERED: bitstream 80165 (item: 10568/76133) and created
> 'EnvtNaturalRes.jpg.jpg'
> File: zemadim_2016.pdf.jpg
> FILTERED: bitstream 85076 (item: 10568/77324) and created
> 'zemadim_2016.pdf.jpg'
> ---
>
> The configuration for filter-media's ImageMagick plugins is:
>
> ---
> filter.org.dspace.app.mediafilter.ImageMagickImageThumbnailFilter.inputFormats
> = BMP, GIF, image/png, JPG, TIFF, JPEG, JPEG 2000
> filter.org.dspace.app.mediafilter.ImageMagickPdfThumbnailFilter.inputFormats
> = Adobe PDF
> ---
>
> The expected behavior is that filter-media only processes bitstreams
> matching the input formats listed in the plugin's configuration. In my case
> I would be generating thumbnails for thousands of items and so this creates
> lots of extra I/O and wastes CPU cycles.
>
> We are running DSpace version 5.5.
>
> Thank you,
> --
>
> Alan Orth
> [email protected]
> https://englishbulgaria.net
> https://alaninkenya.org
> https://mjanja.ch
>
-- 

Alan Orth
[email protected]
https://englishbulgaria.net
https://alaninkenya.org
https://mjanja.ch

-- 
You received this message because you are subscribed to the Google Groups 
"DSpace Technical Support" group.