Hi All,

I've run into a similar situation with 1.8.2. The problem is still with the 
pdftoppm tool when extracting the PDF's thumbnail: the tool has changed its 
output file naming scheme.

In debug mode, the log shows this:

2012-06-20 12:54:21,219 DEBUG org.dspace.app.mediafilter.XPDF2Thumbnail @ DPI: 
pdfinfo method got dpi=56 for max dim=1026 (points, 1/72")
2012-06-20 12:54:21,219 DEBUG org.dspace.app.mediafilter.XPDF2Thumbnail @ 
Running xpdf command: [/usr/bin/pdftoppm, -q, -f, 1, -l, 1, -r, 56, 
/tmp/DSfilt2572793846402123318.pdf, /tmp/prevu6465374009455317236out]
2012-06-20 12:54:21,606 DEBUG org.dspace.app.mediafilter.XPDF2Thumbnail @ 
PDFTOPPM output is: /tmp/prevu6465374009455317236out-000001.ppm, exists=false
2012-06-20 12:54:21,606 ERROR org.dspace.app.mediafilter.XPDF2Thumbnail @ 
Unable to delete file

DSpace is looking for a file named 
prevu6465374009455317236out-000001.ppm, but pdftoppm created 
prevu6465374009455317236out-001.ppm instead, hence the file-not-found exception.
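To see which suffix your own pdftoppm produces, something like the following sketch may help. It only lists the .ppm files written for a given output prefix; list_ppm_outputs is a made-up helper name, not part of DSpace or xpdf.

```shell
#!/bin/sh
# list_ppm_outputs is a hypothetical helper, not part of DSpace or xpdf.
# It prints the base names of all <prefix>-*.ppm files, one per line.
list_ppm_outputs() {
    prefix=$1
    for f in "$prefix"-*.ppm; do
        # An unmatched glob stays literal, so only print names that exist.
        [ -e "$f" ] && printf '%s\n' "${f##*/}"
    done
    return 0
}
```

For example, after running pdftoppm by hand with the same options as in the log (-q -f 1 -l 1 -r 56 some.pdf /tmp/check), calling list_ppm_outputs /tmp/check shows whether the file ends in -001.ppm or -000001.ppm.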

I fixed it with a script that wraps the pdftoppm tool, and changed the 
xpdf.path.pdftoppm parameter in dspace.cfg to point to this script.

In dspace.cfg I replaced this line:

    xpdf.path.pdftoppm = /usr/bin/pdftoppm

With this:

    xpdf.path.pdftoppm = /dspace/bin/pdftoppm.sh

The script lives at /dspace/bin/pdftoppm.sh and must be executable.

Please adjust the path to pdftoppm inside the script to match your system.

START SCRIPT --------------------------
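#!/bin/sh
# Enrique Martinez
# Fixes the file name of the extracted cover page (thumbnail).

# DSpace passes nine arguments; the ninth is the output file prefix.
/usr/bin/pdftoppm "$@"
CONVERTRESULT=$?

PREFIX=$9
TARGETFILE=$PREFIX-000001.ppm

# Rename the first page image pdftoppm wrote (e.g. PREFIX-001.ppm)
# to the name DSpace expects. An unmatched glob stays literal, so
# check that the file exists before moving it.
for file in "$PREFIX"-*.ppm; do
    if [ -e "$file" ] && [ "$file" != "$TARGETFILE" ]; then
        mv "$file" "$TARGETFILE"
    fi
    break
done

# Report pdftoppm's own exit status back to DSpace.
exit $CONVERTRESULT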
END SCRIPT --------------------------
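
Before pointing dspace.cfg at the wrapper, it may be worth trying it by hand. The sketch below calls a wrapper with the same nine arguments seen in the debug log and checks that the file name DSpace expects exists afterwards. try_wrapper and the example paths are my own illustration, not part of DSpace.

```shell
#!/bin/sh
# try_wrapper is a hypothetical helper, not part of DSpace.
# Usage: try_wrapper /dspace/bin/pdftoppm.sh some.pdf /tmp/prevutest
try_wrapper() {
    wrapper=$1
    pdf=$2
    prefix=$3
    # Same nine arguments that appear in the XPDF2Thumbnail debug log.
    "$wrapper" -q -f 1 -l 1 -r 56 "$pdf" "$prefix"
    status=$?
    if [ -e "$prefix-000001.ppm" ]; then
        echo "OK: $prefix-000001.ppm exists (exit status $status)"
        return 0
    fi
    echo "MISSING: $prefix-000001.ppm (exit status $status)"
    return 1
}
```

If this reports MISSING, check the pdftoppm path inside the wrapper and that the wrapper is executable.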


Hope this helps,


Enrique Martínez


-----Original Message-----
From: Richard Rodgers [mailto:[email protected]] 
Sent: Friday, June 22, 2012 07:55 a.m.
To: Kirti Bodhmage
CC: <[email protected]>
Subject: Re: [Dspace-tech] Thumbnail Priview. in 1.6.2

Hi Kirti:

Not sure if you are having other problems, but I did want to clarify how 
MediaFilter works.
It is a general set of tools for operating on your bitstream content, and the 
primary use for most people is to extract text (for indexing) from PDFs, Word 
files, etc., not to produce thumbnails of those formats.
These functions are configured in dspace.cfg (search for 'mediafilter' 
properties), and each 'filter' is given a list of formats to process. It 
further optimizes its work by not recreating a derivative (i.e. the text file, 
thumbnail, etc.) if it already exists. That is the message you are seeing 
below (SKIPPED). 
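
A quick way to find those properties is to grep the configuration file. This is only a sketch: show_mediafilter_props is a made-up helper name, and the location of dspace.cfg depends on your installation.

```shell
#!/bin/sh
# show_mediafilter_props is a hypothetical helper, not part of DSpace.
# It prints every line of the given config file that mentions
# 'mediafilter', prefixed with its line number.
show_mediafilter_props() {
    cfg=$1
    # -n prefixes each match with its line number in the file.
    grep -n 'mediafilter' "$cfg"
}
```

For example: show_mediafilter_props /dspace/config/dspace.cfg (the path here is an assumption; use wherever your dspace.cfg actually lives).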

Hope this helps,

Richard R

On Jun 22, 2012, at 6:34 AM, Kirti Bodhmage wrote:

> Hi,
> We have Dspace 1.6.2.
> I am trying to enable  thumbnail creation.
> Ran ./filter-media  in dspace/bin directory
> 
> Got following errors while executing script. After the execution I 
> could see thumbnail image for png item but couldn't see anything for 
> the pdf and other text items.
> 
> I was expecting filter-media to create image files from PDF and Word 
> documents, but it's creating .txt files instead.
> 
> I saw previous emails on thumbnail creation on this mailing list. Is the 
> xpdf filter a better choice for PDFs and docs?
> 
> -------
> SKIPPED: bitstream 401 (item: 123456789/131) because 
> 'thesisJPWoodcock1997-1.pdf.txt' already exists
> SKIPPED: bitstream 415 (item: 123456789/300) because 
> 'SHAHTransnationalHindu2009FINAL.pdf.txt' already exists
> SKIPPED: bitstream 439 (item: 123456789/135) because 
> 'CARBONI_D_FINAL.pdf.txt' already exists
> ERROR filtering, skipping bitstream:
>        Item Handle: 123456789/136
>        Bundle Name: ORIGINAL
>        File Size: 110763816
>        Checksum: 044ce0fc33dbaf9299248cd17cf24828 (MD5)
>        Asset Store: 0
> java.io.FileNotFoundException:
> /opt/dspace/assetstore-ad1/16/32/42/163242047713616019554023286382622639427
> (No such file or directory)
> SKIPPED: bitstream 445 (item: 123456789/137) because 
> 'RAMSDEN_PhD_FINAL.pdf.txt' already exists
> SKIPPED: bitstream 447 (item: 123456789/138) because 'WU_T_FINAL.pdf.txt'
> already exists
> SKIPPED: bitstream 578 (item: 123456789/1113) because 
> 'ADETORONumericalAndExperimental2009FINAL.pdf.txt' already exists
> ERROR filtering, skipping bitstream:
>        Item Handle: 123456789/163
>        Bundle Name: ORIGINAL
>        File Size: 270417
>        Checksum: 8360d7bd72fe23ead9de220a78b047e3 (MD5)
>        Asset Store: 0
> java.io.FileNotFoundException:
> /opt/dspace/assetstore-ad1/10/21/49/102149325471428675497688294414612838518
> (No such file or directory)
> ERROR filtering, skipping bitstream:
> 
> -----------------
> 
> Here are my settings in dspace.cfg
> 
> ---
> webui.itemlist.columns = thumbnail, dc.date.issued(date), dc.title, dc.contributor.*
> webui.itemlist.widths = *, 130, 60%, 40%
> webui.itemlist.dateaccessioned.columns = thumbnail, dc.date.accessioned(date), dc.title, dc.contributor.*
> publications.bundles.allowed = ORIGINAL, DELETED, LICENSE, THUMBNAILS
> webui.browse.thumbnail.show = true
> webui.browse.thumbnail.maxheight = 80
> webui.browse.thumbnail.maxwidth = 80
> webui.item.thumbnail.show = true
> webui.browse.thumbnail.linkbehaviour = item
> # maximum width and height of generated thumbnails
> thumbnail.maxwidth  80
> thumbnail.maxheight 80
> 
> ---
> 
> 
> Thanks
> Kirti
> 
> 
> ----------------------------------------------------------------------
> --------
> Live Security Virtual Conference
> Exclusive live event will cover all the ways today's security and 
> threat landscape has changed and how IT managers can respond. 
> Discussions will include endpoint security, mobile security and the 
> latest in malware threats. 
> http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
> _______________________________________________
> DSpace-tech mailing list
> [email protected]
> https://lists.sourceforge.net/lists/listinfo/dspace-tech

