Hi All,
I've run into a similar situation with 1.8.2. The problem is still with the
pdftoppm tool used to extract the PDF's thumbnail: the tool has changed its
output file naming scheme.
With debug logging enabled, the log shows this:
2012-06-20 12:54:21,219 DEBUG org.dspace.app.mediafilter.XPDF2Thumbnail @ DPI: pdfinfo method got dpi=56 for max dim=1026 (points, 1/72")
2012-06-20 12:54:21,219 DEBUG org.dspace.app.mediafilter.XPDF2Thumbnail @ Running xpdf command: [/usr/bin/pdftoppm, -q, -f, 1, -l, 1, -r, 56, /tmp/DSfilt2572793846402123318.pdf, /tmp/prevu6465374009455317236out]
2012-06-20 12:54:21,606 DEBUG org.dspace.app.mediafilter.XPDF2Thumbnail @ PDFTOPPM output is: /tmp/prevu6465374009455317236out-000001.ppm, exists=false
2012-06-20 12:54:21,606 ERROR org.dspace.app.mediafilter.XPDF2Thumbnail @ Unable to delete file
DSpace is looking for a file named
prevu6465374009455317236out-000001.ppm, but pdftoppm created
prevu6465374009455317236out-001.ppm, hence the file is not found.
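To see which name your own pdftoppm build produces, a small helper like the one below may be useful. It is only a sketch: find_first_page is a hypothetical name, not part of DSpace or pdftoppm. As far as I understand, poppler-based builds pad the page number to the number of digits in the document's page count, while DSpace here expects a fixed six digits; treat that as an assumption and check it against your installation.

```shell
#!/bin/sh
# Hypothetical helper: print the page image that pdftoppm wrote for a given
# output prefix, whatever zero-padding the installed build used.
find_first_page() {
    prefix=$1
    for candidate in "$prefix"-*.ppm; do
        # An unmatched glob stays literal, so confirm the file really exists.
        if [ -e "$candidate" ]; then
            printf '%s\n' "$candidate"
            return 0
        fi
    done
    # Nothing matched: pdftoppm wrote no .ppm file for this prefix.
    return 1
}
```

After running the pdftoppm command from the log by hand, calling find_first_page with the same output prefix prints the file that was actually created, which you can compare with the name DSpace reports in "PDFTOPPM output is:".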
I fixed it with a script that wraps the pdftoppm tool, and changed the
xpdf.path.pdftoppm parameter in dspace.cfg to point to this script.
In dspace.cfg I replaced this line:
xpdf.path.pdftoppm = /usr/bin/pdftoppm
with this:
xpdf.path.pdftoppm = /dspace/bin/pdftoppm.sh
The script lives at /dspace/bin/pdftoppm.sh.
Adjust the path to pdftoppm inside the script to match your system.
START SCRIPT --------------------------
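#!/bin/sh
# Enrique Martinez
# Fixes the file name of the extracted cover-page image.
# DSpace passes nine arguments; the ninth is the output file prefix.
/usr/bin/pdftoppm "$@"
CONVERTRESULT=$?
TARGETFILE="$9-000001.ppm"
for file in "$9"-*.ppm; do
    # Rename only a real file that does not already have the expected name.
    if [ -e "$file" ] && [ "$file" != "$TARGETFILE" ]; then
        mv "$file" "$TARGETFILE"
    fi
    break
done
# Report pdftoppm's own exit status back to DSpace.
exit $CONVERTRESULT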
END SCRIPT --------------------------
Hope this helps,
Enrique Martínez
-----Original Message-----
From: Richard Rodgers [mailto:[email protected]]
Sent: Friday, June 22, 2012 07:55 a.m.
To: Kirti Bodhmage
CC: <[email protected]>
Subject: Re: [Dspace-tech] Thumbnail Priview. in 1.6.2
Hi Kirti:
Not sure if you are having other problems, but I did want to clarify how
MediaFilter works.
It is a general set of tools for operating on your bitstream content, and the
primary use for most people is to extract text (for indexing) from PDFs, Word
files, etc., not to produce thumbnails of those formats.
These functions are configured in dspace.cfg (search for 'mediafilter'
properties) - and each 'filter' is given a list of formats to process. It
further optimizes its work by not recreating a derivative (i.e. the text file,
or thumbnail, etc) if it already exists - that is the message you are seeing
below (SKIPPED).
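To get a quick picture of how much of a run was skipped versus failed, the output can be tallied with a few lines of shell. This is a rough sketch rather than anything shipped with DSpace: summarize_filter_log is a hypothetical helper, and it assumes the output was saved to a file and uses the "SKIPPED: bitstream" and "ERROR filtering, skipping" wording shown in the message below.

```shell
#!/bin/sh
# Hypothetical helper: read saved filter-media output on stdin and count
# SKIPPED lines (derivative already exists) and ERROR blocks (filter failed).
summarize_filter_log() {
    awk '
        /SKIPPED: bitstream/        { skipped++ }
        /ERROR filtering, skipping/ { errors++ }
        END { printf "skipped=%d errors=%d\n", skipped, errors }
    '
}
```

For example, after saving the output of a run to a file of your choosing, `summarize_filter_log < filter.log` prints one line with both counts. The SKIPPED entries are harmless; the ERROR entries are the ones worth investigating.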
Hope this helps,
Richard R
On Jun 22, 2012, at 6:34 AM, Kirti Bodhmage wrote:
> Hi,
> We have DSpace 1.6.2.
> I am trying to enable thumbnail creation.
> I ran ./filter-media in the dspace/bin directory.
>
> I got the following errors while running the script. After it finished I
> could see a thumbnail image for the PNG item but nothing for the PDF and
> other text items.
>
> I was expecting filter-media to create image files from PDF and Word
> documents, but it is creating .txt files instead.
>
> I saw previous emails on thumbnail creation on this mailing list. Is the
> xpdf filter a better choice for PDFs and docs?
>
> -------
> SKIPPED: bitstream 401 (item: 123456789/131) because
> 'thesisJPWoodcock1997-1.pdf.txt' already exists
> SKIPPED: bitstream 415 (item: 123456789/300) because
> 'SHAHTransnationalHindu2009FINAL.pdf.txt' already exists
> SKIPPED: bitstream 439 (item: 123456789/135) because
> 'CARBONI_D_FINAL.pdf.txt' already exists
> ERROR filtering, skipping bitstream:
> Item Handle: 123456789/136
> Bundle Name: ORIGINAL
> File Size: 110763816
> Checksum: 044ce0fc33dbaf9299248cd17cf24828 (MD5)
> Asset Store: 0
> java.io.FileNotFoundException:
> /opt/dspace/assetstore-ad1/16/32/42/163242047713616019554023286382622639427
> (No such file or directory)
> SKIPPED: bitstream 445 (item: 123456789/137) because
> 'RAMSDEN_PhD_FINAL.pdf.txt' already exists
> SKIPPED: bitstream 447 (item: 123456789/138) because 'WU_T_FINAL.pdf.txt'
> already exists
> SKIPPED: bitstream 578 (item: 123456789/1113) because
> 'ADETORONumericalAndExperimental2009FINAL.pdf.txt' already exists
> ERROR filtering, skipping bitstream:
> Item Handle: 123456789/163
> Bundle Name: ORIGINAL
> File Size: 270417
> Checksum: 8360d7bd72fe23ead9de220a78b047e3 (MD5)
> Asset Store: 0
> java.io.FileNotFoundException:
> /opt/dspace/assetstore-ad1/10/21/49/102149325471428675497688294414612838518
> (No such file or directory)
> ERROR filtering, skipping bitstream:
>
> -----------------
>
> Here are my settings in dspace.cfg
>
> ---
> webui.itemlist.columns = thumbnail, dc.date.issued(date), dc.title, dc.contributor.*
> webui.itemlist.widths = *, 130, 60%, 40%
> webui.itemlist.dateaccessioned.columns = thumbnail, dc.date.accessioned(date), dc.title, dc.contributor.*
> publications.bundles.allowed = ORIGINAL, DELETED, LICENSE, THUMBNAILS
> webui.browse.thumbnail.show = true
> webui.browse.thumbnail.maxheight = 80
> webui.browse.thumbnail.maxwidth = 80
> webui.item.thumbnail.show = true
> webui.browse.thumbnail.linkbehaviour = item
> # maximum width and height of generated thumbnails
> thumbnail.maxwidth 80
> thumbnail.maxheight 80
>
> ---
>
>
> Thanks
> Kirti
>
>
> ----------------------------------------------------------------------
> --------
> Live Security Virtual Conference
> Exclusive live event will cover all the ways today's security and
> threat landscape has changed and how IT managers can respond.
> Discussions will include endpoint security, mobile security and the
> latest in malware threats.
> http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
> _______________________________________________
> DSpace-tech mailing list
> [email protected]
> https://lists.sourceforge.net/lists/listinfo/dspace-tech