Hi, I came across this in one of the answers on the mailing list:
---
After having changed dspace.cfg,
1. rebuild dspace,
2. run [dspace]/bin/filter-media,
3. and run [dspace]/bin/index-all
---

I just changed dspace.cfg for thumbnails. Do I need to rebuild DSpace?

Thanks
Kirti

-----Original Message-----
From: Kirti Bodhmage [mailto:[email protected]]
Sent: 22 June 2012 16:37
To: [email protected]
Subject: Re: [Dspace-tech] Thumbnail Priview. in 1.6.2

Richard,

Your reply explains why the media filters are not generating an icon or image for WU_T_FINAL.pdf and create WU_T_FINAL.pdf.txt instead. But it did generate a thumbnail for a png file. I uploaded a png file to DSpace, ran the media-filter job, and after refreshing the page I could see a jpg next to the item on the browse page. Attaching a screenshot.

helix84,

You were correct. We had a database refresh from live to test a long time ago, and I am not sure we synced the asset store. That is what causes the java.io.FileNotFoundException.

But now the question is: how do I get thumbnails for PDFs, Word files or any other text documents? Is anybody using xpdf for that?

Thanks
Kirti

-----Original Message-----
From: Martínez Zuñiga, Enrique [mailto:[email protected]]
Sent: 22 June 2012 15:55
To: <[email protected]>
Subject: Re: [Dspace-tech] Thumbnail Priview. in 1.6.2

Hi All,

I've found a similar situation with 1.8.2. The problem is still with the pdftoppm tool when extracting the PDF's thumbnail: the tool has changed its output file naming scheme.
In the logs (in debug mode) it throws this:

2012-06-20 12:54:21,219 DEBUG org.dspace.app.mediafilter.XPDF2Thumbnail @ DPI: pdfinfo method got dpi=56 for max dim=1026 (points, 1/72")
2012-06-20 12:54:21,219 DEBUG org.dspace.app.mediafilter.XPDF2Thumbnail @ Running xpdf command: [/usr/bin/pdftoppm, -q, -f, 1, -l, 1, -r, 56, /tmp/DSfilt2572793846402123318.pdf, /tmp/prevu6465374009455317236out]
2012-06-20 12:54:21,606 DEBUG org.dspace.app.mediafilter.XPDF2Thumbnail @ PDFTOPPM output is: /tmp/prevu6465374009455317236out-000001.ppm, exists=false
2012-06-20 12:54:21,606 ERROR org.dspace.app.mediafilter.XPDF2Thumbnail @ Unable to delete file

DSpace is looking for a file named prevu6465374009455317236out-000001.ppm, but pdftoppm created the file prevu6465374009455317236out-001.ppm, hence the file-not-found exception.

I fixed it with a script that wraps the pdftoppm tool, and changed the xpdf.path.pdftoppm parameter in dspace.cfg to point to this script.

In dspace.cfg I replaced this line:

xpdf.path.pdftoppm = /usr/bin/pdftoppm

With this:

xpdf.path.pdftoppm = /dspace/bin/pdftoppm.sh

The script is in /dspace/bin/pdftoppm.sh. Please change the path to pdftoppm according to yours.

START SCRIPT --------------------------
#!/bin/sh
# Enrique Martinez
# Fixes the file name of the extracted cover page.
# DSpace passes nine arguments; the ninth is the output file root.
/usr/bin/pdftoppm "$1" "$2" "$3" "$4" "$5" "$6" "$7" "$8" "$9"
CONVERTRESULT=$?
TARGETFILE="$9-000001.ppm"
for file in "$9"-*.ppm; do
    # Rename only the first match, and only if it is not already the target.
    if [ -e "$file" ] && [ "$file" != "$TARGETFILE" ]; then
        mv "$file" "$TARGETFILE"
    fi
    break
done
# Report the exit status of pdftoppm itself.
exit $CONVERTRESULT
END SCRIPT --------------------------

Hope this helps,
Enrique Martínez

-----Original Message-----
From: Richard Rodgers [mailto:[email protected]]
Sent: Friday, 22 June 2012 07:55 a.m.
To: Kirti Bodhmage
Cc: <[email protected]>
Subject: Re: [Dspace-tech] Thumbnail Priview. in 1.6.2

Hi Kirti:

Not sure if you are having other problems, but I did want to clarify how MediaFilter works.
It is a general set of tools for operating on your bitstream content, and the primary use for most people is to extract text (for indexing) from PDFs, Word files, etc., not to produce thumbnails of those formats. These functions are configured in dspace.cfg (search for 'mediafilter' properties), and each 'filter' is given a list of formats to process. It further optimizes its work by not recreating a derivative (i.e. the text file, thumbnail, etc.) if it already exists; that is the message you are seeing below (SKIPPED).

Hope this helps,

Richard R

On Jun 22, 2012, at 6:34 AM, Kirti Bodhmage wrote:

> Hi,
> We have DSpace 1.6.2.
> I am trying to enable thumbnail creation.
> Ran ./filter-media in the dspace/bin directory.
>
> Got the following errors while executing the script. After the execution I
> could see a thumbnail image for the png item but couldn't see anything for
> the pdf and other text items.
>
> I was expecting filter-media to create an image file from pdf and Word
> documents, but it's creating a .txt file instead.
>
> Saw previous emails on thumbnail creation in this mailing list. Is the
> xpdf filter a better choice for pdfs and docs?
>
> -------
> SKIPPED: bitstream 401 (item: 123456789/131) because 'thesisJPWoodcock1997-1.pdf.txt' already exists
> SKIPPED: bitstream 415 (item: 123456789/300) because 'SHAHTransnationalHindu2009FINAL.pdf.txt' already exists
> SKIPPED: bitstream 439 (item: 123456789/135) because 'CARBONI_D_FINAL.pdf.txt' already exists
> ERROR filtering, skipping bitstream:
>   Item Handle: 123456789/136
>   Bundle Name: ORIGINAL
>   File Size: 110763816
>   Checksum: 044ce0fc33dbaf9299248cd17cf24828 (MD5)
>   Asset Store: 0
> java.io.FileNotFoundException: /opt/dspace/assetstore-ad1/16/32/42/163242047713616019554023286382622639427 (No such file or directory)
> SKIPPED: bitstream 445 (item: 123456789/137) because 'RAMSDEN_PhD_FINAL.pdf.txt' already exists
> SKIPPED: bitstream 447 (item: 123456789/138) because 'WU_T_FINAL.pdf.txt' already exists
> SKIPPED: bitstream 578 (item: 123456789/1113) because 'ADETORONumericalAndExperimental2009FINAL.pdf.txt' already exists
> ERROR filtering, skipping bitstream:
>   Item Handle: 123456789/163
>   Bundle Name: ORIGINAL
>   Bundle Name: ORIGINAL
>   Bundle Name: ORIGINAL
>   File Size: 270417
>   Checksum: 8360d7bd72fe23ead9de220a78b047e3 (MD5)
>   Asset Store: 0
> java.io.FileNotFoundException: /opt/dspace/assetstore-ad1/10/21/49/102149325471428675497688294414612838518 (No such file or directory)
> ERROR filtering, skipping bitstream:
>
> -----------------
>
> Here are my settings in dspace.cfg
>
> ---
> webui.itemlist.columns = thumbnail, dc.date.issued(date), dc.title, dc.contributor.*
> webui.itemlist.widths = *, 130, 60%, 40%
> webui.itemlist.dateaccessioned.columns = thumbnail, dc.date.accessioned(date), dc.title, dc.contributor.*
> publications.bundles.allowed = ORIGINAL, DELETED, LICENSE, THUMBNAILS
> webui.browse.thumbnail.show = true
> webui.browse.thumbnail.maxheight = 80
> webui.browse.thumbnail.maxwidth = 80
> webui.item.thumbnail.show = true
> webui.browse.thumbnail.linkbehaviour = item
> # maximum width and height of generated thumbnails
> thumbnail.maxwidth 80
> thumbnail.maxheight 80
>
> ---
>
> Thanks
> Kirti

_______________________________________________
DSpace-tech mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dspace-tech
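
For anyone who wants to try the renaming step from Enrique's wrapper outside DSpace, here is a minimal sketch of it as a standalone shell function. The function name normalize_ppm_name is hypothetical and not part of DSpace. The six-digit -000001.ppm suffix is taken from the debug log quoted in the thread, so treat it as an assumption that may not hold for other DSpace or pdftoppm versions.

```shell
#!/bin/sh
# Hypothetical helper (not part of DSpace): rename the first page image that
# pdftoppm wrote for a given output root to <root>-000001.ppm, the name
# XPDF2Thumbnail looked for in the debug log quoted in this thread.
normalize_ppm_name() {
    root=$1
    target="$root-000001.ppm"
    for f in "$root"-*.ppm; do
        # An unmatched glob stays literal, so check the file really exists.
        [ -e "$f" ] || continue
        # Nothing to rename when pdftoppm already used the expected name.
        [ "$f" = "$target" ] || mv -- "$f" "$target"
        return 0
    done
    # No .ppm file was found for this output root.
    return 1
}
```

A wrapper script could run pdftoppm with the original arguments, save its exit status, call normalize_ppm_name with the output root (the ninth argument in the logged command), and then exit with the saved status.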

