Hi,

Came across this in one of the answer in the mailing list.

---
After having changed dspace.cfg,
1. rebuild dspace,
2. run [dspace]/bin/filter-media,
3. and run [dspace]/bin/index-all
---

I just change dspace.cfg for thumbnails .  Do I need to rebuilt Dspace ?

Thanks
Kirti


-----Original Message-----
From: Kirti Bodhmage [mailto:[email protected]] 
Sent: 22 June 2012 16:37
To: [email protected]
Subject: Re: [Dspace-tech] Thumbnail Priview. in 1.6.2

Richard,
Your reply explains why media-filters not generating icon or image for
WU_T_FINAL.pdf instead it creates WU_T_FINAL.pdf.txt. 

But it did generate a thumbnail for png file.
I uploaded  a png file to Dspace , ran media-filter job and then refreshed
the page I could see jpg next to item on browse page Attaching a screenshot.

helix84,
You were correct.  We had a database refresh from live to test long back and
not sure if we had sync on asset store. And that causing
java.io.FileNotFoundException .

But now the question is how to I get thumbanail for pdfs, word or any other
text?  Is anybody using xpdf  for that ?

Thanks
Kirti


-----Original Message-----
From: Martínez Zuñiga, Enrique [mailto:[email protected]]
Sent: 22 June 2012 15:55
To: <[email protected]>
Subject: Re: [Dspace-tech] Thumbnail Priview. in 1.6.2

Hi All,

I've found a similar situation with 1.8.2 and the problem still is with the
pdftoppm tool when extract the PDF's thumbnail, the tool has changed the
output file naming schema.

In the logs (in debug mode) throws this:

2012-06-20 12:54:21,219 DEBUG org.dspace.app.mediafilter.XPDF2Thumbnail @
DPI: pdfinfo method got dpi=56 for max dim=1026 (points, 1/72")
2012-06-20 12:54:21,219 DEBUG org.dspace.app.mediafilter.XPDF2Thumbnail @
Running xpdf command: [/usr/bin/pdftoppm, -q, -f, 1, -l, 1, -r, 56,
/tmp/DSfilt2572793846402123318.pdf, /tmp/prevu6465374009455317236out]
2012-06-20 12:54:21,606 DEBUG org.dspace.app.mediafilter.XPDF2Thumbnail @
PDFTOPPM output is: /tmp/prevu6465374009455317236out-000001.ppm,
exists=false
2012-06-20 12:54:21,606 ERROR org.dspace.app.mediafilter.XPDF2Thumbnail @
Unable to delete file

And DSpace is looking for a file with the name
prevu6465374009455317236out-000001.ppm but pdftoppm created this file
prevu6465374009455317236out-001.ppm hence the file not found exception.

I've fixed with a script that wraps the pdftoppm tool, and replaced the
xpdf.path.pdftoppm parameter in dspace.cfg to point to this script.

In dspace.cfg I replaced this line:

    xpdf.path.pdftoppm = /usr/bin/pdftoppm

With this:

    xpdf.path.pdftoppm = /dspace/bin/pdftoppm.sh

And the script is in /dspace/bin/pdftoppm.sh

Please change the path to pdftoppm according to yours.

START SCRIPT --------------------------
#!/bin/sh
# Enrique Martinez
# Para corregir el nombre de archivo de la extracion de la caratula.

/usr/bin/pdftoppm $1 $2 $3 $4 $5 $6 $7 $8 $9 CONVERTRESULT=$?

SOURCEFILE=$9-*.ppm
TARGETFILE=$9-000001.ppm

for file in $SOURCEFILE; do mv $file $TARGETFILE; exit; done

return $CONVERTRESULT
END SCRIPT --------------------------


Hope this helps,


Enrique Martínez


-----Mensaje original-----
De: Richard Rodgers [mailto:[email protected]] Enviado el: Viernes, 22 de
Junio de 2012 07:55 a.m.
Para: Kirti Bodhmage
CC: <[email protected]>
Asunto: Re: [Dspace-tech] Thumbnail Priview. in 1.6.2

Hi Kirti:

Not sure if you are having other problems, but I did want to clarify how
MediaFilter works.
It is a general set of tools for operating on your bitstream content, and
the primary use for most people is to extract text (for indexing) from PDFs
Word files etc, not to produce thumbnails of those formats.
These functions are configured in dspace.cfg (search for 'mediafilter'
properties) - and each 'filter' is given a list of formats to process. It
further optimizes its work by not recreating a derivative (i.e. the text
file, or thumbnail, etc) if it already exists - that is the message you are
seeing below (SKIPPED). 

Hope this helps,

Richard R

On Jun 22, 2012, at 6:34 AM, Kirti Bodhmage wrote:

> Hi,
> We have Dspace 1.6.2.
> I am trying to enable  thumbnail creation.
> Ran ./filter-media  in dspace/bin directory
> 
> Got following errors while executing script. After the execution I 
> could see thumbnail image for png item but couldn't see anything for 
> the pdf and other text items.
> 
> I was expecting filter-media will create image file from pdf and Word 
> documents but its creating .txt file instead.
> 
> Saw previous emails on thumbnail creation in this mailing list.  Is 
> xpdf filter is better choice for  pdfs and docs ?
> 
> -------
> SKIPPED: bitstream 401 (item: 123456789/131) because 
> 'thesisJPWoodcock1997-1.pdf.txt' already exists
> SKIPPED: bitstream 415 (item: 123456789/300) because 
> 'SHAHTransnationalHindu2009FINAL.pdf.txt' already exists
> SKIPPED: bitstream 439 (item: 123456789/135) because 
> 'CARBONI_D_FINAL.pdf.txt' already exists ERROR filtering, skipping
> bitstream:
>        Item Handle: 123456789/136
>        Bundle Name: ORIGINAL
>        File Size: 110763816
>        Checksum: 044ce0fc33dbaf9299248cd17cf24828 (MD5)
>        Asset Store: 0
> java.io.FileNotFoundException:
> /opt/dspace/assetstore-ad1/16/32/42/1632420477136160195540232863826226
> 39427
> (No such file or directory)
> SKIPPED: bitstream 445 (item: 123456789/137) because 
> 'RAMSDEN_PhD_FINAL.pdf.txt' already exists
> SKIPPED: bitstream 447 (item: 123456789/138) because 'WU_T_FINAL.pdf.txt'
> already exists
> SKIPPED: bitstream 578 (item: 123456789/1113) because 
> 'ADETORONumericalAndExperimental2009FINAL.pdf.txt' already exists 
> ERROR filtering, skipping bitstream:
>        Item Handle: 123456789/163
>        Bundle Name: ORIGINAL
>        Bundle Name: ORIGINAL
>        Bundle Name: ORIGINAL
>        File Size: 270417
>        Checksum: 8360d7bd72fe23ead9de220a78b047e3 (MD5)
>        Asset Store: 0
> java.io.FileNotFoundException:
> /opt/dspace/assetstore-ad1/10/21/49/1021493254714286754976882944146128
> 38518
> (No such file or directory)
> ERROR filtering, skipping bitstream:
> 
> -----------------
> 
> Here are my settings in dpspace.cfg
> 
> ---
> webui.itemlist.columns = thumbnail, dc.date.issued(date), dc.title,
> dc.contributor.*
> webui.itemlist.widths = *, 130, 60%, 40% 
> webui.itemlist.dateaccessioned.columns = thumbnail, 
> dc.date.accessioned(date), dc.title, dc.contributor.* 
> publications.bundles.allowed = ORIGINAL, DELETED, LICENSE, THUMBNAILS 
> webui.browse.thumbnail.show = true webui.browse.thumbnail.maxheight =
> 80 webui.browse.thumbnail.maxwidth = 80 webui.item.thumbnail.show = 
> true webui.browse.thumbnail.linkbehaviour = item # maximum width and 
> height of generated thumbnails thumbnail.maxwidth  80 
> thumbnail.maxheight 80
> 
> ---
> 
> 
> Thanks
> Kirti
> 


------------------------------------------------------------------------------
Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and 
threat landscape has changed and how IT managers can respond. Discussions 
will include endpoint security, mobile security and the latest in malware 
threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
_______________________________________________
DSpace-tech mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dspace-tech

Reply via email to