Hello, Am 13.06.2014 um 04:47 schrieb SUZUKI Keiji <[email protected]>:
> 1) Set the logging level to DEBUG and rerun. Should have done so before. Thanks you for the heads up. You were perfectly right. But then, the result leaves me a bit clueless for now: > esxh-15:/srv/dspace> tail -n 10 log/dspace.log.2014-06-15 > 2014-06-15 12:45:17,812 DEBUG org.dspace.content.BitstreamFormat @ > anonymous::find_bitstream_format:bitstream_format_id=2 > 2014-06-15 12:45:17,812 DEBUG org.dspace.storage.rdbms.DatabaseManager @ > Running query "SELECT * FROM fileextension WHERE bitstream_format_id= ? " > with parameters: 2 > 2014-06-15 12:45:17,851 DEBUG org.dspace.storage.rdbms.DatabaseManager @ > Running query "select * from bitstream where bitstream_id = ? " with > parameters: 27442 > 2014-06-15 12:45:17,852 DEBUG > org.dspace.storage.bitstore.BitstreamStorageManager @ Local filename for > 87066288396181747611585923333395102959 is > /srv/dspace/assetstore/87/06/62/87066288396181747611585923333395102959 > 2014-06-15 12:45:17,865 INFO net.sf.ehcache.util.UpdateChecker @ New > update(s) found: 2.4.7 > [http://www.terracotta.org/confluence/display/release/Release+Notes+Ehcache+Core+2.4] > 2014-06-15 12:45:17,919 DEBUG org.dspace.app.mediafilter.XPDF2Thumbnail @ > DPI: pdfinfo method got dpi=75 for max dim=759 (points, 1/72") > 2014-06-15 12:45:17,920 DEBUG org.dspace.app.mediafilter.XPDF2Thumbnail @ > Running xpdf command: [/usr/bin/pdftoppm, -q, -f, 1, -l, 1, -r, 75, > /tmp/DSfilt2327548125683453130.pdf, /tmp/prevu8591868713129272046out] > 2014-06-15 12:45:18,357 DEBUG org.dspace.app.mediafilter.XPDF2Thumbnail @ > PDFTOPPM output is: /tmp/prevu8591868713129272046out-000001.ppm, exists=false > 2014-06-15 12:45:18,420 ERROR org.dspace.app.mediafilter.XPDF2Thumbnail @ > Unable to delete file > 2014-06-15 12:45:18,421 DEBUG org.dspace.storage.rdbms.DatabaseManager @ > Running query "SELECT bundle.* FROM bundle, bundle2bitstream WHERE > bundle.bundle_id=bundle2bitstream.bundle_id AND > bundle2bitstream.bitstream_id= ? " with parameters: 27442 > esxh-15:/srv/dspace> ls -l /tmp > insgesamt 1272 > drwx------ 2 amanda backup 4096 Jun 15 11:27 amanda > drwxr-xr-x 2 root root 4096 Jun 15 12:17 hsperfdata_root > drwxr-xr-x 2 tomcat7 tomcat7 4096 Jun 15 12:45 hsperfdata_tomcat7 > -rw-r--r-- 1 tomcat7 tomcat7 1281435 Jun 15 12:45 > prevu8591868713129272046out-1.ppm > drwxr-xr-x 2 tomcat7 root 4096 Jun 15 12:12 tomcat7-tomcat7-tmp > drwx------ 2 root root 4096 Jun 15 11:26 vmware-root > esxh-15:/srv/dspace> This means, the enumeration scheme used by pdftoppm for writing image files from several pages is different from what the XPDF Plugin expects. If I got it right, the plugin tells pdftoppm to do this: /usr/bin/pdftoppm -q -f 1 -l 1 -r 75 /tmp/DSfilt2327548125683453130.pdf /tmp/prevu8591868713129272046out It expects to find the resulting file here: /tmp/prevu8591868713129272046out-000001.ppm However, the file gets written here: /tmp/prevu8591868713129272046out-1.ppm Everything is fine regarding file permissions, the file is in the expected directory /tmp, only the six digits instead of a single digit make the difference. There are several questions here. Why does the filter write a .ppm file and not a .jpg file using the -jpeg option of pdftoppm and when does the actual conversion happen? The task of the filter is always to produce a thumbnail image of the first page. So it would seem much more logical and robust to me to use the -singlepage attribute of pdftoppm which does not add anything to the output name besides the file extension. Instead first page -f and last page -l are set to 1. But well I would not need to bother if everything worked fine. Where does this six digit rule get set? During my tests I had produced thousands of files starting with /tmp/prevu*. Most of them ended on -1.ppm, but some of them on -01.ppm. Mysterious. I will try to produce the same fault on my test system which works fine for now, just to understand where are the differences. For now, I wont try the second suggestion to recompile with source code commented out, because I guess, I already found the issue, just dont understand it yet. Thanks for your support. Further suggestions welcome. Bye, Christian ------------------------------------------------------------------------------ HPCC Systems Open Source Big Data Platform from LexisNexis Risk Solutions Find What Matters Most in Your Big Data with HPCC Systems Open Source. Fast. Scalable. Simple. Ideal for Dirty Data. Leverages Graph Analysis for Fast Processing & Easy Data Exploration http://p.sf.net/sfu/hpccsystems _______________________________________________ DSpace-tech mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/dspace-tech List Etiquette: https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette

