Does anyone have a script that checks all of the previously uploaded PDFs and find ones that are malformed and reports their URLs/record IDs?
I can see how to write a script that uses the unix command line 'file' and 'pdftops' tools to check that every file that looks like a PDF is a good and valid PDF. Going from a file on the disk to a database record I'm not too sure of. cheers stuart -- Stuart Yeates http://www.nzetc.org/ New Zealand Electronic Text Centre http://researcharchive.vuw.ac.nz/ Institutional Repository ------------------------------------------------------------------------------ Open Source Business Conference (OSBC), March 24-25, 2009, San Francisco, CA -OSBC tackles the biggest issue in open source: Open Sourcing the Enterprise -Strategies to boost innovation and cut costs with open source participation -Receive a $600 discount off the registration fee with the source code: SFAD http://p.sf.net/sfu/XcvMzF8H _______________________________________________ DSpace-tech mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/dspace-tech

