Does anyone have a script that checks all of the previously uploaded 
PDFs and find ones that are malformed and reports their URLs/record IDs?

I can see how to write a script that uses the unix command line 'file' 
and 'pdftops' tools to check that every file that looks like a PDF is a 
good and valid PDF. Going from a file on the disk to a database record 
I'm not too sure of.

cheers
stuart
-- 
Stuart Yeates
http://www.nzetc.org/       New Zealand Electronic Text Centre
http://researcharchive.vuw.ac.nz/     Institutional Repository

------------------------------------------------------------------------------
Open Source Business Conference (OSBC), March 24-25, 2009, San Francisco, CA
-OSBC tackles the biggest issue in open source: Open Sourcing the Enterprise
-Strategies to boost innovation and cut costs with open source participation
-Receive a $600 discount off the registration fee with the source code: SFAD
http://p.sf.net/sfu/XcvMzF8H
_______________________________________________
DSpace-tech mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dspace-tech

Reply via email to