Dorothea Salo wrote: > You didn't say what version of DSpace you're running (and honestly, > I'm not completely sure this was fixed in 1.5 -- anybody know?), > but... one thing that may be happening is that the filter-media cron > job is dying. Since it's written without error-recovery, it stops dead > at the first file it thinks it should be able to handle but can't. > > Run it from the command-line and see if it errors out. If I'm right, > there's no obvious workaround I'm aware of, though somebody (Tim?) may > have hacked one. > > Dorothea >
The filter-media in 1.5 is a bit more robust. If it hits an Exception when dealing with one file, it will attempt to clean itself up a bit and carry on with the next one. In the cases where PDF extraction is failing due to a PDFBox bug, this is usually good enough for it to finish the filtering normally (excluding the file that caused the problem). However, I can't guarantee that will be enough in this case. But then judging by Mike's message, it's possible that filter-media wasn't even run at all. (only index-all is mentioned) G This e-mail is confidential and should not be used by anyone who is not the original intended recipient. BioMed Central Limited does not accept liability for any statements made which are clearly the sender's own and not expressly made on behalf of BioMed Central Limited. No contracts may be concluded on behalf of BioMed Central Limited by means of e-mail communication. BioMed Central Limited Registered in England and Wales with registered number 3680030 Registered Office Middlesex House, 34-42 Cleveland Street, London W1T 4LB This email has been scanned by Postini. For more information please visit http://www.postini.com ------------------------------------------------------------------------- This SF.net email is sponsored by the 2008 JavaOne(SM) Conference Don't miss this year's exciting event. There's still time to save $100. Use priority code J8TL2D2. http://ad.doubleclick.net/clk;198757673;13503038;p?http://java.sun.com/javaone _______________________________________________ DSpace-tech mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/dspace-tech

