BTW, UMich would be very interested in this functionality. Even a way to pull this additional information on ingest and keep it with the "item" in a reportable form would help, as we have 140K+ PDFs and no idea what flavor they really are. It seems that this info could be saved in the PREMIS metadata section.
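
As a rough illustration of the kind of per-file information that could be recorded on ingest, here is a minimal Python sketch. It is not a DSpace curation task and not a replacement for JHOVE or PDFBox: it only reads the `%PDF-x.y` header and looks for the `%%EOF` trailer marker. The function name `sniff_pdf` and the report keys are made up for this example.

```python
import os
import re

# Matches the version in a PDF header such as "%PDF-1.4".
HEADER_RE = re.compile(rb"%PDF-(\d\.\d)")


def sniff_pdf(path):
    """Return a small report dict for one file (hypothetical helper).

    This is a cheap sanity check only; it does not validate the file.
    """
    with open(path, "rb") as fh:
        head = fh.read(1024)               # header is expected near the start
        fh.seek(0, os.SEEK_END)
        size = fh.tell()
        fh.seek(max(0, size - 1024))       # trailer marker is expected near the end
        tail = fh.read()

    match = HEADER_RE.search(head)
    return {
        "path": str(path),
        "has_header": match is not None,
        "version": match.group(1).decode("ascii") if match else None,
        "has_eof_marker": b"%%EOF" in tail,
    }
```

Calling `sniff_pdf` on each bitstream would give a header version and two flags that could be stored alongside the item. Telling PDF/A from plain PDF, or confirming that a file is actually well-formed, would still need a real validator such as the two mentioned in the original message.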
On Monday, August 8, 2022 at 4:38:01 PM UTC-4 hardy.p...@gmail.com wrote:
> Hi, wow, it has been a while since I've written to DSpace-tech. :-)
>
> I'm writing to ask, would anyone happen to know if someone has created a curation task to validate PDF files, perhaps using JHOVE [1], or PDFBox [2]? I did a quick search and came up empty, but, I'm sure I can't be the only dev working with a repository with a handful of known bad PDFs and thinking "there has to be a better way"?
>
> Thanks for your help!
>
> --Hardy
>
> 1. https://jhove.openpreservation.org/
> 2. https://pdfbox.apache.org/