On 3/29/07, Shawna Sadler <[EMAIL PROTECTED]> wrote:
> We've loaded them into DSpace and we're noticing very inconsistent
> behavior with MediaFilter. Some of the theses have extracted text and
> some have blank .txt files.
Are you completely sure the theses were microfilmed? If any of them were submitted as PDFs created directly from Word (or LaTeX or whatever), they would have actual text for MediaFilter to extract, as opposed to pictures of text (which you'd get by scanning print or microfilm, and which MediaFilter can't do anything with, as it is not an OCR engine).

Dorothea

_______________________________________________
DSpace-tech mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dspace-tech
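
One way to check the explanation above is to list which extracted text files came out blank and compare that list against how each thesis PDF was produced. The following is only a rough sketch in Python. It assumes the extracted .txt files have been copied into an ordinary directory, and the function name find_blank_text_files is made up for illustration; it is not part of DSpace or MediaFilter.

```python
from pathlib import Path


def find_blank_text_files(root):
    """Return sorted paths of .txt files under root that are empty or whitespace-only.

    Hypothetical helper: root is assumed to be a plain directory holding
    copies of the extracted text files, not a DSpace assetstore.
    """
    blank = []
    for path in sorted(Path(root).rglob("*.txt")):
        # Ignore undecodable bytes so one odd file does not abort the scan.
        text = path.read_text(encoding="utf-8", errors="ignore")
        if not text.strip():
            blank.append(path)
    return blank
```

If every blank file corresponds to a scanned (image-only) PDF and every non-blank one to a PDF generated from Word or LaTeX, that would fit the text-versus-pictures explanation.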

