On 3/29/07, Shawna Sadler <[EMAIL PROTECTED]> wrote:

> We've loaded them into DSpace and we're noticing very inconsistent
> behavior with MediaFilter. Some of the theses have extracted text and
> some have blank .txt files.

Are you completely sure the theses were microfilmed? If any of them
were submitted as PDFs created directly from Word (or LaTeX or
whatever), they would have actual text for MediaFilter to extract, as
opposed to pictures of text (which you'd get by scanning print or
microfilm, and which MediaFilter can't do anything with as it is not
an OCR engine).

Dorothea

-------------------------------------------------------------------------
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT & business topics through brief surveys-and earn cash
http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV
_______________________________________________
DSpace-tech mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dspace-tech

Reply via email to