[Dspace-tech] filter media over UTF8 files

Hlias Stavrakis Mon, 08 Dec 2008 05:23:55 -0800

Hi, I am trying to parse PDF files containing UTF-8 characters using the media filter, but the resulting text bitstream is empty. Could anyone give any advice?

The PDFFilter class uses the PDFBox library. Even when I changed the OutputStreamWriter used by PDFTextStripper to UTF-8, the result was the same: an empty text file.
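For reference, the encoding change amounts to roughly the following. This is only a minimal sketch using java.io, not the actual PDFFilter code: the class name Utf8WriterSketch and the helper writeAsUtf8 are hypothetical, and the PDFBox text-stripping step is replaced by a plain write of an already extracted string. It shows the writer being given an explicit UTF-8 charset and being closed (and therefore flushed) before the bytes are read back.

```java
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.OutputStreamWriter;
import java.io.Writer;
import java.nio.charset.StandardCharsets;

public class Utf8WriterSketch {

    // Hypothetical helper: stands in for the step where extracted text
    // is written to the output stream of the text bitstream.
    static byte[] writeAsUtf8(String extractedText) throws IOException {
        ByteArrayOutputStream bytes = new ByteArrayOutputStream();
        // Name the charset explicitly instead of relying on the platform default.
        try (Writer writer = new OutputStreamWriter(bytes, StandardCharsets.UTF_8)) {
            // Assumption: in the real filter, the PDFBox text stripper
            // would write to this writer instead of this plain call.
            writer.write(extractedText);
        } // try-with-resources closes the writer, which flushes buffered characters
        return bytes.toByteArray();
    }

    public static void main(String[] args) throws IOException {
        // Greek sample text, written with Unicode escapes to keep the source ASCII-only.
        byte[] out = writeAsUtf8("\u0397\u03bb\u03af\u03b1\u03c2");
        System.out.println(out.length + " bytes written");
    }
}
```

If a sketch like this produces non-empty output for the same characters, the writer encoding is probably not what leaves the bitstream empty, and the text extraction step itself would be the next thing to check.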


Thanks in advance,
Ilias Stavrakis



