Hi, I am trying to parse pdf files containing UTF8 characters using media filter,
but the resulting text bitstream is empty. Could anyone give any advice?

The PDFFilter class is using pdfbox library. Even when I changed the OutputStreamWriter that is used by PDFTextStripper to UTF8, the result was the same empty text file.

Thanks in advance,
Ilias Stavrakis



Attachment: smime.p7s
Description: S/MIME Cryptographic Signature

------------------------------------------------------------------------------
SF.Net email is Sponsored by MIX09, March 18-20, 2009 in Las Vegas, Nevada.
The future of the web can't happen without you.  Join us at MIX09 to help
pave the way to the Next Web now. Learn more and register at
http://ad.doubleclick.net/clk;208669438;13503038;i?http://2009.visitmix.com/
_______________________________________________
DSpace-tech mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dspace-tech

Reply via email to