i´m using hwpf to extract text from word documents. 

WordExtractor extractor = new WordExtractor(fis);

String body = extractor.getText();

Returns invalid characters like: \u0013 \u0014 \u000b

any suggestion on what the problem might be?

-- 
View this message in context: 
http://www.nabble.com/errors-in-text-extraction-tp17329385p17329385.html
Sent from the POI - User mailing list archive at Nabble.com.


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to