i´m using hwpf to extract text from word documents. WordExtractor extractor = new WordExtractor(fis);
String body = extractor.getText(); Returns invalid characters like: \u0013 \u0014 \u000b any suggestion on what the problem might be? -- View this message in context: http://www.nabble.com/errors-in-text-extraction-tp17329385p17329385.html Sent from the POI - User mailing list archive at Nabble.com. --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
