No problem about the misunderstanding. I am using InputStream input =new URL ( "file:///"+file.getAbsolutePath() ).openStream (); WordExtractor extractor = new WordExtractor(input); content=extractor.getText();
where the wordextractor is org.apache.poi.hwpf.extractor.WordExtractor; The wordextractor takes an inputstream as an arguement. Should i determine the encoding of the inputstream and how? -- View this message in context: http://www.nabble.com/Problem-using-Lucene-on-Ubuntu-tp15543843p15545082.html Sent from the Lucene - Java Users mailing list archive at Nabble.com. --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]