http://www.textmining.org/modules.php?op=modload&name=Downloads&file=index&req=getit&lid=2
On Mon, 20 Dec 2004 11:06:16 +0530, IndianAtTech <[EMAIL PROTECTED]> wrote:
> I suggest you use the Text Mining API, which is built on the POI libraries.
>
> Here is the site link: http://www.textmining.org/
>
> Here is an example:
>
> org.textmining.text.extraction.WordExtractor _word;
> _word = new org.textmining.text.extraction.WordExtractor();
> // initialise the TEXTMINING-POI word object
>
> InputStream _wordInput = new FileInputStream(strDocName);
> String wordTextBuffer = _word.extractText(_wordInput);
> System.out.println(wordTextBuffer);
> _wordInput.close(); // close the input stream
> _word = null;
> _wordInput = null;
>
> Best Regards
> Sudhakar
>
> On Mon, 20 Dec 2004 09:57:29 +0800 (CST), rec liu
> <[EMAIL PROTECTED]> wrote:
> > Hello,
> > I got some code from the internet that extracts an MS Word file to a
> > text file. I tried it with English text and it works correctly, but with
> > Chinese characters it drops some of them. That is to say, only part of
> > the content is saved and the rest is lost, no matter whether the file is
> > short or long. Why? What can I do? My code is as follows:
> > public boolean Extrator(){
> >     try
> >     {
> >         file = new WordDocument(fileName);
> >
> >         //Writer out = new BufferedWriter(new FileWriter(outFileName));
> >         Writer out = new OutputStreamWriter(new
> >             FileOutputStream(outFileName),"utf-8");
> >         file.writeAllText(out);
> >
> >         //file.closeDoc();
> >         out.flush();
> >         out.close();
> >     } catch(Throwable t){
> >         t.printStackTrace();
> >         return false;
> >     }
> >     return true;
> > }
> > }
> > Thanks.
> > jack
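
One way to narrow down where the Chinese characters go missing is to rule out the writing side first. The following is only a minimal sketch using the plain JDK, not part of POI or the Text Mining API: the class name `Utf8TextFile` and its `write` and `read` helpers are hypothetical, and the sample string stands in for whatever text an extractor returns. If a round trip like this preserves the text, the loss is more likely happening inside the extractor than in the output stream.

```java
import java.io.IOException;
import java.io.Writer;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;

// Hypothetical helper: writes text as UTF-8 and reads it back for comparison.
class Utf8TextFile {

    // Write the text with an explicit UTF-8 encoder; try-with-resources
    // flushes and closes the writer even if write() throws.
    static void write(Path target, String text) throws IOException {
        try (Writer out = Files.newBufferedWriter(target, StandardCharsets.UTF_8)) {
            out.write(text);
        }
    }

    // Read the whole file back, decoding the bytes as UTF-8.
    static String read(Path source) throws IOException {
        return new String(Files.readAllBytes(source), StandardCharsets.UTF_8);
    }

    public static void main(String[] args) throws IOException {
        Path tmp = Files.createTempFile("extracted", ".txt");
        // Stand-in for extractor output: ASCII followed by four CJK characters.
        String sample = "Hello, \u4e2d\u6587\u6587\u672c";
        write(tmp, sample);
        // Reports whether the text survived the write/read round trip.
        System.out.println("round trip intact: " + read(tmp).equals(sample));
        Files.delete(tmp);
    }
}
```

In practice the string returned by `extractText` (or whatever the extractor writes) would be compared against the original document by eye or by length, since the sketch only checks the file-writing step.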
