http://www.textmining.org/modules.php?op=modload&name=Downloads&file=index&req=getit&lid=2
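
In case it helps narrow the problem down, here is a minimal sketch (my own, not part of POI or the Text Mining API) that checks only the output side. It writes a short Chinese string through an OutputStreamWriter opened as UTF-8, the same way the quoted Extrator() method opens its Writer, and reads it back. The class name Utf8RoundTrip and its methods are hypothetical, and the sample text is written with Unicode escapes so the result does not depend on the compiler's source encoding.

```java
import java.io.IOException;
import java.io.OutputStreamWriter;
import java.io.Writer;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;

public class Utf8RoundTrip {

    // Writes text as UTF-8, mirroring the Writer set-up in the quoted code.
    static void writeUtf8(String text, Path target) throws IOException {
        try (Writer out = new OutputStreamWriter(
                Files.newOutputStream(target), StandardCharsets.UTF_8)) {
            out.write(text);
        } // try-with-resources flushes and closes the writer
    }

    // Writes text to a temporary file and decodes it again as UTF-8.
    static String roundTrip(String text) throws IOException {
        Path tmp = Files.createTempFile("extract", ".txt");
        try {
            writeUtf8(text, tmp);
            return new String(Files.readAllBytes(tmp), StandardCharsets.UTF_8);
        } finally {
            Files.deleteIfExists(tmp);
        }
    }

    public static void main(String[] args) throws IOException {
        // "\u4e2d\u6587" is the two-character word for "Chinese".
        String sample = "Word \u4e2d\u6587 text";
        System.out.println("round trip intact: " + sample.equals(roundTrip(sample)));
    }
}
```

If the round trip is intact on your machine, the UTF-8 writer is probably not where the characters go missing, and the extraction step is the more likely place to look.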


On Mon, 20 Dec 2004 11:06:16 +0530, IndianAtTech <[EMAIL PROTECTED]> wrote:
> I suggest you use the Text Mining API, which is built on the POI libraries.
> 
> Here is the site link: http://www.textmining.org/
> 
> Here is an example:
> 
> // Initialise the TextMining (POI-based) Word extractor
> org.textmining.text.extraction.WordExtractor _word =
>     new org.textmining.text.extraction.WordExtractor();
> 
> InputStream _wordInput = new FileInputStream(strDocName);
> try {
>     String wordTextBuffer = _word.extractText(_wordInput);
>     System.out.println(wordTextBuffer);
> } finally {
>     _wordInput.close(); // close the input stream even if extraction fails
> }
> 
> Best Regards
> Sudhakar
> 
> On Mon, 20 Dec 2004 09:57:29 +0800 (CST), rec liu
> <[EMAIL PROTECTED]> wrote:
> > Hello,
> > I got some code from the internet that extracts an MS Word file to a
> > text file. With English text it works correctly, but with Chinese
> > characters some of the text is missing: only part of the content is
> > saved and the rest is lost, whether the file is short or long. Why,
> > and what can I do? My code is as follows:
> > public boolean Extrator() {
> >     try {
> >         file = new WordDocument(fileName);
> >
> >         //Writer out = new BufferedWriter(new FileWriter(outFileName));
> >         Writer out = new OutputStreamWriter(
> >                 new FileOutputStream(outFileName), "utf-8");
> >         file.writeAllText(out);
> >
> >         //file.closeDoc();
> >         out.flush();
> >         out.close();
> >     } catch (Throwable t) {
> >         t.printStackTrace();
> >         return false;
> >     }
> >     return true;
> > }
> > }
> > thanks.
> > jack
> >
>

