Dear Thilo Goetz,
Thank you for your response.

I have already tried different ways of reading the text file with different
encodings.

For example, using the Commons IO FileUtils class, I tried the following:

............
String document = FileUtils.file2String(inputFile, "UTF-8");
tcas.setDocumentText(document);
tae.process(tcas);
                   ......

It gets stuck again at the process() method. The problem seems to be with that method.
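
To rule out the file-reading step on its own, a minimal sketch using only the
JDK (no UIMA, no Commons IO) could look like the following. The class name
ReadTextFile and the method readUtf8 are made up for illustration, and the
default path is only assumed from the earlier mail. If this prints a character
count right away, the read itself is not what hangs.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;

public class ReadTextFile {

    // Reads the whole file and decodes its bytes as UTF-8,
    // independent of the platform default charset.
    // Malformed byte sequences are replaced with U+FFFD rather than rejected.
    public static String readUtf8(Path path) throws IOException {
        byte[] bytes = Files.readAllBytes(path);
        return new String(bytes, StandardCharsets.UTF_8);
    }

    public static void main(String[] args) throws IOException {
        // Assumed default path, taken from the earlier mail; pass another as args[0].
        Path input = Paths.get(args.length > 0 ? args[0] : "data/document1.txt");
        String document = readUtf8(input);
        System.out.println("Read " + document.length() + " chars");
    }
}
```

The resulting String could then be handed to setDocumentText() in place of the
one built from the raw byte array.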

Thank you very much.

On Sun, May 27, 2012 at 8:42 AM, Thilo Goetz <[email protected]> wrote:

> On 26/05/12 23:13, Seid Muhie wrote:
> > Dear all,
> > I have a Unicode document I want to process.
> > Following the tutorial at
> > this<http://www.ibm.com/developerworks/webservices/tutorials/ws-uima/>,
> > the code gets stuck at the last line.
> >
> > File taeDescriptor = new File("desc\\DateAnnotatorAEDescriptor.xml");
> > File inputFile = new File("data\\document1.txt");
> > XMLInputSource in = new XMLInputSource(taeDescriptor);
> > ResourceSpecifier specifier = UIMAFramework.getXMLParser().parseResourceSpecifier(in);
> > AnalysisEngine tae = UIMAFramework.produceAnalysisEngine(specifier);
> > CAS tcas = tae.newCAS();
> > FileInputStream fis = new FileInputStream(inputFile);
> > byte[] contents = new byte[(int) inputFile.length()];
> > fis.read(contents);
> > fis.close();
> > String document = new String(contents);
> > tcas.setDocumentText(document);
> > *tae.process(tcas);*
> >
> > thank you.
> >
>
> Please check the web on how to read in a text file with
> a specific encoding.  An easy way is to use commons io.
>
> --Thilo
>
>


-- 
Seid M.
+++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Dream bright, success will follow!
+++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
