Re: Problem in running DUCC Job for Arabic Language

rohit14csu173 Thu, 05 Jul 2018 03:50:18 -0700

Hey,
 Yeah you got it right the first snippet comes in CR before the data goes in 
CAS.
And the second snippet is in the first annotator or analysis engine(AE) of my 
Aggregate Desciptor.
I am pretty sure this is an issue of the CAS used by DUCC because if i use 
service of DUCC in which we are supposed to send the CAS and receive the same 
CAS with added features from DUCC i get accurate results.


But the only problem comes in submitting a job where the cas is generated by 
DUCC.
This can also be a issue of the enviornment(Language) of DUCC because the 
default language is english.

Bets Regards
Rohit

On 2018/07/03 13:11:50, Eddie Epstein <[email protected]> wrote: 
> Rohit,
> 
> Before sending the data into jcas if i force encode it :-
> >
> > String content2 = null;
> > content2 = new String(content.getBytes("UTF-8"), "ISO-8859-1");
> > jcas.setDocumentText(content2);
> >
> 
> Where is this code, in the job CR?
> 
> 
> 
> >
> > And when i go in my first annotator i force decode it:-
> >
> > String content = null;
> > content = new String(jcas.getDocumentText.getBytes("ISO-8859-1"),
> > "UTF-8");
> >
> 
> And is this in the first annotator of the job process, i.e. the CM?
> 
> Please be as specific as possible.
> 
> Thanks,
> Eddie
>

Re: Problem in running DUCC Job for Arabic Language

Reply via email to