Hey,
Yes, you got it right: the first snippet runs in the CR, before the data goes
into the CAS.
The second snippet is in the first annotator, or analysis engine (AE), of my
aggregate descriptor.
I am fairly sure this is an issue with the CAS used by DUCC, because if I use a
DUCC service, where we send the CAS and receive the same CAS back with added
features, I get accurate results.

The problem only appears when submitting a job, where the CAS is generated by
DUCC.
It could also be an issue with the environment (language) of DUCC, because the
default language is English.
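
To test the environment theory, a small check like the one below could be run
inside the job process. It is only a sketch: the class and method names
(EncodingCheck, mask, unmask) are made up for illustration and are not part of
UIMA or DUCC. It repeats the same encode/decode steps as my two snippets and
prints the JVM default charset, on the assumption that the default charset is
what differs between the service and the job.

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

// Hypothetical helper, not part of UIMA or DUCC.
public class EncodingCheck {

    // Same idea as the CR snippet: reinterpret the UTF-8 bytes as ISO-8859-1,
    // so every char in the returned string is in the range U+0000..U+00FF.
    public static String mask(String text) {
        return new String(text.getBytes(StandardCharsets.UTF_8),
                          StandardCharsets.ISO_8859_1);
    }

    // Same idea as the first-AE snippet: recover the original UTF-8 bytes
    // and decode them back into the original text.
    public static String unmask(String masked) {
        return new String(masked.getBytes(StandardCharsets.ISO_8859_1),
                          StandardCharsets.UTF_8);
    }

    public static void main(String[] args) {
        // Sample text with non-ASCII characters (Latin and Devanagari).
        String original = "caf\u00e9 \u0939\u093f\u0928\u094d\u0926\u0940";
        String roundTrip = unmask(mask(original));

        // What the JVM running this process uses when no charset is given.
        System.out.println("default charset: " + Charset.defaultCharset());
        System.out.println("file.encoding:   " + System.getProperty("file.encoding"));
        System.out.println("round trip ok:   " + original.equals(roundTrip));
    }
}
```

If the default charset printed by the job process is not UTF-8 while the
service process prints UTF-8, that would support the environment explanation.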

Best Regards
Rohit

On 2018/07/03 13:11:50, Eddie Epstein <[email protected]> wrote: 
> Rohit,
> 
> > Before sending the data into jcas if i force encode it :-
> >
> > String content2 = null;
> > content2 = new String(content.getBytes("UTF-8"), "ISO-8859-1");
> > jcas.setDocumentText(content2);
> >
> 
> Where is this code, in the job CR?
> 
> 
> 
> >
> > And when i go in my first annotator i force decode it:-
> >
> > String content = null;
> > content = new String(jcas.getDocumentText().getBytes("ISO-8859-1"),
> > "UTF-8");
> >
> 
> And is this in the first annotator of the job process, i.e. the CM?
> 
> Please be as specific as possible.
> 
> Thanks,
> Eddie
> 
