RE: CASConsumer: Was: cTAKES - Context Annotation

Chen, Pei Wed, 01 May 2013 06:34:49 -0700

Andy,
Is the consumer flexible enough to allow the user to select/configure which 
types they wish to store?
Either way, I would suggest opening up a Jira and adding the code (perhaps in 
the sandbox area first), especially if you're also planning to help maintain it.


Thanks,
--Pei

> -----Original Message-----
> From: Andy McMurry [mailto:[email protected]]
> Sent: Wednesday, May 01, 2013 6:16 AM
> To: [email protected]
> Subject: Re: cTAKES - Context Annotation
> 
> RE: Context Dependent Annotation
> 
> I had needs for fast local write (Consumer) that could directly be imported
> into a SQL database for further analysis independent of cTakes.
> Consuming annotations are MUCH faster than what I was doing before
> 
> Supports fast CSV->SQL inserts for the following types in a standard database
> table format
> * Sentence (annotation)
> * BaseToken
> * NewLineToken
> * ContractionToken
> * PunctuationToken
> * SymbolToken
> * WordToken
> * NumToken
> 
> Creates database tables for each token type.
> 
> If folks are interested in "yet another annotation consumer" I can upload.
> 
> --Andy
> 
> 
> 
> On May 1, 2013, at 3:02 AM, "Masanz, James J."
> <[email protected]> wrote:
> 
> > I was thinking something like this:
> >
> > public void process(JCas jcas) throws AnalysisEngineProcessException {
> > for (Sentence sentence : JCasUtil.select(jcas, Sentence.class)) {  int
> > sentenceBegin = sentence.getBegin();  int fromIndex = 0;  String s =
> > sentence.getCoveredText();  while (fromIndex < s.length()) {
> >    LookupWindowAnnotation lwa = new LookupWindowAnnotation(jcas);
> >    lwa.setBegin(sentenceBegin + fromIndex);
> >    int commaPosition = s.indexOf(COMMA, fromIndex);
> >    if (commaPosition > 0) {
> >       lwa.setEnd(sentenceBegin+commaPosition+1); // include the comma
> >    } else {
> >       lwa.setEnd(sentence.getEnd());
> >    }
> >    lwa.addToIndexes(jcas);
> >    fromIndex = lwa.getEnd()-sentenceBegin;  } }
> > )
> >
> > -- James
> >
> >> -----Original Message-----
> >> From: [email protected]
> >> [mailto:dev- [email protected]]
> On
> >> Behalf Of Piyush Jain
> >> Sent: Tuesday, April 30, 2013 8:55 AM
> >> To: [email protected]
> >> Subject: Re: cTAKES - Context Annotation
> >>
> >> Hello James,
> >>
> >> Thanks for your reply. Apologies for late reply.
> >>
> >> Could you please help me on how can I create a new
> >> WindowAnnotationClass for keeping comma out of scope of a sentence?
> >>
> >> Regards,
> >> Piyush
> >>
> >>
> >> On Wed, Apr 3, 2013 at 11:14 AM, Masanz, James J.
> >> <[email protected]>wrote:
> >>
> >>> In addition to "MaxLeftScopeSize" and "MaxRightScopeSize", the
> >>> context annotations created by ctakes-ne-contexts use the parameter
> called
> >>> WindowAnnotationClass to control the boudaries.   By default, when the
> >>> context annotator collects the context annotations for a named
> >>> entity, it will not look beyond the boundaries of the sentence that
> >>> the named entity is found in, because within NegationAnnotator.xml
> >>> and within StatusAnnotator.xml, WindowAnnotationClass is set to be
> >>> Sentence
> >>> (org.apache.ctakes.typesystem.type.textspan.Sentence)
> >>>
> >>> If you want a narrower range (to not cross commas) then I suggest
> >>> you create new annotations whose boundaries are determined by the
> >>> start and end of sentences and by occurrences of commas, and use the
> >>> name of that new annotation as the WindowAnnotationClass.
> >>>
> >>> -- James
> >>>
> >>>
> >>>> -----Original Message-----
> >>>> From: [email protected]
> [mailto:
> >>> dev-
> >>>> [email protected]] On Behalf
> Of
> >>>> Piyush Jain
> >>>> Sent: Monday, April 01, 2013 10:56 AM
> >>>> To: [email protected]
> >>>> Subject: cTAKES - Context Annotation
> >>>>
> >>>> Hell Sir/Madam,
> >>>>
> >>>> I am new to this forum and I have few question regarding cTAKES.
> >>>> As of now, I am posting my question in this email only, please let
> >>>> me
> >>> know
> >>>> if I have to post my questions somewhere else.
> >>>>
> >>>> Question:
> >>>> I am using cTAKES context annotation. I am trying to keep
> >>>> "MaxLeftScopeSize" and "MaxRightScopeSize" as 2, earlier it was 10.
> >>>> Also, I need to keep the context annotation look up to be limited
> >>>> to same sentence and same "," span.
> >>>>
> >>>> Could you please help me with this or if you are not the right
> >>>> person could you please redirect me to right person?
> >>>>
> >>>>
> >>>> Regards,
> >>>> Piyush
> >>>

RE: CASConsumer: Was: cTAKES - Context Annotation

Reply via email to