I am curious to know whether Pair allows multiple document-level labels for a single document. A single set of documents may be used in multiple classification tasks. For example, in one task a document may be labeled "positive" or "negative", while in another task the same document may be labeled "high", "moderate", or "low". Many thanks!

Best,
Chen

-----Original Message-----
From: Dmitriy Dligach [mailto:[email protected]]
Sent: Thursday, November 15, 2012 1:46 PM
To: [email protected]
Subject: Re: new type: document label?

Thank you, James. So, in general did you envision this type of use for Pair:

Pair.attribute -> "document_label"
Pair.value -> "positive"

I think this may work.

Dima

On 11/15/2012 10:22 AM, Masanz, James J. wrote:
> Pair (org.apache.ctakes.typesystem.type.util.Pair) is intended for such
> document-level properties.
> Would that suit your need?
>
> -- James
>
>> -----Original Message-----
>> From:
>> [email protected]
>> [mailto:ctakes-dev-return-854-
>> [email protected]] On Behalf Of Dmitriy
>> Dligach
>> Sent: Thursday, November 15, 2012 9:16 AM
>> To: cTAKES Dev list @ ASF
>> Subject: new type: document label?
>>
>> We've recently been using cTAKES more and more for document-level
>> classification (e.g. phenotyping). Would it make sense to add a new
>> type (that would derive from TOP) to store the label for a document?
>> I know we currently have a doc id for each document, but having the
>> label type would simplify a lot of things (e.g. debugging).
>>
>> Thanks,
>>
>> Dima
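
For illustration only, here is a minimal sketch of how several attribute/value pairs could carry one label per classification task for the same document. It does not use the real cTAKES or UIMA classes: LabelPair and DocumentLabels are hypothetical stand-ins that only mirror the attribute/value shape discussed in this thread, and the attribute names "sentiment_label" and "severity_label" are made up. Whether the actual Pair type can be stored more than once per document is exactly the open question, so this shows the idea rather than confirmed behavior.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical stand-in for the attribute/value shape of the Pair type;
// this is NOT org.apache.ctakes.typesystem.type.util.Pair.
final class LabelPair {
    final String attribute; // names the classification task
    final String value;     // the label assigned for that task

    LabelPair(String attribute, String value) {
        this.attribute = attribute;
        this.value = value;
    }
}

// Hypothetical per-document container holding any number of label pairs.
final class DocumentLabels {
    private final List<LabelPair> pairs = new ArrayList<>();

    // Attach one more label to the document, keyed by task name.
    void add(String attribute, String value) {
        pairs.add(new LabelPair(attribute, value));
    }

    // Return the label for one task, or null if the task has no label.
    String labelFor(String attribute) {
        for (LabelPair p : pairs) {
            if (p.attribute.equals(attribute)) {
                return p.value;
            }
        }
        return null;
    }

    // Number of labels currently attached to the document.
    int size() {
        return pairs.size();
    }
}
```

With this shape, each task uses its own attribute name, so adding ("sentiment_label", "positive") and ("severity_label", "moderate") to the same document keeps the two labels apart, and a task looks up only the attribute it cares about.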
