Chen brings up a good point. But can't we solve this problem by creating
multiple Pair annotations in the CAS?
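A minimal sketch of that idea, in plain Java so it runs without UIMA or cTAKES on the classpath. The Pair and FakeCas classes below are hypothetical stand-ins (not the real org.apache.ctakes.typesystem.type.util.Pair or a JCas), and the attribute names "polarity_label" and "severity_label" are made up for illustration. The assumption is that each classification task gets its own attribute name, so several Pairs can coexist for one document:

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Optional;

public class DocumentLabelSketch {

    // Hypothetical stand-in for the cTAKES Pair type: it models only the
    // two features mentioned in this thread (attribute and value).
    static final class Pair {
        final String attribute;
        final String value;

        Pair(String attribute, String value) {
            this.attribute = attribute;
            this.value = value;
        }
    }

    // Hypothetical stand-in for a CAS: just a list of Pairs for one document.
    static final class FakeCas {
        private final List<Pair> pairs = new ArrayList<>();

        void add(Pair pair) {
            pairs.add(pair);
        }

        List<Pair> pairs() {
            return pairs;
        }
    }

    // Store one label per task; the task name goes into Pair.attribute.
    static void addLabel(FakeCas cas, String task, String label) {
        cas.add(new Pair(task, label));
    }

    // Look up the label stored for a given task, if there is one.
    static Optional<String> labelFor(FakeCas cas, String task) {
        return cas.pairs().stream()
                .filter(p -> p.attribute.equals(task))
                .map(p -> p.value)
                .findFirst();
    }

    public static void main(String[] args) {
        FakeCas cas = new FakeCas();
        addLabel(cas, "polarity_label", "positive");
        addLabel(cas, "severity_label", "moderate");

        System.out.println(labelFor(cas, "polarity_label").orElse("(none)")); // prints "positive"
        System.out.println(labelFor(cas, "severity_label").orElse("(none)")); // prints "moderate"
    }
}
```

With a real CAS, the same pattern would presumably mean creating one Pair per task and adding each to the indexes; that part is not shown here.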
Dima
On 11/15/2012 01:52 PM, Lin, Chen wrote:
I am curious whether Pair allows multiple document-level labels for a single
document. A single set of documents may be used in multiple classification
tasks. For example, in one task a document may be labeled "positive" or
"negative", while in another task the same document may be labeled "high",
"moderate", or "low". Many thanks!
Best,
Chen
-----Original Message-----
From: Dmitriy Dligach [mailto:[email protected]]
Sent: Thursday, November 15, 2012 1:46 PM
To: [email protected]
Subject: Re: new type: document label?
Thank you, James.
So, in general, did you envision this kind of use for Pair?
Pair.attribute -> "document_label"
Pair.value -> "positive"
I think this may work.
Dima
On 11/15/2012 10:22 AM, Masanz, James J. wrote:
Pair (org.apache.ctakes.typesystem.type.util.Pair) is intended for such
document-level properties.
Would that suit your need?
-- James
-----Original Message-----
From: [email protected] [mailto:[email protected]] On Behalf Of Dmitriy Dligach
Sent: Thursday, November 15, 2012 9:16 AM
To: cTAKES Dev list @ ASF
Subject: new type: document label?
We've recently been using cTAKES more and more for document-level
classification (e.g. phenotyping). Would it make sense to add a new
type (that would derive from TOP) to store the label for a document?
I know we currently have a doc id for each document, but having the
label type would simplify a lot of things (e.g. debugging).
Thanks,
Dima