What format (features, labels) is best suited for additional training examples?

The SubjectCleartkAnalysisEngine class loads
/org/apache/ctakes/assertion/models/subject/model.jar, which contains a
LIBLINEAR ClearTK model.

The model has 3 features, label 12 3. 

But what exactly are the features, and how are they derived?

What does the target class look like? Is it really differentiating between
"patient", "brother", "sister" etc., or is it a binary decision model between
"patient" and "family_history" (the latter is what it looks like to me)?

This is not documented.

Tomasz
