Hi, I just have a couple of questions. cTAKES uses various open-source NLP libraries for sentence detection, tokenization, POS tagging, and chunking. Can anyone tell me which trained models are used for POS tagging and chunking? Are they based on the GENIA corpus? I tried the GENIA tagger, but it gives me different results from cTAKES. Can anyone suggest how to incorporate domain-specific corpora for tagging and chunking in cTAKES?
Regards, Prasanna
