RE: cTAKES corpus
Thank you very much for your response Jose Posada Department of Biomedical Informatics University of Pittsburgh -Original Message- From: Pei Chen [mailto:chen...@apache.org] Sent: Thursday, November 12, 2015 3:34 PM To: dev@ctakes.apache.org Subject: Re: cTAKES corpus Hi Jose, There were some previous discussions[1] on how to get the annotated training data. Essentially, there currently isn't a centralized or easy way of getting w/o having to sign individual Data Use Agreements from source institutions. There is a clear need to simplify this and I believe the various groups are working on it... [1] http://mail-archives.apache.org/mod_mbox/ctakes-dev/201503.mbox/%3CCA+Fyf6hxBbhhEqc9oU=vpuymc1fyrwpextpmpme-ir0cjwt...@mail.gmail.com%3E > There are some discussions on appending/augmenting the existing > annotated/training data[2]. I think the short answer is that there is > currently no easy way short of having to sign DUA's from every single > source institution. > > [1] http://svn.apache.org/r1465043 > [2] > > http://mail-archives.apache.org/mod_mbox/ctakes-dev/201412.mbox/%3CE5A > 9fa5abbf1ca4085d4f0794852a51e24241...@chexmbx3a.chboston.org%3E On Wed, Nov 11, 2015 at 3:51 PM, Posada Aguilar, Jose David <josepos...@pitt.edu> wrote: > Dear cTAKES community > > I want to know if it's possible to obtain the annotated corpus that were used > to test cTAKES. > > We are currently using it and we would like to be able to test each module > towards the addition of a new one. > > Thank you very much for your help. > > > > Jose Posada > Department of Biomedical Informatics > University of Pittsburgh > >
Re: cTAKES corpus
Hi Jose, There were some previous discussions[1] on how to get the annotated training data. Essentially, there currently isn't a centralized or easy way of getting w/o having to sign individual Data Use Agreements from source institutions. There is a clear need to simplify this and I believe the various groups are working on it... [1] http://mail-archives.apache.org/mod_mbox/ctakes-dev/201503.mbox/%3CCA+Fyf6hxBbhhEqc9oU=vpuymc1fyrwpextpmpme-ir0cjwt...@mail.gmail.com%3E > There are some discussions on appending/augmenting the existing > annotated/training data[2]. I think the short answer is that there is > currently no easy way short of having to sign DUA's from every single > source institution. > > [1] http://svn.apache.org/r1465043 > [2] > > http://mail-archives.apache.org/mod_mbox/ctakes-dev/201412.mbox/%3ce5a9fa5abbf1ca4085d4f0794852a51e24241...@chexmbx3a.chboston.org%3E On Wed, Nov 11, 2015 at 3:51 PM, Posada Aguilar, Jose Davidwrote: > Dear cTAKES community > > I want to know if it's possible to obtain the annotated corpus that were used > to test cTAKES. > > We are currently using it and we would like to be able to test each module > towards the addition of a new one. > > Thank you very much for your help. > > > > Jose Posada > Department of Biomedical Informatics > University of Pittsburgh > >
cTAKES corpus
Dear cTAKES community I want to know if it's possible to obtain the annotated corpus that were used to test cTAKES. We are currently using it and we would like to be able to test each module towards the addition of a new one. Thank you very much for your help. Jose Posada Department of Biomedical Informatics University of Pittsburgh