RE: cTAKES corpus

2015-11-13 Thread Posada Aguilar, Jose David
Thank you very much for your response

Jose Posada
Department of Biomedical Informatics
University of Pittsburgh

-Original Message-
From: Pei Chen [mailto:chen...@apache.org] 
Sent: Thursday, November 12, 2015 3:34 PM
To: dev@ctakes.apache.org
Subject: Re: cTAKES corpus

Hi Jose,

There were some previous discussions[1] on how to get the annotated training 
data.  Essentially, there currently isn't a centralized or easy way of getting 
w/o having to sign individual Data Use Agreements from source institutions.

There is a clear need to simplify this and I believe the various groups are 
working on it...

[1] 
http://mail-archives.apache.org/mod_mbox/ctakes-dev/201503.mbox/%3CCA+Fyf6hxBbhhEqc9oU=vpuymc1fyrwpextpmpme-ir0cjwt...@mail.gmail.com%3E

> There are some discussions on appending/augmenting the existing

> annotated/training data[2].  I think the short answer is that there is

> currently no easy way short of having to sign DUA's from every single

> source institution.

>

> [1] http://svn.apache.org/r1465043

> [2]

>

> http://mail-archives.apache.org/mod_mbox/ctakes-dev/201412.mbox/%3CE5A
> 9fa5abbf1ca4085d4f0794852a51e24241...@chexmbx3a.chboston.org%3E

On Wed, Nov 11, 2015 at 3:51 PM, Posada Aguilar, Jose David 
<josepos...@pitt.edu> wrote:
> Dear cTAKES community
>
> I want to know if it's possible to obtain the annotated corpus that were used 
> to test cTAKES.
>
> We are currently using it and we would like to be able to test each module 
> towards the addition of a new one.
>
> Thank you very much for your help.
>
>
>
> Jose Posada
> Department of Biomedical Informatics
> University of Pittsburgh
>
>


Re: cTAKES corpus

2015-11-12 Thread Pei Chen
Hi Jose,

There were some previous discussions[1] on how to get the annotated
training data.  Essentially, there currently isn't a centralized or
easy way of getting w/o having to sign individual Data Use Agreements
from source institutions.

There is a clear need to simplify this and I believe the various
groups are working on it...

[1] 
http://mail-archives.apache.org/mod_mbox/ctakes-dev/201503.mbox/%3CCA+Fyf6hxBbhhEqc9oU=vpuymc1fyrwpextpmpme-ir0cjwt...@mail.gmail.com%3E

> There are some discussions on appending/augmenting the existing

> annotated/training data[2].  I think the short answer is that there is

> currently no easy way short of having to sign DUA's from every single

> source institution.

>

> [1] http://svn.apache.org/r1465043

> [2]

>

> http://mail-archives.apache.org/mod_mbox/ctakes-dev/201412.mbox/%3ce5a9fa5abbf1ca4085d4f0794852a51e24241...@chexmbx3a.chboston.org%3E

On Wed, Nov 11, 2015 at 3:51 PM, Posada Aguilar, Jose David
 wrote:
> Dear cTAKES community
>
> I want to know if it's possible to obtain the annotated corpus that were used 
> to test cTAKES.
>
> We are currently using it and we would like to be able to test each module 
> towards the addition of a new one.
>
> Thank you very much for your help.
>
>
>
> Jose Posada
> Department of Biomedical Informatics
> University of Pittsburgh
>
>


cTAKES corpus

2015-11-11 Thread Posada Aguilar, Jose David
Dear cTAKES community

I want to know if it's possible to obtain the annotated corpus that were used 
to test cTAKES.

We are currently using it and we would like to be able to test each module 
towards the addition of a new one.

Thank you very much for your help.



Jose Posada
Department of Biomedical Informatics
University of Pittsburgh