Re: API for running ctakes programatically

vijay garla Wed, 18 Jun 2014 07:17:37 -0700

To Annotate:
If you have a CPE, and all the components in your pipeline are threadsafe
(i.e. drop LVG from your pipeline), you can increase the threads in the cpe
config
You can use this class: org.apache.ctakes.ytex.tools.RunCPE to run a cpe
from the command line/script

Alternatively, run multiple CPE's in parallel (they need to be processing
different subsets of the corpus)

To extract annotations:
Add the YTEX DBConsumer to store the annotations in a database (see
https://cwiki.apache.org/confluence/display/CTAKES/cTAKES+3.1.2+-+YTEX+DBConsumer
)
Make sure you configure 'types to ignore' - you don't want to store
annotations for punctuation.

You can add the DBConsumer to any pipeline/CPE - you don't need any other
YTEX components (however, you do have to set up a database).

On Wed, Jun 18, 2014 at 4:56 AM, Richard Eckart de Castilho <[email protected]>
wrote:

> A Groovy script has been mentioned on the developers list that illustrates
> how to use uimaFIT to compose and run a cTAKES pipeline. [1]
>
> I do not know if these scripts are only in SVN or if they are (planned to)
> be part of a release or of some documentation.
>
> Cheers,
>
> -- Richard
>
> [1]
> http://mail-archives.apache.org/mod_mbox/ctakes-dev/201312.mbox/%3c996fc801c05df64a84246a106facacd021a...@msgpexcha08a.mfad.mfroot.org%3E
>
> On 18.06.2014, at 07:33, Abhishek Raj <[email protected]> wrote:
>
> > Hello. I have been looking for a way to run ctakes programatically to
> annotate large number of documents and extract those annotations. I haven't
> come across any docs so far which explains how to do that. If someone could
> throw some light on this issue, it'd be great. Thanks! :)
>
>

Re: API for running ctakes programatically

Reply via email to