Hi Bandeep, my team has done some work here, see:

Spark / cTAKES – Giuseppe Totaro
https://github.com/giuseppetotaro/ctakes-clinical-pipeline

UIMA/DUCC/cTAKES – Yi-Wen Liu
https://github.com/yiwenliuable/ctakes-scale-out-with-uima-ducc

UIMA/DUCC/cTAKES – Selina Chu
https://github.com/selinachu/DUCC-cTAKES-AWS

Comments + Feedback welcome.

Chris

++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Chief Architect, Instrument Software and Science Data Systems Section (398)
Manager, Open Source Projects Formulation and Development Office (8212)
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 168-519, Mailstop: 168-527
Email: chris.a.mattm...@nasa.gov
WWW:  http://sunset.usc.edu/~mattmann/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Director, Information Retrieval and Data Science Group (IRDS)
Adjunct Associate Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
WWW: http://irds.usc.edu/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++


From: Bandeep Singh <bsi...@phemi.com>
Reply-To: "user@ctakes.apache.org" <user@ctakes.apache.org>
Date: Friday, August 12, 2016 at 10:05 AM
To: "user@ctakes.apache.org" <user@ctakes.apache.org>
Subject: How to use cTakes with SPARK

Hi Team,

I am very new to cTAKES and just started learning how to use it.
I am wondering how to use the cTAKES API with Spark (preferably PySpark) for
big data.
Can somebody point me in the right direction?

So far I have downloaded the cTAKES jars and tried building them with Spark,
but it threw a resource allocation exception.

Any response will be highly appreciated.

Thanks,
Bandeep
