Hi Bandeep, my team has done some work here, see: Spark / cTAKES – Giuseppe Totaro https://github.com/giuseppetotaro/ctakes-clinical-pipeline
UIMA/DUCC/cTAKES – Yi-Wen Liu https://github.com/yiwenliuable/ctakes-scale-out-with-uima-ducc UIMA/DUCC/cTAKES – Selina Chu https://github.com/selinachu/DUCC-cTAKES-AWS Comments + Feedback welcome. Chris ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ Chris Mattmann, Ph.D. Chief Architect, Instrument Software and Science Data Systems Section (398) Manager, Open Source Projects Formulation and Development Office (8212) NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA Office: 168-519, Mailstop: 168-527 Email: chris.a.mattm...@nasa.gov<mailto:chris.a.mattm...@nasa.gov> WWW: http://sunset.usc.edu/~mattmann/ ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ Director, Information Retrieval and Data Science Group (IRDS) Adjunct Associate Professor, Computer Science Department University of Southern California, Los Angeles, CA 90089 USA WWW: http://irds.usc.edu/ ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ From: Bandeep Singh <bsi...@phemi.com> Reply-To: "user@ctakes.apache.org" <user@ctakes.apache.org> Date: Friday, August 12, 2016 at 10:05 AM To: "user@ctakes.apache.org" <user@ctakes.apache.org> Subject: How to use cTakes with SPARK Hi Team, I am very new to cTAKES and just started learning how to use it. I am wondering how to use cTakes API with SPARk (pyspark preferably) for Big data. Can somebody point me in the right direction. Till now I downloaded cTakes jars and tried building it with SPARK, but it threw me some resource allocation exception. Any response will be highly appreciated. Thanks, Bandeep