I haven’t tried this but maybe have a look here: 
https://svn.apache.org/repos/asf/ctakes/sandbox/ctakes-spark-streaming-twitter/

Thanks,
Brandon

From: Bandeep Singh [mailto:[email protected]]
Sent: Friday, August 12, 2016 3:16 PM
To: [email protected]
Subject: [External] Re: How to use cTakes with SPARK

Hi,

Thanks for your reply !

But I am wondering if somebody has run cTakes using SPARK and actually succeed. 
If yes, some resources/examples would be really helpful.

Budha I tried building SPARK with cTAKES, however the when i execute a sime 
HelloWorldAnnotator.java function it threw exceptions, which is suspect is 
because the example was written a long back and doesn't comply any more with 
the current libraries.

Thanks Again,
Bandeep

On Fri, Aug 12, 2016 at 10:21 AM, buddha 
<[email protected]<mailto:[email protected]>> wrote:
cTAKES is a Java project, so it should work “out of the box” with the Java 
Spark libraries.  If you’re not used to using Spark + Java, then I would not 
recommend starting with cTAKES.  I suggest you start by using cTAKES as a Maven 
dependency alongside the Spark Maven dependencies.

If you want to use pySpark, then you are in the business of using Java libs 
from Python, like in 
http://stackoverflow.com/questions/476968/using-a-java-library-from-python<https://na01.safelinks.protection.outlook.com/?url=http%3a%2f%2fstackoverflow.com%2fquestions%2f476968%2fusing-a-java-library-from-python&data=01%7c01%7cbdgeise%40geisinger.edu%7cb2d65ee9cb8f4b32d8df08d3c2e5284b%7c37d46c567c664402a16055c2313b910d%7c0&sdata=yVV3Va5%2ba9NPQM%2fBU2gVhl6NqdAwmbtCEc%2bgwix1WkA%3d>
 and there is nothing special about cTAKES.

cTAKES uses UIMA on the backend, and this can be extremely confusing to new 
users.  Maybe you should isolate your problems

1. Use Spark + Java libs
2. Use Python + Java libs
3. Learn cTAKES on it’s own turf.  Namely, Java

Apache projects notoriously have dependency problems, and Spark is no 
exception.  HA!  “Exception”-- I’m funny.  Anyway, don’t expect the two to play 
together nicely at first.

b

~~~~~
May All Your Sequences Converge

On Aug 12, 2016, at 10:05 AM, Bandeep Singh 
<[email protected]<mailto:[email protected]>> wrote:

Hi Team,

I am very new to cTAKES and just started learning how to use it.
I am wondering how to use cTakes API with SPARk (pyspark preferably) for Big 
data.
Can somebody point me in the right direction.

Till now I downloaded cTakes jars and tried building it with SPARK, but it 
threw me some resource allocation exception.

Any response will be highly appreciated.

Thanks,
Bandeep




IMPORTANT WARNING: The information in this message (and the documents attached 
to it, if any) is confidential and may be legally privileged. It is intended 
solely for the addressee. Access to this message by anyone else is 
unauthorized. If you are not the intended recipient, any disclosure, copying, 
distribution or any action taken, or omitted to be taken, in reliance on it is 
prohibited and may be unlawful. If you have received this message in error, 
please delete all electronic copies of this message (and the documents attached 
to it, if any), destroy any hard copies you may have created and notify me 
immediately by replying to this email. Thank you.

Geisinger Health System utilizes an encryption process to safeguard Protected 
Health Information and other confidential data contained in external e-mail 
messages. If email is encrypted, the recipient will receive an e-mail 
instructing them to sign on to the Geisinger Health System Secure E-mail 
Message Center to retrieve the encrypted e-mail.

Reply via email to