Guys have you sent this to [email protected]? I’m sure they would love to hear how you guys are using Spark!
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ Chris Mattmann, Ph.D. Chief Architect Instrument Software and Science Data Systems Section (398) NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA Office: 168-519, Mailstop: 168-527 Email: [email protected] WWW: http://sunset.usc.edu/~mattmann/ ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ Adjunct Associate Professor, Computer Science Department University of Southern California, Los Angeles, CA 90089 USA ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ -----Original Message----- From: Pei Chen <[email protected]> Reply-To: "[email protected]" <[email protected]> Date: Thursday, December 11, 2014 at 9:16 AM To: "[email protected]" <[email protected]> Subject: Re: SparkStreaming -> CTakes -> Cassandra ETL. >Jay, >This is very cool. >Let's plan on demo'ing this for the next ApacheCon... >--Pei > >On Wed, Dec 10, 2014 at 7:26 PM, jay vyas ><[email protected]> wrote: > >Hi folks.. Just an FYI for those interested in running CTakes in a >BigData context. > > >Ive been working on using CTakes inside Apache BigTop, so that you can do >big data stuff with the CTakes API. > > >I rewrote the CTakes spark streaming demo here: > > >https://github.com/jayunit100/SparkStreamingCassandraDemo/tree/master/src > > > >It exemplifies how to stream data using spark, from twitter, and then >process it with CTakes, as well as how to Ultimately forward the results >into Cassandra as well. > > >Its a work in progress, but feel free to grab it as a template if looking >to integrate all these APIs. > > >ill brush of my SVN credentials and commit it to directly to CTakes as an >update to the streaming example in sandbox/ that is already there. > > >https://svn.apache.org/repos/asf/ctakes/sandbox/ctakes-spark-streaming-twi >tter/ > > > >-- >jay vyas > > > > > > > > > >
