Re: SparkStreaming - CTakes - Cassandra ETL.

2014-12-11 Thread Mattmann, Chris A (3980)
Guys, have you sent this to d...@spark.apache.org? I’m sure they would love to hear how you guys are using Spark! ++ Chris Mattmann, Ph.D. Chief Architect Instrument Software and Science Data Systems Section (398) NASA Jet Propulsion

Re: Image to text conversion

2015-04-29 Thread Mattmann, Chris A (3980)
What about using Apache Tika within cTAKES for this? Tika supports OCR through Tesseract: http://wiki.apache.org/tika/TikaOCR Cheers, Chris

Re: Integration of Tika with cTAKES

2015-06-07 Thread Mattmann, Chris A (3980)
might be super useful. On Jun 6, 2015 8:19 PM, Mattmann, Chris A (3980) chris.a.mattm...@jpl.nasa.gov wrote: Hey cTAKES peeps! We went ahead and integrated Tika with cTAKES for a project I'm working on at JPL. It will be part of the 1.9 release of Tika. You can check it out here: https

Re: Paragraph Chunking in cTAKES

2015-09-23 Thread Mattmann, Chris A (3980)
+1 really interested in the reply to this :) ++ Chris Mattmann, Ph.D. Chief Architect Instrument Software and Science Data Systems Section (398) NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA Office: 168-519, Mailstop: 168-527

Re: [cancer-informatics] Fwd: BD2K Coordination Center Solicits Hackathon Proposals

2015-09-24 Thread Mattmann, Chris A (3980)
resting. Could you share some of your clinical use cases? On Sep 19, 2015 9:27 AM, "Mattmann, Chris A (3980)" <chris.a.mattm...@jpl.nasa.gov> wrote: Hey Folks, If anyone wants to discuss how JPL can contribute to a BD2K

Re: [cancer-informatics] Fwd: BD2K Coordination Center Solicits Hackathon Proposals

2015-09-19 Thread Mattmann, Chris A (3980)
Hey Folks, If anyone wants to discuss how JPL can contribute to a BD2K proposal or if someone is already leading it, I would definitely be interested in contributing. We have been combining Tika, cTAKES, and other software (Spark, OODT) into a solution we call “Shangridocs” which could be quite

Re: update on temporal relations

2016-06-24 Thread Mattmann, Chris A (3980)
Also, if folks are interested, I have had a lot of luck with GROBID in collaborative work with P. Lopez (http://github.com/kermit2/grobid/). Would be happy to talk more.

Re: How to use cTakes with SPARK

2016-08-12 Thread Mattmann, Chris A (3980)
Hi Bandeep, my team has done some work here, see:
Spark / cTAKES – Giuseppe Totaro https://github.com/giuseppetotaro/ctakes-clinical-pipeline
UIMA/DUCC/cTAKES – Yi-Wen Liu https://github.com/yiwenliuable/ctakes-scale-out-with-uima-ducc
UIMA/DUCC/cTAKES – Selina Chu