Guys, have you sent this to d...@spark.apache.org? I’m sure they
would love to hear how you are using Spark!
++
Chris Mattmann, Ph.D.
Chief Architect
Instrument Software and Science Data Systems Section (398)
NASA Jet Propulsion
What about using Apache Tika within cTAKES for this? Tika supports
OCR through Tesseract:
http://wiki.apache.org/tika/TikaOCR
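For anyone who wants to try the OCR route quickly, here is a minimal sketch in Python. It assumes a Tika server is already running locally on port 9998 with Tesseract installed, and that the server honors the X-Tika-OCRLanguage request header. The URL, the function names, and the defaults are illustrative assumptions, not part of cTAKES or of any existing integration.

```python
import urllib.request

# Assumption: a local Tika server on port 9998 with Tesseract available.
# Adjust the URL for your own setup.
TIKA_URL = "http://localhost:9998/tika"


def build_ocr_request(image_bytes, content_type="image/png",
                      language="eng", url=TIKA_URL):
    """Build (but do not send) a PUT request asking Tika for the text of an image.

    Hypothetical helper for illustration only.
    """
    headers = {
        "Content-Type": content_type,    # tells Tika what kind of file this is
        "Accept": "text/plain",          # ask for plain extracted text back
        "X-Tika-OCRLanguage": language,  # Tesseract language code (assumed header)
    }
    return urllib.request.Request(url, data=image_bytes,
                                  headers=headers, method="PUT")


def extract_text(image_bytes, **kwargs):
    """Send the request and return the extracted text (needs a live server)."""
    request = build_ocr_request(image_bytes, **kwargs)
    with urllib.request.urlopen(request) as response:
        return response.read().decode("utf-8")
```

With a server running, something like extract_text(open("scan.png", "rb").read()) should return the recognized text, which could then be handed to a cTAKES pipeline.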
Cheers,
Chris
might be super useful.
On Jun 6, 2015 8:19 PM, Mattmann, Chris A (3980)
chris.a.mattm...@jpl.nasa.gov wrote:
Hey cTAKES peeps!
We went ahead and integrated Tika with cTAKES for a project I'm
working on at JPL. It will be part of the 1.9 release of Tika. You
can check it out here:
https
+1 really interested in the reply to this :)
++
Chris Mattmann, Ph.D.
Chief Architect
Instrument Software and Science Data Systems Section (398)
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 168-519, Mailstop: 168-527
>Interesting. Could you share some of your clinical use
>cases?
>
>On Sep 19, 2015 9:27 AM, "Mattmann, Chris A (3980)" <
>chris.a.mattm...@jpl.nasa.gov> wrote:
>>
>> Hey Folks,
>>
>> If anyone wants to discuss how JPL can contribute to a BD2K
Hey Folks,
If anyone wants to discuss how JPL can contribute to a BD2K
proposal or if someone is already leading it, I would definitely
be interested in contributing. We have been combining Tika, cTAKES,
and other software (Spark, OODT) into a solution we call “Shangridocs”
which could be quite
Also, if folks are interested, I have had a lot of luck with GROBID
in collaborative work with P. Lopez (http://github.com/kermit2/grobid/).
I would be happy to talk more.
Hi Bandeep, my team has done some work here, see:
Spark / cTAKES – Giuseppe Totaro
https://github.com/giuseppetotaro/ctakes-clinical-pipeline
UIMA/DUCC/cTAKES – Yi-Wen Liu
https://github.com/yiwenliuable/ctakes-scale-out-with-uima-ducc
UIMA/DUCC/cTAKES – Selina Chu