Hi Benedict,

I'd be very interested to see an example of this, as we've been playing with the very same idea but haven't gotten to any actual trial (and error) yet.
Many thanks in advance,
benjamin

Benjamin De Boe
Product Manager | InterSystems
T: +32 2 464 97 33 | M: +32 495 19 19 27
http://www.intersystems.com/

-----Original Message-----
From: Benedict Holland [mailto:[email protected]]
Sent: Tuesday, September 26, 2017 9:02 PM
To: [email protected]
Subject: UIMA on Spark mimicking CPE pipelines

Hello all,

I have a working application that essentially implements the CPE within a Spark context. The best part about this is that it does not use uimaFIT or any 3rd party applications. It simply uses Hadoop, Spark, UIMA, and OpenNLP. Users are able to configure, design, and build the UIMA pipeline using all of the Eclipse XML plugin applications. Instead of running the application via the CPE.process() driver from a main class, it will run from the foreach() function on the Dataframe<Row> object. Oh also, it plugs into a database to get the text and to write results.

Would the UIMA community be interested in getting a working example put together? If so, please feel free to contact me. I think this could be an excellent example of what people would like to use and your examples are particularly good.

Thanks,
~Ben
