Hello all, I have a working application that essentially implements the CPE within a spark context. The best part about this is that it does not use UIMAFit or any 3rd party applications. It simply uses hadoop, spark, UIMA, and OpenNLP.
Users are able to configure, design, and build the UIMA pipeline using all of the eclipse XML plugin applications. Instead of running the application via the CPE.process() driver from a main class, it will run from the foreach() function on the Dataframe<Row> object. Oh also, it plugs into a database to get the text and to write results. Would the UIMA community be interested in getting a working example put together? If so, please feel free to contact me. I think this could be an excellent example of what people would like to use and your examples are particularly good. Thanks, ~Ben
