UIMA on Spark mimicking CPE pipelines

Benedict Holland Tue, 26 Sep 2017 12:02:47 -0700

Hello all,

I have a working application that essentially implements the CPE within a
Spark context. The best part about this is that it does not use uimaFIT or
any other third-party libraries. It simply uses Hadoop, Spark, UIMA, and
OpenNLP.


Users can configure, design, and build the UIMA pipeline using all of the
Eclipse XML descriptor plugins. Instead of being driven by CPE.process()
from a main class, the pipeline runs from the foreach() function on the
Dataset<Row> (DataFrame) object.
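
To make the idea concrete, here is a minimal sketch in Java. It is not the
application described in this message. It only shows one way to build an
engine from an Eclipse-edited descriptor and drive it from Spark instead
of from a CPE. The class name UimaPartitionProcessor, the "text" column,
the JDBC source, and the command-line arguments are all assumptions made
for illustration. It also uses foreachPartition() rather than foreach(),
so that the engine is created once per partition and not once per row.

```java
import java.util.Iterator;

import org.apache.spark.api.java.function.ForeachPartitionFunction;
import org.apache.spark.sql.Dataset;
import org.apache.spark.sql.Row;
import org.apache.spark.sql.SparkSession;
import org.apache.uima.UIMAFramework;
import org.apache.uima.analysis_engine.AnalysisEngine;
import org.apache.uima.analysis_engine.AnalysisEngineDescription;
import org.apache.uima.jcas.JCas;
import org.apache.uima.util.XMLInputSource;

/** Hypothetical helper: runs a descriptor-defined UIMA pipeline per partition. */
public class UimaPartitionProcessor implements ForeachPartitionFunction<Row> {
    private static final long serialVersionUID = 1L;

    // Assumption: the descriptor file is readable at this path on every executor.
    private final String descriptorPath;

    public UimaPartitionProcessor(String descriptorPath) {
        this.descriptorPath = descriptorPath;
    }

    @Override
    public void call(Iterator<Row> rows) throws Exception {
        // An AnalysisEngine is not serializable, so it is built here on the
        // executor instead of being shipped from the driver.
        AnalysisEngineDescription description = UIMAFramework.getXMLParser()
                .parseAnalysisEngineDescription(new XMLInputSource(descriptorPath));
        AnalysisEngine engine = UIMAFramework.produceAnalysisEngine(description);
        JCas jcas = engine.newJCas();
        try {
            while (rows.hasNext()) {
                Row row = rows.next();
                jcas.reset(); // reuse one CAS for every document in the partition
                // Assumption: the input column holding the document is named "text".
                jcas.setDocumentText(row.getString(row.fieldIndex("text")));
                engine.process(jcas);
                // Annotations are now in jcas; writing them back to the
                // database would go here (or in a CAS consumer in the descriptor).
            }
            engine.collectionProcessComplete();
        } finally {
            engine.destroy();
        }
    }

    public static void main(String[] args) {
        // Hypothetical arguments: JDBC URL, table name, descriptor path.
        SparkSession spark = SparkSession.builder().appName("uima-on-spark").getOrCreate();
        Dataset<Row> documents = spark.read()
                .format("jdbc")
                .option("url", args[0])
                .option("dbtable", args[1])
                .load();
        documents.foreachPartition(new UimaPartitionProcessor(args[2]));
        spark.stop();
    }
}
```

Passing only the descriptor path to the executors keeps the closure
serializable, and reusing a single JCas per partition avoids allocating a
new CAS for every row.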

Oh also, it plugs into a database to get the text and to write results.

Would the UIMA community be interested in getting a working example put
together? If so, please feel free to contact me. I think this could be an
excellent example of what people would like to use, and your existing
examples are particularly good.

Thanks,
~Ben
