Hi John, There have been a couple of presentations at the recent ApacheCon: https://www.apachecon.com/acah2020/tracks/ctakes.html
Cheers, Gundolf. From: John Doe <lucanus...@gmail.com> Reply-To: "user@ctakes.apache.org" <user@ctakes.apache.org> Date: Tuesday, November 17, 2020 at 08:48 To: "user@ctakes.apache.org" <user@ctakes.apache.org> Subject: Scaling out cTAKES Hello, I'm new to cTAKES and was wondering what the options are for scaling out the default clinical pipeline. I'm running it on a large number of clinical notes using runClinicalPipeline.bat and specifying the input directory with the notes. What are the best options for doing this in a more scalable way? For example, can I parallelize it with UIMA-AS? Or should I manually use multiple command prompts to run the clinical pipeline on a different set of clinical notes in parallel? I'm not sure if there is any build-in solution or community resource which uses EMR/Spark or some other method to achieve this. Thank you for your help.