Hi,
Am 02.07.2020 um 00:37 schrieb Jeffrey Hill: > Sean, > I have an experimental integration of cTAKES and RUTA. I feel like it would > be a good “poster” presentation — not really at the “paper” level. Would > something like that be appropriate? I would be very interested in that. Is this integration already available somewhere? Best, Peter > P.S. Are you related to Tim Finan by any chance? > > > ________________________________ > From: Finan, Sean <sean.fi...@childrens.harvard.edu> > Sent: Monday, June 29, 2020 11:02:26 AM > To: dev@ctakes.apache.org <dev@ctakes.apache.org> > Subject: ApacheCon 2020 [Bulk] > > Hi all, > > > General admission to ApacheCon 2020 is free: > https://hopin.to/events/apachecon-home > > > I think that price of admission and travel costs have held back ctakes users > from attending past conferences, and lack of a sizable audience has > diminished the comparative value of ctakes presentations in the eyes of > ApacheCon planners. Because of the "at home" nature of this year's > conference, an app with smaller presence and less hip buzz has a better > chance of grabbing somtime on the schedule. > > > The predetermined tracks are still an ill fit when it comes to the nature of > ctakes. https://apachecon.com/acah2020/cfp.html > > However, I think that we can still use this opportunity to deliver some > powerful introduction and training videos, as well as user stories and > clinical project application. Perhaps we can argue for a NLP track and do > some coordination with projects like OpenNLP and UIMA. > > > There are a scant two weeks to come up with presentations, and less time to > propose a track/topic. The call for presentations ends July 13th. That is a > deadline that requires immediate attention by anybody who wants to show off > their project or expertise. > > > Apache wants to have a single point of contact for each project, and I am > volunteering to be that person for ctakes. I am volunteering, not laying > claim, so if you think that you are a better fit for the position please let > me know. > > > I have written some ideas for presentations below. If you want to take one > (modify as you like) then please write me and post to the devlist. If you > have ideas for another presentation topic, please let me and the devlist know > - even if you aren't volunteering to do the presentation yourself perhaps > somebody else will. Again ... two weeks. > > > Thank you, > > Sean > > > > * The following talk ideas are by and large directed toward training. That > does not mean that topics should stay within that scope. > > > ================================================================= > > > Customizing cTAKES: First Principles > > Built using Apache UIMA, cTAKES is modular and extensible. Why is it > frequently treated as a black box? Is it lack of need, sparsity of > resources, or simply fear of the unknown? > > This is a quick start tutorial on adding custom elements to cTAKES. We > illustrate creating simple classes to input, process and output data. This > involves a concise overview of Apache uimaFIT and the cTAKES type system, as > well as building a UIMA pipeline using piper files. > > > ================================================================= > > > Loading a shippable with cTAKES DockHand > > Customizing a simple pipeline need not be left to cTAKES experts. Making a > cTAKES installation need not be confined to source code checkouts or lengthy > multi-stage binary downloads. > > We introduce cTAKES DockHand, a compact single-file installation tool that > allows one to construct custom pipelines as well as local installations, Rest > Services and Dockerfiles. > > > ================================================================== > > > Secret Engines of cTAKES > > The cTAKES default natural language processing pipeline is a standard in the > clinical research community. What is past that standard? While the default > clinical pipeline uses almost 20 engines, there are dozens more in various > cTAKES modules. > > We present and discuss the top 10 annotation engines you never knew you had. > > > ==================================================== > > > Does cTAKES Know "The Best Words"? > > Named Entity Recognition is at the core of all complete natural language > processing tools. Out of the box cTAKES uses a dictionary containing part of > the Unified Medical Language System (UMLS) that covers most common clinical > terms. But it also comes with a custom dictionary creator. > > If you think that your clinical research is directed, then you should > probably have a directed dictionary. UMLS subsets, non-english dictionaries > and novel custom dictionaries have all been successfully used with cTAKES. > > This is an overview of cTAKES named entity recognition with the essential > what, why and how of custom dictionaries as the centerpiece. > > > ==================================================== > > Academic Software: Performance or Performance? > > A conundrum faced by all academic software projects is how to make the best > of a small amount of resources. Clinical natural language processing > projects that use cTAKES are not exempt, and balancing accuracy of results > against speed of processing often becomes central when it is time to put > things into production (or just please the boss). > > More than a history of cTAKES and its evolutionary efforts in precision, > speed and usability, this presentation contains examples of how to best > utilize each aspect. > > > ================================================================ > > > Diet cTAKES > > One reason cTAKES is a popular framework in clinical natural language > processing tools is its use of Apache Maven for project management. > Navigating cTAKES dependencies can be difficult, leading to a common practice > of consuming the whole project. Much of what ends up in your system may lead > to unnecessary bloat. > > Going piecemeal through the values and weights of cTAKES modules and > resources, this presentation will assist any cTAKES user in trimming project > bulk from gigabytes to megabytes. > > > ================================================================ > > > cTAKES Saved my Life > > The title is inappropriate when it comes to healthcare in practice. However, > I used Apache cTAKES for my clinical research project on ________, and its > [versatile / comprehensive / speedy / ?] nature was important in completing > things [on time / accurately / ?]. > > We share our real-world experiences with using cTAKES, discuss why we chose > it, issues we faced and how we overcame unexpected problems. > > > ================================================================ > > > Large-scale cTAKES, an Installation Story > > At our _____ facility, we needed to process _____ [patients / notes / term > lists / ?] on a ______ system. > > We present a real-world application of cTAKES on a large scale, our needs for > _____ input and ____ output. We compare and contrast cTAKES with other > [clinical] NLP platforms that we tried and explain why we chose [it / > another] in the end. > > We will also share the novel [techniques / code / integration] that we used > for the success of our installation. > > > ================================================================ > > > My Engine is Faster than Yours > > We have created a cTAKES annotation engine that performs the task of _____. > This is [newer / faster / more comprehensive] than existing engines in > [cTAKES / other]. > > We will present [numbers , usage , capabilities / i/o ] of the engine and its > [model / data ]. > We will also commit the code and documentation to Apache cTAKES. > > > ================================================================ > > > cTAKES on the Catwalk > > We have created a Machine Learning model that can be used in cTAKES for > ______. The model uses the third party ______ for [newer / faster / more > comprehensive] results. > > We will present the essentials of model creation as well as [numbers , usage > , capabilities / i/o ] of our model. We will also advocate for the third > party _____ and how we integrated it with cTAKES. > We will also commit the code [model] and documentation to Apache cTAKES. > > > > > > > > -- Dr. Peter Klügl Head of Text Mining/Machine Learning Averbis GmbH Salzstr. 15 79098 Freiburg Germany Fon: +49 761 708 394 0 Fax: +49 761 708 394 10 Email: peter.klu...@averbis.com Web: https://averbis.com Headquarters: Freiburg im Breisgau Register Court: Amtsgericht Freiburg im Breisgau, HRB 701080 Managing Directors: Dr. med. Philipp Daumke, Dr. Kornél Markó